Continuous real-time data replication and integration
Hadoop Distributed File System (HDFS)
HDFS is the primary data storage system used by Hadoop applications.
HVR support for HDFS
Files can be captured and copied or moved to a different location. CSV and XML files can be processed for a table target. As a target, HVR can write files in multiple formats including Parquet, JSON, Avro, CSV or XML with many options to fine tune the format and define compression. Compare is supported through Hive external tables, or directly by reading/parsing the files.