This project provides extensions for kafka-connect-hdfs project.
The project provides an additional HDFS connector format: ru.dgis.casino.plain.GzipTextFormat
.
GzipTextFormat
writes each record value as String
separated with \n
and also performs compression via GZIPOutputStream
.
Here is an example of a connector config:
connector.class=io.confluent.connect.hdfs.HdfsSinkConnector
format.class=ru.dgis.casino.plain.GzipTextFormat
partitioner.class=io.confluent.connect.hdfs.partitioner.TimeBasedPartitioner
path.format=YYYY/MM/dd
name=SOME_NAME_HERE
topics=TOPIC
hdfs.url=hdfs://YOUR_HADOOP
logs.dir=LOGS_DIR
topics.dir=TOPICS_DIR
flush.size=100000
locale=ru_RU
timezone=Asia/Novosibirsk