Do I have to set up HDFS in order to use streamX? #60

I noticed that I have to edit Hadoop config files such as core-site.xml and hdfs-site.xml to configure S3. I also could not find the mentioned config/hadoop-conf directory in my installation (Kafka 0.10.2.0). So do I have to set up HDFS in order to use streamX?

What I am trying to do is transform messages in JSON format to Parquet and then store them in S3.

Spark could do this, but it would require a long-running cluster. Alternatively, I could use checkpoints to run the ETL on a per-day basis.
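For reference, this is roughly the kind of core-site.xml I understand the S3 setup to need. It is a minimal sketch in Python that generates the file contents, assuming the standard Hadoop S3A property names (fs.s3a.impl, fs.s3a.access.key, fs.s3a.secret.key). The helper name build_core_site and the key values are placeholders of mine, and I have not confirmed which properties streamX itself reads.

```python
import xml.etree.ElementTree as ET


def build_core_site(properties):
    """Return Hadoop-style configuration XML for the given name/value pairs.

    build_core_site is a hypothetical helper for illustration, not part of
    streamX or Hadoop.
    """
    root = ET.Element("configuration")
    for name, value in properties.items():
        prop = ET.SubElement(root, "property")
        ET.SubElement(prop, "name").text = name
        ET.SubElement(prop, "value").text = value
    return ET.tostring(root, encoding="unicode")


# Standard Hadoop S3A property names; the credential values are placeholders.
s3a_properties = {
    "fs.s3a.impl": "org.apache.hadoop.fs.s3a.S3AFileSystem",
    "fs.s3a.access.key": "YOUR_ACCESS_KEY",
    "fs.s3a.secret.key": "YOUR_SECRET_KEY",
}

core_site_xml = build_core_site(s3a_properties)
print(core_site_xml)
```

A file like this only configures the Hadoop client libraries. Whether streamX needs anything beyond that, such as a running HDFS cluster, is what I am asking.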

