Backlog¶
Iteration +0¶
Folksonomy – define categories
File formats
Filesystems (local vs. remote vs. cloud storage)
Databases
Data warehouses
Streams
Services (System vs. Cloud/SaaS?)
Open table formats
Misc
Cloud storage
Pipeline frameworks
Metrics, telemetry, logs
Iteration +1¶
Documentation: Rework section “supported-sources”
Documentation: Advertise OCI image
Code layout: Refactor
omniload.srcintoomniload.x.{category}Tutorial “CSV to Elasticsearch” https://github.com/bruin-data/ingestr/issues/487
Tutorial “Parquet to CrateDB”
Tutorial “HDFS to CrateDB”
Iteration +2¶
Emphasize and improve file-based formats (CSV, JSON, Parquet, BSON, etc.)
Refurbish
example-urissubcommand
Iteration +3¶
Add scheduling unit
Add notification unit
Validate support for CrateDB https://github.com/bruin-data/ingestr/pull/284
Support streaming sources https://github.com/bruin-data/ingestr/issues/282