Data Ingestion
6 building blocks and models in the data ingestion category.
Batch Data Source
Historical data from databases, data lakes, or files
Streaming Source
Real-time data from Kafka, Kinesis, or event streams
API Endpoint
REST or GraphQL API for data ingestion
File Upload
Upload files (CSV, JSON, Parquet, images)
Webhook
Receive data via HTTP webhooks
Web Scraper
Extract structured data from websites for ML training and RAG pipelines