-
Notifications
You must be signed in to change notification settings - Fork 57
Data Sources and Targets
VDK integrates Data Sources and Targets via Plugins. Versatile Data Kit supports querying multiple databases using its Plugin framework.
Warning: This plugin is considered still pre-alpha plugin. A lot of changes or even being discarded are possible.
The data-sources project is a plugin for the Versatile Data Kit (VDK). It aims to simplify data ingestion from multiple sources by offering a single, unified API. Whether you're dealing with databases, REST APIs, or other forms of data, this project allows you to manage them all consistently. This is crucial for building scalable and maintainable data pipelines.
A relevant integration for data sources is vdk-singer
which provides integration with singer.io and provides connectors for a lot of APIs. It provides a separate command for listing the available sources with command vdk singer --list
Taps extract data from any source and write it to a standard stream in a JSON-based format:
- Amazon S3 CSV
- LinkedIn/Google/Facebook Ads
- Ebay
- GitHub
- Google Analytics
- Google Sheets
- Jira
- MySQL
- ORACLE
- PostgreSQL
- SFTP
- Salesforce
- Slack
- Trello
- Zoom
- ...
- File
- HTTP
- Huggingface
- PostgreSQL
- Impala
- Trino
- Greenplum
- SQLite
- Oracle
- DuckDB
- ...
You can see all supported plugins here.
Open a new ticket if your source/target is missing.
You can always develop your own plugins and contribute them to our repository so other people can reuse your code.
➡️ Next Section: Properties
SDK - Develop Data Jobs
SDK Key Concepts
Control Service - Deploy Data Jobs
Control Service Key Concepts
- Scheduling a Data Job for automatic execution
- Deployment
- Execution
- Production
- Properties and Secrets
Operations UI
Community
Contacts