Popular repositories
- flink-crawler Public
Continuous scalable web crawler built on top of Flink and crawler-commons
- cascading.avro Public Forked from clizzin/cascading.avro
Cascading Scheme for the Apache Avro data serialization format
Repositories
Showing 10 of 32 repositories
- pinot Public Forked from apache/pinot
Apache Pinot (Incubating) - A realtime distributed OLAP datastore
- flink-crawler-ccdemo Public
Demo of using flink-crawler to extract pages from Common Crawl for a target language
- cascading Public Forked from Cascading/cascading
Cascading is a feature-rich API for defining and executing complex, fault-tolerant data processing workflows on various cluster computing platforms. Please see https://github.com/cwensel/cascading for access to all WIP branches.
- fastText Public Forked from facebookresearch/fastText
Library for fast text representation and classification.