ClickHouse® is a real-time analytics DBMS
-
Updated
Jun 8, 2024 - C++
ClickHouse® is a real-time analytics DBMS
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
An open source time-series database for fast ingest and SQL queries
Seamless multi-master syncing database with an intuitive HTTP/JSON API, designed for reliability
YTsaurus is a scalable and fault-tolerant open-source big data platform.
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
SageWorks: An easy to use Python API for creating and deploying AWS SageMaker Models
Scalable, redundant, and distributed object store for Apache Hadoop
A High-Performance Data Science Toolkit for the Earth Sciences
北京交通大学计算机与信息技术学院系统与网络实验室 https://fangvv.github.io/Homepage/
Cloud-native search engine for observability. An open-source alternative to Datadog, Elasticsearch, Loki, and Tempo.
Apache DataFusion SQL Query Engine
A collection of my data science journey - projects, code, and notes.
BlockMesh, is an innovative, open and secure network that allows you to easily monetize your excess bandwidth. Giving you a great opportunity to passively profit and participate in the frontline of AI data layer, online privacy, open source and blockchain industries.
The Open Source Feature Store for Machine Learning
AI + Data, online. https://vespa.ai
Add a description, image, and links to the big-data topic page so that developers can more easily learn about it.
To associate your repository with the big-data topic, visit your repo's landing page and select "manage topics."