Apache Software Foundation
The Apache Software Foundation (Wikipedia) is a decentralized open source community of developers. The software they produce is distributed under the terms of the Apache License, a permissive open-source license for free and open-source software (FOSS). The Apache projects are characterized by a collaborative, consensus-based development process and an open and pragmatic software license, which is to say that it allows developers, who receive the software freely, to redistribute it under non-free terms. Each project is managed by a self-selected team of technical experts who are active contributors to the project.
Apache AGEโ
Apache AGE is a
PostgreSQL
extension that provides graph database functionality.AGE
is an acronym forA Graph Extension
, and is inspired by Bitnineโs fork ofPostgreSQL 10
,AgensGraph
, which is a multimodal database. The goal of the project is to create single storage that can handle both relational and graph model data so that users can use standard ANSI SQL along withopenCypher
, the Graph query language. The data elementsApache AGE
stores are nodes, edges connecting them, and attributes of nodes and edges.
See more about integrating with Apache AGE.
Apache Cassandraโ
Apache Cassandra is a NoSQL, row-oriented, highly scalable and highly available database. Starting with version 5.0, the database ships with vector search capabilities.
See more about integrating with Apache Cassandra.
Apache Dorisโ
Apache Doris is a modern data warehouse for real-time analytics. It delivers lightning-fast analytics on real-time data at scale.
Usually
Apache Doris
is categorized into OLAP, and it has showed excellent performance in ClickBench โ a Benchmark For Analytical DBMS. Since it has a super-fast vectorized execution engine, it could also be used as a fast vectordb.
See more about integrating with Apache Doris.
Apache Kafkaโ
Apache Kafka is a distributed messaging system that is used to publish and subscribe to streams of records.
See more about integrating with Apache Kafka.
Apache Sparkโ
Apache Spark is a unified analytics engine for large-scale data processing. It provides high-level APIs in Scala, Java, Python, and R, and an optimized engine that supports general computation graphs for data analysis. It also supports a rich set of higher-level tools including
Spark SQL
for SQL and DataFrames,pandas API on Spark
for pandas workloads,MLlib
for machine learning,GraphX
for graph processing, andStructured Streaming
for stream processing.
See more about integrating with Apache Spark.