Skip to content
Change the repository type filter

All

    Repositories list

    • livy

      Public
      Scala
      Apache License 2.0
      2000Updated Mar 4, 2025Mar 4, 2025
    • hadoop

      Public
      Apache Hadoop
      Java
      Apache License 2.0
      9k000Updated Mar 4, 2025Mar 4, 2025
    • spark3

      Public
      Apache Spark - A unified analytics engine for large-scale data processing
      Scala
      Apache License 2.0
      29k001Updated Mar 4, 2025Mar 4, 2025
    • nifi

      Public
      Apache NiFi
      Java
      Apache License 2.0
      2.8k000Updated Mar 3, 2025Mar 3, 2025
    • hudi

      Public
      Upserts, Deletes And Incremental Processing on Big Data.
      Java
      Apache License 2.0
      2.5k000Updated Mar 1, 2025Mar 1, 2025
    • iceberg

      Public
      Apache Iceberg
      Java
      Apache License 2.0
      2.4k000Updated Feb 28, 2025Feb 28, 2025
    • zookeeper

      Public
      Apache ZooKeeper
      Java
      Apache License 2.0
      7.3k000Updated Feb 28, 2025Feb 28, 2025
    • druid

      Public
      Apache Druid: a high performance real-time analytics database.
      Java
      Apache License 2.0
      3.7k000Updated Feb 28, 2025Feb 28, 2025
    • oozie

      Public
      Mirror of Apache Oozie
      Java
      Apache License 2.0
      473000Updated Feb 28, 2025Feb 28, 2025
    • trino

      Public
      Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
      Java
      Apache License 2.0
      3.1k000Updated Feb 27, 2025Feb 27, 2025
    • pinot

      Public
      Apache Pinot - A realtime distributed OLAP datastore
      Java
      Apache License 2.0
      1.3k000Updated Feb 27, 2025Feb 27, 2025
    • tez

      Public
      Apache Tez
      Java
      Apache License 2.0
      428000Updated Feb 27, 2025Feb 27, 2025
    • ce-utils

      Public
      Utility script requests as per user requests
      Shell
      0000Updated Feb 27, 2025Feb 27, 2025
    • ranger

      Public
      Mirror of Apache Ranger
      Java
      Apache License 2.0
      1k000Updated Feb 25, 2025Feb 25, 2025
    • hive

      Public
      Apache Hive
      Java
      Apache License 2.0
      4.7k000Updated Feb 25, 2025Feb 25, 2025
    • kudu

      Public
      Mirror of Apache Kudu
      C++
      Apache License 2.0
      653000Updated Feb 25, 2025Feb 25, 2025
    • flink

      Public
      Apache Flink
      Java
      Apache License 2.0
      14k000Updated Feb 25, 2025Feb 25, 2025
    • sqoop

      Public
      Mirror of Apache Sqoop
      Java
      Apache License 2.0
      586000Updated Feb 25, 2025Feb 25, 2025
    • Java
      BSD 2-Clause "Simplified" License
      0000Updated Feb 25, 2025Feb 25, 2025
    • Cruise-control is the first of its kind to fully automate the dynamic workload rebalance and self-healing of a Kafka cluster. It provides great value to Kafka users by simplifying the operation of Kafka clusters.
      Java
      BSD 2-Clause "Simplified" License
      610000Updated Feb 25, 2025Feb 25, 2025
    • ozone

      Public
      Scalable, redundant, and distributed object store for Apache Hadoop
      Java
      Apache License 2.0
      524000Updated Feb 25, 2025Feb 25, 2025
    • knox

      Public
      Mirror of Apache Knox
      Java
      Apache License 2.0
      254000Updated Feb 25, 2025Feb 25, 2025
    • zeppelin

      Public
      Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
      Java
      Apache License 2.0
      2.8k000Updated Feb 25, 2025Feb 25, 2025
    • kafka3

      Public
      Java
      Apache License 2.0
      3000Updated Feb 25, 2025Feb 25, 2025
    • kafka

      Public
      Mirror of Apache Kafka
      Java
      Apache License 2.0
      14k000Updated Feb 25, 2025Feb 25, 2025
    • phoenix

      Public
      Mirror of Apache Phoenix
      Java
      Apache License 2.0
      1k000Updated Feb 25, 2025Feb 25, 2025
    • hbase

      Public
      Apache HBase
      Java
      Apache License 2.0
      3.3k000Updated Feb 25, 2025Feb 25, 2025
    • Multi-user server for Jupyter notebooks
      Python
      Other
      2k000Updated Feb 24, 2025Feb 24, 2025
    • TPC-DS Kit for Impala
      Smarty
      Apache License 2.0
      156000Updated Feb 19, 2025Feb 19, 2025
    • impala

      Public
      Apache Impala
      C++
      Apache License 2.0
      519001Updated Feb 13, 2025Feb 13, 2025