apache / spark
Apache Spark - A unified analytics engine for large-scale data processing
See what the GitHub community is most excited about this month.
Apache Spark - A unified analytics engine for large-scale data processing
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Source code for Twitter's Recommendation Algorithm
Removes large or troublesome blobs like git-filter-branch does, but faster. And written in Scala
Rocket Chip Generator
♞ lichess.org: the forever free, adless and open source chess server ♞
The Scala 3 compiler, also known as Dotty.
Spark: The Definitive Guide's Code Repository
ZIO — A type-safe, composable library for async and concurrent programming in Scala
State of the Art Natural Language Processing
Chisel: A Modern Hardware Design Language
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
Solutions to exercises from the book "Scala for the Impacient" by Cay S. Horstmann.
The repository for the free Scala at Light Speed mini-course
Open, Modular, Deep Learning Accelerator
Scala language server with rich IDE features 🚀
Lift Java Transaction API integration
works
Spark RAPIDS plugin - accelerate Apache Spark with GPUs
In-memory dimensional time series database.
An open protocol for secure data sharing
Network components (NIC, Switch) for FireBox
Open-source high-performance RISC-V processor