About 37,100 results
Open links in new tab
  1. Apache Spark™ - Unified Engine for large-scale data analytics

    Apache Spark is a multi-language engine for executing data engineering, data science, and machine learning on single-node machines or clusters.

  2. Overview - Spark 4.1.0 Documentation

    Apache Spark is a unified analytics engine for large-scale data processing. It provides high-level APIs in Java, Scala, Python and R, and an optimized engine that supports general execution …

  3. Downloads - Apache Spark

    Download Spark: spark-4.1.1-bin-hadoop3.tgz Verify this release using the 4.1.1 signatures, checksums and project release KEYS by following these procedures. Note that Spark 4 is pre …

  4. Quick Start - Spark 4.1.0 Documentation

    Quick Start Interactive Analysis with the Spark Shell Basics More on Dataset Operations Caching Self-Contained Applications Where to Go from Here This tutorial provides a quick introduction …

  5. Documentation | Apache Spark

    Hands-On Exercises Hands-on exercises from Spark Summit 2014. These let you install Spark on your laptop and learn basic concepts, Spark SQL, Spark Streaming, GraphX and MLlib. Hands …

  6. Examples - Apache Spark

    Apache Spark ™ examples This page shows you how to use different Apache Spark APIs with simple examples. Spark is a great engine for small and large datasets. It can be used with …

  7. Spark SQL & DataFrames | Apache Spark

    Spark SQL includes a cost-based optimizer, columnar storage and code generation to make queries fast. At the same time, it scales to thousands of nodes and multi hour queries using …

  8. PySpark Overview — PySpark 4.1.0 documentation - Apache Spark

    Dec 11, 2025 · PySpark Overview # Date: Dec 11, 2025 Version: 4.1.0 Useful links: Live Notebook | GitHub | Issues | Examples | Community | Stack Overflow | Dev Mailing List | User …

  9. Spark SQL and DataFrames - Spark 4.1.0 Documentation

    Spark SQL, DataFrames and Datasets Guide Spark SQL is a Spark module for structured data processing. Unlike the basic Spark RDD API, the interfaces provided by Spark SQL provide …

  10. Getting Started — PySpark 4.1.0 documentation - Apache Spark

    Getting Started # This page summarizes the basic steps required to setup and get started with PySpark. There are more guides shared with other languages such as Quick Start in …