{% if page.description %} {% endif %} {% if page.redirect %} {% endif %} {% production %} {% endproduction %} {% if page.url == "/" %}

Apache Spark - A unified engine for large-scale data analytics

Apache Spark is a unified analytics engine for large-scale data processing. It provides high-level APIs in Java, Scala, Python and R, and an optimized engine that supports general execution graphs. It also supports a rich set of higher-level tools including Spark SQL for SQL and structured data processing, pandas API on Spark for pandas workloads, MLlib for machine learning, GraphX for graph processing, and Structured Streaming for incremental computation and stream processing.
{% endif %}
{% if page.url contains "/ml" or page.url contains "/sql" or page.url contains "migration-guide.html" %} {% if page.url contains "migration-guide.html" %} {% include nav-left-wrapper-migration.html nav-migration=site.data.menu-migration %} {% elsif page.url contains "/ml" %} {% include nav-left-wrapper-ml.html nav-mllib=site.data.menu-mllib nav-ml=site.data.menu-ml %} {% else %} {% include nav-left-wrapper-sql.html nav-sql=site.data.menu-sql %} {% endif %}
{% if page.displayTitle %}

{{ page.displayTitle }}

{% else %}

{{ page.title }}

{% endif %} {{ content }}
{% else %}
{% if page.url != "/" %} {% if page.displayTitle %}

{{ page.displayTitle }}

{% else %}

{{ page.title }}

{% endif %} {% endif %} {{ content }}
{% endif %}