Uploaded image for project: 'Teiid'
  1. Teiid
  2. TEIID-3442

Apache Spark support via SparkSQL and DataFrames

XMLWordPrintable

    • High

      Eliciting comments for Apache Spark support. With the release of Panda's like DataFrames, it is a little more feasible to directly translate to SparkSQL:

      https://spark.apache.org/docs/latest/sql-programming-guide.html

      Options in order of complexity:
      1. Use the existing Hive connector / translator. Spark still uses the Hive metastore.
      2. Thrift JDBC driver. This is what Microstrategy, Tableau, QlikView and others use, most rudimentary API for accessing Spark.
      3. Native SparkSQL via building Spark jobs and submitting them to a running Spark driver.

              kylinsoong.1214@gmail.com Kylin Soong (Inactive)
              blue666man_jira John Muller (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

                Created:
                Updated:
                Resolved:

                  Estimated:
                  Original Estimate - 20 weeks
                  20w
                  Remaining:
                  Remaining Estimate - 20 weeks
                  20w
                  Logged:
                  Time Spent - Not Specified
                  Not Specified