spark-relational

Document-oriented to relational data conversion in Spark

@am-giordano

spark-relational is a Spark package for converting a DataFrame with StructType and ArrayType columns into a set of DataFrames with flat columns interrelated by foreign keys.
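
To make the idea of the conversion concrete, here is a minimal sketch in plain Python. It does not use spark-relational or Spark: this page does not show the package's API, so lists of dicts stand in for DataFrames, nested dicts for StructType columns, and lists for ArrayType columns. The function name `to_relational`, the surrogate `id` column, and the `<parent>_id` foreign-key naming are all assumptions made for illustration, and the package may name or structure its output differently.

```python
from typing import Any, Dict, List

Row = Dict[str, Any]


def to_relational(docs: List[Row], root: str = "root") -> Dict[str, List[Row]]:
    """Split nested documents into flat tables linked by foreign keys.

    Hypothetical helper for illustration only, not the spark-relational API.
    """
    tables: Dict[str, List[Row]] = {}

    def visit(doc: Row, table: str, parent=None) -> None:
        rows = tables.setdefault(table, [])
        row: Row = {"id": len(rows)}  # surrogate primary key (assumed naming)
        if parent is not None:
            parent_table, parent_id = parent
            row[parent_table + "_id"] = parent_id  # foreign key to the parent row
        rows.append(row)
        flatten(doc, row, table, prefix="")

    def flatten(doc: Row, row: Row, table: str, prefix: str) -> None:
        for key, value in doc.items():
            name = prefix + key
            if isinstance(value, dict):
                # Struct-like value: inline its fields as prefixed flat columns.
                flatten(value, row, table, prefix=name + "_")
            elif isinstance(value, list):
                # Array-like value: each element becomes a row in a child table.
                for item in value:
                    child = item if isinstance(item, dict) else {"value": item}
                    visit(child, table + "_" + name, parent=(table, row["id"]))
            else:
                row[name] = value

    for doc in docs:
        visit(doc, root)
    return tables


docs = [
    {
        "name": "Ada",
        "address": {"city": "London", "zip": "N1"},
        "orders": [{"sku": "A1", "qty": 2}, {"sku": "B2", "qty": 1}],
    },
    {"name": "Bob", "address": {"city": "Rome", "zip": "00100"}, "orders": []},
]

tables = to_relational(docs, root="customer")
for table_name, rows in tables.items():
    print(table_name, rows)
```

In this sketch the `customer` table holds only flat columns (`name`, `address_city`, `address_zip`), while each element of `orders` becomes a row in `customer_orders` that points back to its parent through `customer_id`.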




How to

Include this package in your Spark Applications using:

spark-shell, pyspark, or spark-submit

> $SPARK_HOME/bin/spark-shell --packages am-giordano:spark-relational:0.3.1

sbt

If you use the sbt-spark-package plugin, add the following to your sbt build file:

spDependencies += "am-giordano/spark-relational:0.3.1"

Otherwise, add the Spark Packages resolver and the dependency directly:

resolvers += "Spark Packages Repo" at "https://repos.spark-packages.org/"

libraryDependencies += "am-giordano" % "spark-relational" % "0.3.1"

Maven

In your pom.xml, add:
<dependencies>
  <!-- list of dependencies -->
  <dependency>
    <groupId>am-giordano</groupId>
    <artifactId>spark-relational</artifactId>
    <version>0.3.1</version>
  </dependency>
</dependencies>
<repositories>
  <!-- list of other repositories -->
  <repository>
    <id>SparkPackagesRepo</id>
    <url>https://repos.spark-packages.org/</url>
  </repository>
</repositories>

Releases

Version: 0.3.1 ( d466ec | zip | jar ) / Date: 2022-08-18 / License: Apache-2.0 / Scala version: 2.12

Version: 0.3.0 ( 1714a8 | zip | jar ) / Date: 2022-08-05 / License: Apache-2.0 / Scala version: 2.12

Version: 0.2.1 ( db6fff | zip | jar ) / Date: 2022-07-31 / License: Apache-2.0 / Scala version: 2.12

Version: 0.2.0 ( f30c09 | zip | jar ) / Date: 2022-07-31 / License: Apache-2.0 / Scala version: 2.12

Version: 0.1.1 ( edfb42 | zip | jar ) / Date: 2022-07-29 / License: Apache-2.0 / Scala version: 2.12

Version: 0.1.0 ( 30c0b3 | zip ) / Date: 2022-07-25 / License: Apache-2.0