Spark SQL random things - ignacio-alorre/Spark GitHub Wiki
These notes need to be split up and tidied later.
The filter and related column expressions we were using:
.withColumn("PERSONAL_DATA", when($"DATA_TYPE".equalTo("StringType"), potentialPD(col("name"))))
The same condition can also be written with === in place of equalTo.
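A minimal sketch of the two spellings side by side, to check that they behave the same. The notes do not show how potentialPD is defined, so the UDF below is a hypothetical stand-in (it just flags column names containing "name"); the sample rows and the object name are also assumptions. col("DATA_TYPE") is used instead of $"DATA_TYPE" so the expressions can live outside a SparkSession scope; with spark.implicits._ imported the two are interchangeable.

```scala
import org.apache.spark.sql.{Column, SparkSession}
import org.apache.spark.sql.functions.{col, udf, when}

object PersonalDataFlag {
  // Hypothetical stand-in for potentialPD: the real UDF is not shown in these notes.
  val potentialPD = udf((name: String) => name != null && name.toLowerCase.contains("name"))

  // Condition written with equalTo, as in the original snippet.
  val viaEqualTo: Column =
    when(col("DATA_TYPE").equalTo("StringType"), potentialPD(col("name")))

  // Same condition written with the === operator.
  val viaTripleEquals: Column =
    when(col("DATA_TYPE") === "StringType", potentialPD(col("name")))

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[*]")
      .appName("personal-data-flag")
      .getOrCreate()
    import spark.implicits._

    // Assumed sample input: one row per column of some inspected table.
    val schemaInfo = Seq(
      ("first_name", "StringType"),
      ("age", "IntegerType")
    ).toDF("name", "DATA_TYPE")

    // when(...) without otherwise(...) leaves PERSONAL_DATA null
    // for rows where the condition is false.
    schemaInfo.withColumn("PERSONAL_DATA", viaEqualTo).show()
    schemaInfo.withColumn("PERSONAL_DATA", viaTripleEquals).show()

    spark.stop()
  }
}
```

Both equalTo and === build the same equality test on the column, so the choice is purely stylistic. Adding .otherwise(...) after when(...) would replace the nulls with an explicit default.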
A dummy DataFrame is also worth keeping as an example.
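The notes do not include the dummy DataFrame itself, so the following is only a sketch of two common ways to build one for quick experiments. The column names, sample values, and the object and method names are all assumptions made for illustration.

```scala
import org.apache.spark.sql.{DataFrame, Row, SparkSession}
import org.apache.spark.sql.types.{IntegerType, StringType, StructField, StructType}

object DummyDataFrame {
  // Quickest route: a local Seq of tuples plus toDF, with column names supplied.
  def fromTuples(spark: SparkSession): DataFrame = {
    import spark.implicits._
    Seq(("alice", 30), ("bob", 25)).toDF("name", "age")
  }

  // Explicit route: Row objects plus a hand-written schema,
  // useful when types or nullability must be controlled.
  def fromRows(spark: SparkSession): DataFrame = {
    val schema = StructType(Seq(
      StructField("name", StringType, nullable = true),
      StructField("age", IntegerType, nullable = true)
    ))
    val rows = Seq(Row("alice", 30), Row("bob", 25))
    spark.createDataFrame(spark.sparkContext.parallelize(rows), schema)
  }

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[*]")
      .appName("dummy-dataframe")
      .getOrCreate()

    fromTuples(spark).show()
    fromRows(spark).printSchema()

    spark.stop()
  }
}
```

The toDF form is usually enough for throwaway tests; the createDataFrame form is closer to what the programming guide describes for programmatically specifying a schema.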
//////////////////////////////////////////////
https://spark.apache.org/docs/2.3.0/sql-programming-guide.html#creating-dataframes