Interview questions ? - prabhu914/Hadoop-Interview-Question GitHub Wiki

CGI interview questions

1) What exactly is Humana's business? What does Humana expect from the final output?
2) How do you delete duplicate records from a DataFrame?
3) How do you join two DataFrames with a left outer join in Spark?
4) If a Kafka consumer fails to consume some data, how do you handle it? What happens to the missed data, and what is the solution?
5) How do you replace a string in Unix?
6) In an Oozie workflow, what are the parameters, and where should we define them?
7) In an Oozie workflow, where exactly would we define Hive or Pig actions?
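
For question 5, one common approach is `sed`. The sketch below is only an illustration: the sample file and the strings `world` and `hadoop` are made up for the example and are not part of the original question.

```shell
#!/bin/sh
# Hypothetical sample file, created only for this illustration.
tmpfile=$(mktemp)
printf 'hello world\nworld of data\n' > "$tmpfile"

# Replace every occurrence of "world" with "hadoop" and capture the result.
# The trailing "g" flag replaces all matches on a line, not just the first.
# The file itself is left unchanged by this command.
replaced=$(sed 's/world/hadoop/g' "$tmpfile")
printf '%s\n' "$replaced"

# To persist the change, write to a new file and move it over the original.
# This avoids `sed -i`, whose syntax differs between GNU and BSD sed.
sed 's/world/hadoop/g' "$tmpfile" > "$tmpfile.new" && mv "$tmpfile.new" "$tmpfile"
updated=$(cat "$tmpfile")

# Clean up the temporary file.
rm -f "$tmpfile"
```

Inside a shell variable, without touching any file, bash also offers the parameter expansion `${var//old/new}`, but that form is not available in plain POSIX `sh`.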