How to Enable EMRFS consistent view for AWS data pipeline - isgaur/AWS-BigData-Solutions GitHub Wiki

In order to enable consistent view, there is an option in EMR resource in data pipeline to add configuration [1] where you can add EMRFS consistent property so that it would be enabled on EMR.

To add configurations to the EMR Cluster:

1. Click the drop down menu next to "Add an optional field" and then select "Configuration" from the list. (Screen-shots attached)

2. Then select "Create new: EmrConfiguration".

3. Click "EmrConfiguration DefaultEmrConfiguration 1 "in the flowchart, specify the classification as "emrfs-site" and then select "Property" under "Add an Optional Field" drop down.   

4. Under property select "Create new: Property" from drop down. 

5. Now select "Property DefaultProperty1" from flow chat and then give property values as show. 

Key:  fs.s3.consistent 
Value:  true

Reference:

  1. EMR config https://docs.aws.amazon.com/datapipeline/latest/DeveloperGuide/dp-object-emrconfiguration.html
  2. https://docs.aws.amazon.com/emr/latest/ManagementGuide/enable-consistent-view.html