Twitter Streaming - pfriesch/onepercent GitHub Wiki
###Persist Data The Flume Agent persists all Twitter Tweets on the HDFS with the following scheme:
<yourFolder>/YEAR/MONTH/DAY/HOUR/*
Already persisted data ends with *.data, current written data ends with *.tmp
You can modify this behavior for sure.