roles - veeraravi/Spark-notes GitHub Wiki

I have total 9+ years of experience in IT, more than 5+ years of work experience in ingestion, storage, querying, processing and analysis of Big Data with hands on experience in Hadoop Ecosystem development including Map reduce, HDFS, Hive, Pig, Spark-Core, Spark-Sql, Spark-Streaming, Kafaka, HBase, ZooKeeper, Sqoop, Flume, Oozie and AWS.  Strong work experience with Ingestion, Storage, Querying, Processing and Analysis of Big data.  Extensive experience with Data Ingestion and applying the transformation using Apache Spark framework, and Hadoop ecosystem components.  Expert in working with Hive data warehouse tool - creating tables, data distribution by implementing partitioning and bucketing, writing and optimizing the HiveQL queries.  Good Knowledge on Hadoop Cluster architecture, monitoring the cluster, managing and troubleshooting Hadoop related issues.  Good experience with NoSQL databases like HBase and Cassandra.  Experience using Kafka cluster for Data Integration and secured cloud service platform like AWS and doing Data summarization, Querying and Analysis of large Datasets stored on HDFS and Amazon S3 filesystem using Hive Query Language (HiveQL).  Expertise in implementing Spark, Scala application using higher order functions for both batch and interactive analysis requirement  Expertise and strong understanding with Object-Oriented Programming (OOP) concepts in JAVA and SCALA  Worked on developing SPARK Applications using Scala and Java, Spark Core, Spark DataFrames, Spark SQL and Spark Streaming API’s for fast processing of Data.  Experience in developing Map-Reduce programs and custom UDF’s for data processing using Java and Scala.  Involved in Creating Hive Tables and load processed data into tables and fine-tuned performance using different optimization techniques.  Involved in creating HDINSIGHT cluster in MICROSOFT AZURE PORTAL, EVENTSHUB and AZURE SQL DATABASES.  Worked on a clustered Hadoop for Windows Azure using HDInsight and HORTONWORKS Data Platform for Windows.  Strong expertise on Amazon AWS EC2, Dynamo DB, S3, Redshift, and other services and hands on experience on Hadoop /Big Data related technology experience in Storage, Querying, Processing and analysis of data.  Expertise in various JAVA/J2EE technologies such as JSP 2.0, Servlets 2.x, Struts 1.2/2.0, Hibernate 2.0/3.0 ORM, Spring 2.0/3.0, JDBC.  Proficient in Core Java, SCALA, J2EE, JDBC, Servlets, JSP, Exception Handling, Multithreading Concepts.