Total experience of almost 7 years & 2 months in IT, 4 years & 2 months of experience in Big Data Technologies, and around 3 years of experience in Automation using Selenium java.
• Good knowledge of Hadoop ecosystem, HDFS, Big Data, PySpark.
• Worked on PySpark and have knowledge of Spark Architecture.
• Worked on PySpark RDD, DataFrame, SparkSQL
• Worked on AWS Cloud, S3 Bucket, EC2
• Optimization in Spark.
• Working knowledge of Python.
• Hands-on Experience in working with ecosystems like Hive, Sqoop, Map Reduce.
• Understanding of Partitioning and Bucketing in Hive.
• Working Knowledge of Hadoop Cluster architecture and Hive Architecture.
• Worked on basic SAS.
• Efficient in building a hive, pig, and map Reduce scripts.
• Implemented Proof of Concept on Hadoop and different big data analytic tools, migration from different databases to Hadoop.
• Loaded the dataset into Hive for ETL Operation.
• Experience in using DBvisualizer.
• Good analytical and technical skills with strong interpersonal, written, and verbal communication skills.
