Big Data, Realtime Analytics and Artificial Intelligence have been my passion for the last many years. As part of my PhD research, I have developed a Linear Programming based algorithm called ATSRA to allocate cluster resources to process Big Data. Currently I am working on a project with Canadian Immigration to develop a predictive model where state-of-the art Random Forest algorithm is used for AI.
Following is a summary of my expertise:
– AI Algorithms
– Programming skills in R, Python and Scala
– Advanced Mapreduce programming
– NoSQL databases, including MongoDB, HBase and Cassandra
– Apache Spark based Realtime Big Data Analytics
– IBM SPSS Modeller,
– Practical experience in clustering, classification, time series analytics, associative analysis & collaborative filtering.
– Multitenant Hadoop clusters ; and
I am also a Big Data Instructor and Course Editor for Learning Tree International. I teach Big Data courses in major North American cities and client sites. I am also involved with various research communities and the use Big Data for Artificial Intelligence is my primary research focus. Currently I am working on Real time processing of social media data using Apache Spark clusters.