Project - Sales Orders BI
Description - Sales Orders Project will combine subsets of data from systems containing Sales Orders information and stage it in a format that can be easily interrogated by the business users. The BI solution will provide a consolidated view of the captured Sales Orders information, answering questions. The Users will have the ability to report sales inventory quantity and value in each country, where available.
Global Supply Chain (GSC) organization
The Global Supplier BI of Xerox manages the invoice amount, amount distributed and payment amount. The individual teams under this group GSCB, XE-OMAF, OTC, OCF, NCTH and ACS need reports for analysis and decision making. GSC BI application is to mitigate the complexity by providing a single source instead of multiple sources, cleansed onestop integrated data warehouse that is flexible enough to accommodate new changes and scalable enough to efficiently cater to the growing user community. GSC BI application provides the flexibility for the Xerox business users to take key management decisions for the improvement of their business.
Programming Languages - Python, Java, C#, C
Python Libraries - Numpy, Matplotlib, Pandas
Algorithms - Linear Regression, Logistic Regression, Decision Tree, SVM, Naive Bayes, KNN, K-MeansDatabases - Oracle, MySQL, HBase,Mongo DB
Operating Systems - Windows, CentOS, Debian, Ubuntu
Tools and Software - Pycharm, Eclipse, Jira, HLM, GIT, SVN,Cosmos, Microsoft SQL Server data tools, SQL server BID Studio 2008, SQLServer Management Studio 2012/2008, SharePoint Server 2010,Performance Point Server 2010, Power BI
Hadoop Ecosystems - HDFS, MapReduce, PIG ,Ambari, HBase, Zookeeper, Hive, Sqoop, Oozie and Flume.
Sentiment Analysis of Twitter data using Naive-Bayes Classifier
the tweets regarding the top candidates in the 2016 election to predict
the public sentiments towards each presidential candidate. The project
attempts to solve the real world problem of understanding sentiment
towards election candidates based off the public live opinion and
emotions rather than off of smaller, localized polls typically done by
mainstream media corporations.
Organization Manager is a Web-based application with data feeds to/from other third-party or Home Office applications, including CRM applications. Organization Manager Application enables users to manage Company structure, Employee profile information and assignment to Sales Areas, Realignment Lists/Rules, Products, Instructions, Jobs and Scenarios for data processing and Data Loading. In this project by using Sqoop data is migrated from Oracle DB and MySQL DB which is provided by IMS Health, there are two data sets one is master data and another one patient's demographic data, in master data we have patient details and in demographic data we have prescription details and disease details. Some transformations on those two data sets and created hive tables and HBase tables, on top of the tables and used elastic search and the results given to the organization manager.
Restaurant Management System- Advanced Management system
Designed a database which held information on restaurants, its features,chefs, customer details and customer reviews. Integrated with SSRS for providing visualization. Performed normalization, data generation,loading, clustering, indexing and partitioning.
Companies Worked For:
Job Titles Held: