Senior System Engineer/IT Manager - Linux Seeking a challenging role in Linux/Hadoop System Administration/Management position where I can utilize my Linux/Hadoop skills to further the growth of the firm and prosper in an intellectual technology driven environment. Accomplished systems administrator with 7.5 years of experience managing server infrastructures and data-center operations across different flavours of Linux. Effectively plan, install, configure and optimize the IT infrastructure to consistently achieve high availability and performance.
*Proven ability to create and deliver solutions tied to business growth, organizational development and systems/network optimization. Skilled problem identifier and troubleshooter comfortable managing systems, projects and teams in a range of IT environments.
Hardware: HP ProLiant DL/ML series, Dell PowerEdge R620/R720 and Supermicro X8DTU-LN4+( Hands on experience in Racking, stacking servers in Datacenter)
OS Platforms: Linux & Unix (RHEL/Centos 5.x,6.x), Windows Server 2003/2008)
Linux Daemons & Utilities: Package and configuration management using rpm, yum, dpkg and Puppet, File Sharing on Linux/Windows hosts with SMB/CIFS via Linux/Windows servers - Utilizing ZFS and NFS3/4, iSCSI initiator/targets setups on Linux hosts, Version control with Subversion & Git systems, FTP, SSH, SCP Protocols (vsftpd, proftpd, openssh, rsync, netcat)
Web Servers software: Apache2.x, Tomcat 5/6 server setup and administration, Expert In configuring Reverse web proxy (mod_proxy, jk_mod), cache & load balancing techniques with HaProxy
Cloud Computing And Virtualization: Rigorous hands on experience, working with Amazon (AWS)
API to provision, maintain EC2 instances and EBS storage, S3 bucket, Glacier, VMware vSphere ESX.
Hadoop / Big Data: Consulting on Hadoop ecosystem : Hadoop, MapReduce, YARN, Hbase, Sqoop, Amazon Elastic Map Reduce (EMR). Have designed, deployed Hadoop clusters versions CDH 3, CDH4, CDH5
Programming Languages: Shell Scripting, Awk/Sed
July 2010 to May 2015
IT Manager/Senior System Engineer July 2010 to June 2013Opera Solutions Opera Solutions LLC － Jersey City, India, NJ
Opera Solutions is a Big Data company, which turns Big Data into exponential bottom-line improvement.
Opera uses advanced science to extract predictive Signals from Big Data and turn them into Best Actions packaged in as-a-service software solutions.
The Linux infrastructure comprises of 290 servers in total, with around 140 physical and 150 virtual servers.
The physical server stack consists of variety of Hardware including HP ProLiant DL360 G7/DL380p Gen8/DL580 G5/BL680c G5/, Dell PowerEdge R620/R720, Supermicro X8DTU-LN4+.
Most of the virtual environment is hosted on Vmware at Opera Datacenter, some is on Amazon AWS.
The environment is a mix of Redhat Enterprise Linux 5.x/6.x and Centos 5.x/6.x running on x86_64 architecture.
During my tenure of 4.5 years at Opera, I have been responsible for ensuring technical excellence in Linux infrastructure, learning new technologies and designing new systems needed while Opera started transforming from a regular Management consulting firm to a Product company.
Roles & Responsibilities At Opera Solutions Discuss, plan and build Linux system architectures purposed for webservers, application servers and big data platform Hadoop.
Leading a team of 8 system admins.
Design, deploy and maintain Cloudera Hadoop Clusters.
We have nearly 15 production Hadoop clusters running for separate projects that I designed and deployed.
Extensive usage of cloud server instances and storage from Amazon AWS, to enable client web applications to scale horizontally and vertically, and to have an on demand scalable Dev environment.
Documentation of system design, remote monitoring/alerting of critical client services, and root cause analysis and resolution of issue(s).
Designed, deployed in house Spacewalk server to serve as internal yum server.
Designed and implemented SSO authentication method, which uses Active directory domain accounts on all Linux servers, using winbind daemon.
To fix the SID to uid/gid mapping issue, used a LDAP server, that stores all the uid/gid mappings in its ldif files.
Tuning of various Kernel parameters using sysctl and virtual memory subsystem to improve system performance.
Assigned IRQ affinity to Processors to improve Real-Time performance.
Configure and tune LVM volumes.
Designed and deployed web Reverse proxy solutions using mod_proxy, jk_mod modules in apache.
Performance monitor using sar, iotop, iostat, top, vmstat, mpstat, pmap, iptraf Crash dump analysis by using Crash utility to analyze the crashdump file generated by kdump, diskdump and Netdump.
Tuning of system performance by system control variables stored in /proc/sys Filesystem tuning to reduce I/O latency by disabling mounting options like atime and tuning bdflush.
Designed, deployed monitoring solutions for all dev and prod Linux servers, tools like Nagios, Sargraph, graylog, opennms Deployed open source, inventory tool OCSInventory Work with application developers to deploy new tomcat builds, discuss possible solutions for given requirement changes or upcoming project needs on linux systems.
Co-ordinate with enterprise backup teams to ensure failed backup jobs are taken care of and perform RCA activities.
Perform any host-level tasks needed to add new SAN volume on a given linux server.
Defined and designed sudo access restrictions for developers and Analytics team to ensure data security.
Senior System Administrator April 2009 to July 2010DC Ops - HCL Tech. IOMC
Client-Dowjones & co.) The project involved a blend on Linux system administration and Operations role along with level 2 support for a number of critical applications, me and my team was working from HCL facility at India to support Dowjones & co.
business critical assets.
Whole environment had over 1000 servers, 200 windows, 300 however my team was responsible for nearly 100 servers for system support.
The frontline of support was Helpdesk team, who would escalate the issue to us if the impact is global or if we encounter Critical alerts on monitoring systems, I was responsible for providing the first line of help for linux system support, if required we would involve the level 3 system admins for more help.
Roles & Reponsiblities Installation and configuration of Redhat Linux servers for dev environment.
Monitoring Production servers by means of variety of monitoring tools like: Sitescope, Cacti, Splunk, BMC Patrol and take appropriate actions respectively.
Troublshooting Performance Issues, using sar, iotop, iostat, top, vmstat, mpstat, pmap, iptraf Responsible for performing emergency/scheduled failover for a number for mission critical applications like DJIA (Dowjones Industrial Averages) and DJ Indexes whenever required, so as to maintain High availability.
Performing weekly/monthly maintenance change tasks, which involves patch update.
Configure and tune LVM volumes.
Monitoring and managing Webservers(Apache), FTP servers Meeting the SLA defined as per the pre-categorized Severity 1,2,3 & 4 issues.
Hosting and Attending Change meetings for corresponding Line of Businesses.
Performing SOP(Standard Operating Procedure) for troubleshooting variety of issues, also performing emergency failover of apps on production server.
Coordinating with different support groups and respective LOB's availability managers to conduct meetings for root cause analysis of Sev 1 & 2 issues.
Specialist-EUC - HCL Comnet Ltd － Gurgaon, India
4th Feb '08 - 31st March 2009) (Client-Etrade Financials) The Project involved first level of system support related to Linux environment, it was a blend of Global access management team and production support.
Roles & Reponsiblities User administration: creating new accounts on Redhat Directory Server, managing group access.
Provisioning sudo access.
Troubleshooting performance issues and seek help from level 3 system admins for further support.
Package management: using rpms, yum and compiling from source, installing python packages, compiling python binaries from source.
Administering FTP servers.
Defending escalations from support groups and conducting internal trainings for the team both technical and process oriented.
Rebooting apps servers, recycling services on WebServers and FTP servers whenever required as a part of L1 support on windows/linux/unix based production servers.
Bachelor of Engineering (BE) : Information TechnologyCareer InstituteInformation Technology
Of Technology & Management Faridabad, India in June 2007
*RHCE Certified, Certificate # 100-162-734
Active directory, Apache2.x, Apache, API, Awk, backup, big data, BMC Patrol, c, Hardware, configuration management, Consulting, Client, Version control, Database, Dell, designing, Documentation, Financials, firewall, FTP, HP, http, inventory, LDAP, Linux 5.x, Linux, managing, Management consulting, meetings, memory, access, Windows, mod, Mysql, Enterprise, NFS3, Network Security, OS, PowerEdge, Processors, Programming, ProLiant, Protocols, proxy, python, Real-Time, Redhat Linux, Redhat, requirement, SAN, scanning, SSH, Sed, Sendmail, servers, Shell Scripting, SLA, SMTP, SOP, system administration, system design, tomcat, Tomcat 5, Troubleshooting, Unix, Utilities, web applications, Web Servers, Windows Server, x86
Resumes, and other information uploaded or provided by the user, are considered User Content governed by our Terms & Conditions. As such, it is not owned by us, and it is the user who retains ownership over such content.
Companies Worked For:
Opera Solutions Opera Solutions LLC
DC Ops - HCL Tech. IOMC
Specialist-EUC - HCL Comnet Ltd
Job Titles Held:
IT Manager/Senior System Engineer
Senior System Administrator
Bachelor of Engineering (BE) : Information Technology Of Technology & Management Faridabad, India in June 2007
*RHCE Certified, Certificate # 100-162-734
Create a job alert for [job role title] at [location].