LiveCareer
LiveCareer
  • Dashboard
  • Jobs
  • Resumes
  • Cover Letters
  • Resumes
    • Resumes
    • Resume Builder
    • Resume Examples
      • Resume Examples
      • Nursing
      • Education
      • Administrative
      • Medical
      • Human Resources
      • View All
    • Resume Search
    • Resume Templates
      • Resume Templates
      • Nursing
      • Education
      • Medical
      • Human Resources
      • Customer Service
      • View All
    • Resume Services
    • Resume Formats
    • Resume Review
    • How to Write a Resume
    • CV Examples
    • CV Formats
    • CV Templates
    • Resume Objectives
  • Cover Letters
    • Cover Letters
    • Cover Letter Builder
    • Cover Letter Examples
      • Cover Letter Examples
      • Education
      • Medical
      • Human Resources
      • Customer Service
      • Business Operations
      • View All
    • Cover Letter Services
    • Cover Letter Templates
    • Cover Letter Formats
    • How to Write a Cover Letter
  • Jobs
    • Mobile App
    • Job Search
    • Job Apply Tool
    • Business Letters
    • Job Descriptions
  • Questions
  • Resources
  • About
  • Contact
  • 0Notifications
    • Notifications

      0 New
  • jane
    • Settings
    • Help & Support
    • Sign Out
  • Sign In
Member Login
  • LiveCareer
  • Resume Search
  • Data Scientist, Tech Director & DBA of CiteSeerX
Please provide a type of job or location to search!
SEARCH

Data Scientist, Tech Director & DBA of CiteSeerX Resume Example

Resume Score: 100%

Love this resume?Build Your Own Now
DATA SCIENTIST, TECH DIRECTOR & DBA OF CITESEERX
Career Overview
Ten years work experience under Linux/Unix environment; Latex and MS Excel. · Five years work experience of building/maintaining production MySQL databases and Apache Solr, debugging and optimizing ETL work flows, based on scholarly big data. · Five years work experience of search engine architecture and infrastructure, deploying and implementing web application features · Five years work experience of designing, coding, and testing LAMP website powered by MySQL databases and Apache Solr, using frameworks such as Django and Spring. 1 Update on February 10, 2017 · Five years programming experience with Python; familiar with load balancing, virtual environment, firewall (e.g., iptables), and file systems. · Three years work experience of managing software projects on open source software platforms, e.g., GitHub. · Two years experience of analyzing logs using MapReduce; Deep Learning architectures of RNN and CNN on video data; experience with Amazon AWS, Microsoft Azure Cloud, Google Cloud, and Google Analytics; Experience with NLP tools, Bash, Java, R, Ruby on Rails, RESTful API. · Backgrounds in Physics, Math, and Statistics; Familiar with ML, NLP, ANN, IR, and genetic algorithms.
Work Experience
06/2013 to Current
Data Scientist, Tech Director & DBA of CiteSeerXCompany Name - City, State
  • I started with the web crawling module of CiteSeerX in 2011, then expanded to the full architecture around 2013.
  • My job duties include administrating the MySQL database and Apache Solr index servers, hacking the source code (Python/Java/Perl) to fix security vulnerabilities, developing new web application features, managing 100+ terabytes production and research data, maintaining 30+ physical and virtual servers to facilitate production and research, and developing software to improve web crawling, information classification and extraction.
  • By the end of 2014, I was able to run the entire search engine single handed.
  • In 2015, I proposed infrastructure and software solutions to overcome scalability bottlenecks and blueprinted the next generation of CiteSeerX.
  • By the end of 2016, I had scaled the data collection from 3 million to over 10 million documents.
  • Currently, the system can keep running for several months without major issues.
  • The 200+ page system document wrote by me significantly flattens learning curve for new admins.
  • I used to assist 3+ professors to build private cloud and GPU infrastructure.
  • I also have experience of working on a Hadoop cluster, and programming with MapReduce.
  • Post-doctoral Scholar June 2011 - present.
06/2006 to 05/2011
Research AssistantCompany Name - City, State
  • Utilize astronomical big data, compiled from archives of space- and ground-based telescopes, such as the Hub- ble Space Telescope and the Sloan Digital Sky Survey to investigate important correlations between physical parameters of Active Galactic Nuclei and quasars.
  • Publish 7 peer reviewed journal articles.
08/2004 to 05/2006
Teaching AssistantCompany Name - City, State
  • Lecture non-science college students on astronomical fundamentals.
Education and Training
August, 2011
Ph.D: Astronomy and AstrophysicsPennsylvania State University - City, State, USAAstronomy and Astrophysics
Ph.D: Computational ScienceComputational Science
July 2004
B.S: Physics and AstronomyUniversity of Science and Technology of China HefeiChinaPhysics and Astronomy
Interests
Entity Recognition in Scientific Document Ongoing Leader Research · Recognize and extract semantic domain knowledge entities from scientific documents Video Compression with ANN Ongoing Co-leader Research · Perform near-lossless video compression using artificial neural network models Migrating CiteSeerX to a Private Cloud Published in 2014 Leader System · Migrate CiteSeerX production servers to a private cloud with virtualization techniques Document Classification in Digital Libraries Published in 2014 and 2016 Co-leader Research · Automatically and accurately classify PDF documents with ML and structural features PUBLICATIONS · See http://fanchyna.wixsite.com/jianwu/pubs for all publications. OTHER INFORMATION · PC members of 5 conferences/workshops · Reviewers for 14 top-tier conferences/journals/transactions, including WWW, SIGIR, and TKDE · Collaborated with people from UNT, Microsoft, AllenAI, and Internet Archive 2 Update on February 10, 2017
Skills
Apache, AI, big data, conferences, content, data collection, Database, features, Hub, Java, managing, MySQL, NLP, next, search engines, page, PDF, Perl, programming, proposals, publications, Python, research, scientific, servers, developing software, teaching, typing, articles
Additional Information
  • HONORS AND AWARDS Best paper nomination in the 8th International Conference on Knowledge Capture 2015 Best application paper in the 26th Annual Conference on Innovative Applications of Artificial Intelligence 2014 Best paper nomination in the IEEE International Conference on Cloud Engineering 2014 Zaccheus Daniel Fund 2009 Zaccheus Daniel Fund 2007 Stephen B. Brumbach Fellowship 2006 USTC Excellent Graduate Student Award 2004 SELECTED PROJECTS Entity Recognition in Scientific Document Ongoing Leader Research · Recognize and extract semantic domain knowledge entities from scientific documents Video Compression with ANN Ongoing Co-leader Research · Perform near-lossless video compression using artificial neural network models Migrating CiteSeerX to a Private Cloud Published in 2014 Leader System · Migrate CiteSeerX production servers to a private cloud with virtualization techniques Document Classification in Digital Libraries Published in 2014 and 2016 Co-leader Research · Automatically and accurately classify PDF documents with ML and structural features PUBLICATIONS · See http://fanchyna.wixsite.com/jianwu/pubs for all publications. OTHER INFORMATION · PC members of 5 conferences/workshops · Reviewers for 14 top-tier conferences/journals/transactions, including WWW, SIGIR, and TKDE · Collaborated with people from UNT, Microsoft, AllenAI, and Internet Archive 2 Update on February 10, 2017
Build Your Own Now

DISCLAIMER

Resumes, and other information uploaded or provided by the user, are considered User Content governed by our Terms & Conditions. As such, it is not owned by us, and it is the user who retains ownership over such content.

Resume Overview

School Attended

  • Pennsylvania State University
  • University of Science and Technology of China Hefei

Job Titles Held:

  • Data Scientist, Tech Director & DBA of CiteSeerX
  • Research Assistant
  • Teaching Assistant

Degrees

  • Ph.D : Astronomy and Astrophysics
    Ph.D : Computational Science
    B.S : Physics and Astronomy

Create a job alert for [job role title] at [location].

×

Advertisement

Similar Resumes

View All
Director-of-Data-and-Software-Solutions-resume-sample

Director of Data and Software Solutions

Terre Haute, Indiana

Sr.-Oracle-DBA-resume-sample

Sr. Oracle DBA

Knoxville, Tennessee

Director-of-Information-Systems-resume-sample

Director of Information Systems

Saint Maries, Idaho

About
  • About Us
  • Privacy Policy
  • Terms of Use
  • Sitemap
Help & Support
  • Work Here
  • Contact Us
  • FAQs
Languages
  • EN
  • UK
  • ES
  • FR
  • IT
  • DE
  • NL
  • PT
  • PL
Customer Service
customerservice@livecareer.com
800-652-8430 Mon- Fri 8am - 8pm CST
Sat 8am - 5pm CST, Sun 10am - 6pm CST
  • Stay in touch with us
Site jabber winner award

© 2021, Bold Limited. All rights reserved.