Responsibilities
- Coordinate and Track: Manage data receipt from vendors, research collaborators, and external investigators with precision.
- Pipeline Development: Create efficient data processing pipelines for routine data cleaning and organization tasks.
- Data Cleaning: Perform essential data cleaning, including de-identifying sample identifiers, standardizing metadata, and organizing data according to SOPs.
- Quality Control: Conduct thorough quality control checks on incoming data and released datasets.
- SOP Maintenance: Continuously improve data management SOPs, ensuring quality, compliance, and privacy.
- Data Cataloging: Maintain a comprehensive data catalog documenting dataset descriptions, file locations, Company Base availability, and backup/archive statuses.
Data Sharing and Support
Support investigators: Facilitate data sharing for team's investigators and various company cohorts and collaborations.
Oversee the sharing of datasets received from external investigators
Dataset Updates: Assist with periodic updates to internally-generated datasets
External Inquiries: Address data-related questions from external investigators about released datasets.
Cloud Access: Support data access for cloud platforms.
Preferred Qualifications
Education: PhD or Master's degree in computer science, data science, engineering, or a related field. Minimum of 3 years of experience in data management.
Required Skills:
- Experience managing large-scale data in an HPC environment.
- Proficiency in Linux and Bash, including data permissions management.
- Strong Python skills, particularly with Pandas and similar packages.
- Experience using git/github.
- Exceptional organizational skills and attention to detail.
- Effective communication skills, both oral and written.
- Ability to thrive in collaborative environments.
Helpful Skills
- Project management experience.
- Familiarity with genomics data and file types.
- Knowledge of popular bioinformatics command-line tools.
- Experience with data processing and storage on cloud platforms.
- Experience creating and/or hosting dashboards or platforms for data visualization and interaction.
Compensation and Benefits
Competitive salary and comprehensive benefits package.
Benefit Highlights
- 10% 401K match
- Full medical care coverage
- Breakfast and Lunch provided in office
Logistics
* This opportunity requires onsite scheduling in NYC
* Company is able to provide relocation package to relocate candidates