Genetic Sequence Database Product Owner and Data Wrangler

Bethesda, MD
Full Time
Experienced
Computercraft is looking for a Genetic Sequence Database Product Owner and Data Wrangler to support our work for the National Center for Biotechnology Information (NCBI), part of the National Library of Medicine (NLM) at the National Institutes of Health (NIH).

NCBI, one of the 400 most-visited sites in the world, is the premier biomedical center, hosting over four million daily users in search of clinical, genetic, and other information. NCBI’s wide range of applications (e.g., PubMed, ClinicalTrials.gov), platforms, and environments (e.g., big data [petabytes], machine learning, multiple clouds) serve more users with more data than any other U.S. Government agency. Working on NCBI products, you can help to accelerate the development of cures for diseases like cancer.

The Sequence Archives and Submissions (SeqArch) program needs a Product Owner and Data Wrangler for the GenBank sequence database, a unique scientific resource of human health and genetic data at NCBI. This person will be responsible for coordinating data exchange with the International Nucleotide Sequence Database Collaboration, generating downloadable data for external users, and coordinating targeted updates to the database based on systematic changes in taxonomic information.

In this position you will help manage GenBank’s data-access-related products, tools, and protocols. You will make decisions about the direction of the product and prioritize tasks. You will also work to define development tasks, establish delivery schedules, and ensure compliance with the organization’s policies and procedures.

Job Responsibilities
  • Develop product vision, goals, and strategic roadmaps
  • Lead data-gathering efforts through market research, data analysis, and user research to make balanced, objective decisions and provide clear guidance to delivery teams to create incremental value in an Agile environment
  • Synthesize data-gathering efforts into a logical organization of epics and user stories for the development team
  • Collaborate with users and lead cross-functional teams to define and optimize user workflows to improve user experience
  • Understand customer segments and identify targeted solutions to exceed their needs
  • Lead teams through a complete product lifecycle of discovery to delivery
  • Nurture partnerships with various stakeholders who wish to participate in the sharing of genomic data for research in cloud and conventional environments, using secure cross-agency protocols
  • Participate in external collaborations and work with senior stakeholders
  • Analyze incoming genetic sequence data for trends
  • Prioritize the actions of the product team
  • Critically evaluate datasets and functional annotations to assess quality
  • Monitor automated dataflows for loading data to production databases
  • Provide critical expertise to NCBI in biological data curation of genetic sequences
  • Analyze log files, error files, or test-case “diffs” that can total hundreds of megabytes using tools such as sed, grep, awk, and Perl to confirm known/expected outcomes and identify outlier/problematic outcomes

 Required Skills/Experience
  • B.S. in bioinformatics, molecular biology, data science, computer science, information technology, or a similar field
  • Excellent verbal and written communication skills
  • Genomics/bioinformatics experience
  • Strong understanding of molecular biology concepts
  • Scientific ETL data model experience/skills
  • The ability to troubleshoot technical and staffing roadblocks and mitigate resource risks
  • Experience managing large and cross-functional projects in a complex, policy-driven environment
  • Strong customer engagement, networking, presentation, and collaboration skills
  • Ability to incorporate and diplomatically resolve conflicting priorities from multiple user groups and technical stakeholders
  • Data processing experience in a Linux environment (5+ years)
  • Experience coaching team members and eliminating knowledge silos
Desired Skills/Experience
  • Experience working with GenBank or other sequence databases at NIH or other organizations
  • Experience with data interoperability and sharing standards and policies
  • Experience working with Cloud data storage and processing platforms (e.g., AWS, GCP)
  • Proficiency in at least one scripting language (e.g., BASH, Python)
  • Experience working with large SQL databases involving many tables and billions of data rows
  • Experience with CI/CD pipelines, unit tests, integration, and regression testing
  • Expertise in bioinformatics of sequence analysis and tools including BLAST and multiple sequence aligners
  • Solid understanding of key molecular biology concepts, such as the central dogma that describes the flow of genetic information from gene (DNA) to mRNA to protein
  • Experience working in Product Owner or Product Manager positions in an Agile environment (e.g., developing vision, strategic plan, roadmap, requirements; applying user testing methodologies; prioritizing features based on value and effort)

The compensation for this position will be based on the experience of the successful candidate. The expected pay range for this position is $110,000 to $150,000. 

 

Computercraft offers an excellent benefits package that includes health, dental, vision, and disability and life insurance; a 401(k) plan with matching; paid leave starting at 128 hours/year for the first 3 years of employment; and 11 paid holidays. We also offer the opportunity for a positive work–life balance with a standard 40-hour work week and the chance to work alongside a team of highly accomplished professionals.

To learn about other Computercraft job opportunities, please visit the Careers section of our website: https://www.computercraft-usa.com/.

EEO Employer – Disability/Veteran/Race/Color/Religion/Sex/National Origin/Genetic Information

Share

Apply for this position

Required*
Apply with Indeed
We've received your resume. Click here to update it.
Attach resume as .pdf, .doc, .docx, .odt, .txt, or .rtf (limit 5MB) or Paste resume

Paste your resume here or Attach resume file

To comply with government Equal Employment Opportunity and/or Affirmative Action reporting regulations, we are requesting (but NOT requiring) that you enter this personal data. This information will not be used in connection with any employment decisions, and will be used solely as permitted by state and federal law. Your voluntary cooperation would be appreciated. Learn more.

Invitation for Job Applicants to Self-Identify as a U.S. Veteran
  • A “disabled veteran” is one of the following:
    • a veteran of the U.S. military, ground, naval or air service who is entitled to compensation (or who but for the receipt of military retired pay would be entitled to compensation) under laws administered by the Secretary of Veterans Affairs; or
    • a person who was discharged or released from active duty because of a service-connected disability.
  • A “recently separated veteran” means any veteran during the three-year period beginning on the date of such veteran's discharge or release from active duty in the U.S. military, ground, naval, or air service.
  • An “active duty wartime or campaign badge veteran” means a veteran who served on active duty in the U.S. military, ground, naval or air service during a war, or in a campaign or expedition for which a campaign badge has been authorized under the laws administered by the Department of Defense.
  • An “Armed forces service medal veteran” means a veteran who, while serving on active duty in the U.S. military, ground, naval or air service, participated in a United States military operation for which an Armed Forces service medal was awarded pursuant to Executive Order 12985.
Veteran status



Voluntary Self-Identification of Disability
Voluntary Self-Identification of Disability Form CC-305
OMB Control Number 1250-0005
Expires 04/30/2026
Why are you being asked to complete this form?

We are a federal contractor or subcontractor. The law requires us to provide equal employment opportunity to qualified people with disabilities. We have a goal of having at least 7% of our workers as people with disabilities. The law says we must measure our progress towards this goal. To do this, we must ask applicants and employees if they have a disability or have ever had one. People can become disabled, so we need to ask this question at least every five years.

Completing this form is voluntary, and we hope that you will choose to do so. Your answer is confidential. No one who makes hiring decisions will see it. Your decision to complete the form and your answer will not harm you in any way. If you want to learn more about the law or this form, visit the U.S. Department of Labor’s Office of Federal Contract Compliance Programs (OFCCP) website at www.dol.gov/ofccp.

How do you know if you have a disability?

A disability is a condition that substantially limits one or more of your “major life activities.” If you have or have ever had such a condition, you are a person with a disability. Disabilities include, but are not limited to:

  • Alcohol or other substance use disorder (not currently using drugs illegally)
  • Autoimmune disorder, for example, lupus, fibromyalgia, rheumatoid arthritis, HIV/AIDS
  • Blind or low vision
  • Cancer (past or present)
  • Cardiovascular or heart disease
  • Celiac disease
  • Cerebral palsy
  • Deaf or serious difficulty hearing
  • Diabetes
  • Disfigurement, for example, disfigurement caused by burns, wounds, accidents, or congenital disorders
  • Epilepsy or other seizure disorder
  • Gastrointestinal disorders, for example, Crohn's Disease, irritable bowel syndrome
  • Intellectual or developmental disability
  • Mental health conditions, for example, depression, bipolar disorder, anxiety disorder, schizophrenia, PTSD
  • Missing limbs or partially missing limbs
  • Mobility impairment, benefiting from the use of a wheelchair, scooter, walker, leg brace(s) and/or other supports
  • Nervous system condition, for example, migraine headaches, Parkinson’s disease, multiple sclerosis (MS)
  • Neurodivergence, for example, attention-deficit/hyperactivity disorder (ADHD), autism spectrum disorder, dyslexia, dyspraxia, other learning disabilities
  • Partial or complete paralysis (any cause)
  • Pulmonary or respiratory conditions, for example, tuberculosis, asthma, emphysema
  • Short stature (dwarfism)
  • Traumatic brain injury
Please check one of the boxes below:

PUBLIC BURDEN STATEMENT: According to the Paperwork Reduction Act of 1995 no persons are required to respond to a collection of information unless such collection displays a valid OMB control number. This survey should take about 5 minutes to complete.

You must enter your name and date
Human Check*