Bioinformatics Engineer
6 months+ Contract
100% Remote
Bioinformatics Engineer to build and maintain systems that turn raw scientific data into highly useful, accessible information that supports biologic drug development. The person in this role will architect, construct, and administer component technologies, including analytical databases and cloud data processing pipelines.
The solutions established and maintained by the Engineer will collect data from a variety of sources (e.g., laboratory instruments, sensors, probes, LIMS software). These solutions will store, transform, integrate, and make data available in formats useful for additional query and analysis in the cloud. The data will feed webapps, dashboards, reports, and data science tools, helping to generate critical insights across research, development, and manufacturing.
Essential Duties and Responsibilities
• Employ a variety of languages and tools to combine data sources and create reliable data pipelines
• Design and implement the structure and functional capabilities of key data resources (e.g., data base schema, data lake layers)
• Architect appropriate data solutions to meet research and business needs while also ensuring fit within Lumen's cloud ecosystem
• Develop scripts that transform data into useful formats for further analysis by researchers
• Investigate opportunities for additional data acquisition and processing automation; working closely with scientist and bioinformatics team members to define key requirements
• Assist in the development and implementation of standard data tables, reports, dashboards, and visualizations that support scientific research and business goals
• Provide recommendations to improve data quality, reliability, and processing efficiency
• Collaborate with consultants and contractors to provide sound data solutions built for performance, reliability, and security
• Administer data infrastructure and resources to help ensure data security and integrity
• Facilitate legacy data migrations
• Document system design
• Where appropriate, provide end-user support, documentation, and training
Desired Qualifications and Requirements
• B.S./B.A. degree in Computer Science or related field, or equivalent work experience
• Demonstrated experience developing advanced SQL queries and building relational databases
• Proficiency in shell scripting and especially Python. Majority of the work will be converting existing scripts into python.
• Demonstrated experience with ETL development
• In-depth knowledge of cloud computing platforms (i.e., AWS, GCP, or Azure)
• Experience processing large, multi-dimensional datasets from diverse, distributed sources
Desired Qualifications:
• Biotechnology experience
• Cloud data lake and data warehouse experience
• Exposure to software validation process