Bioinformatician - PandaOmics (UAE)

Remote, United Kingdom - Insilico Medicine - Full time

**About the Role**

As a Bioinformatician specializing in OMICs data preprocessing, you will play a crucial role in our team's efforts to analyze and derive insights from various high-throughput biological datasets. Your primary responsibility will be to establish robust pipelines for unified preprocessing of diverse OMICs data types, including transcriptomics, methylation, proteomics, and more. By ensuring the efficient processing, quality control, and integration of these data, you will contribute to the development of a comprehensive and up-to-date main database.

In this role, you will collaborate closely with multidisciplinary teams of researchers, data analysts, and software developers to support their investigations and provide them with reliable, preprocessed data for downstream analysis. Your expertise in the Python and R programming languages will be essential for implementing algorithms, statistical analyses, and visualization techniques. Additionally, your understanding of common open repositories of OMICs data and of version control systems will enable data retrieval, integration, and tracking.

To succeed in this role, you should be an effective communicator with strong written and verbal English skills. Your ability to work autonomously, manage multiple projects, and collaborate effectively within a team will be critical. A solid foundation in molecular biology and genetics will further enhance your comprehension of the underlying biological context and enable you to provide valuable insights during data analysis and interpretation.

**Place of work**

Level 6, Unit 08, Block A, IRENA HQ Building, Masdar City, Abu Dhabi, United Arab Emirates

**Reports to**

PandaOmics Development and Data Management Lead

**Responsibilities**:

- Develop and maintain bioinformatics pipelines for the preprocessing and analysis of various OMICs data types, including transcriptomics, methylation, proteomics, genetic variants, and more.
- Establish and maintain efficient workflows for retrieving data from common open repositories of OMICs data and for integrating and storing it, ensuring data integrity and accessibility.
- Collaborate with multidisciplinary teams to understand their data requirements, provide guidance on data preprocessing, and ensure the delivery of high-quality preprocessed data for downstream analysis.
- Stay updated with the latest bioinformatics methodologies, tools, and data analysis techniques through scientific literature review, and incorporate relevant findings into projects.
- Collaborate with software developers and IT teams to ensure seamless integration of bioinformatics pipelines with existing systems and databases.
- Document bioinformatics pipelines, methodologies, and data processing workflows to facilitate reproducibility and knowledge sharing within the team.

**General Requirements**:

**I. Education**
- Bachelor's degree or Master's degree

**II. Experience and Skills**
- Experience in Python programming
- Familiarity with R programming and main Bioconductor packages
- Knowledge of common open repositories for OMICs data, such as Gene Expression Omnibus (GEO), Sequence Read Archive (SRA), ENCODE, or ArrayExpress
- Expertise in preprocessing and analysis of various OMICs data types, including transcriptomics, methylation and genomic data
- Effective communication skills in English
- Ability to work both autonomously and as part of a team: managing your own tasks independently while also collaborating with colleagues, sharing knowledge, and contributing to collective goals
- Experience in scientific literature review and research
- General knowledge of molecular biology and genetics
- Familiarity with version control using Git and hosting platforms such as GitHub or GitLab, in order to manage code repositories, track changes, and collaborate effectively with other team members
- Experience with bioinformatics pipelines and workflow management systems such as Apache Airflow is desirable but not mandatory