The service is requested as part of project Global omics Data Management (gODM) - expansion to clinical area. The project has the purpose to devise and implement the expansion of a productive metadata and data management system for omics data at Boehringer Ingelheim (BI) so that data from clinical area can also be stored. This mainly affects three key infrastructure pieces: (i) metadata repository (MDR), (ii) large file repository (LFR), and (iii) matrix repository (MXR). Moreover, the system under considerations has to be integrated with existing infrastructure and a user authentication & authorization module. Project infrastructure will be based amongst others on cloud technology (AWS, Azure), graph and relational database engines. Continuous development and deployment will be done via utilizing BI's OpenDevStack environment. The performance of the contractor has the goal to accompany the Global ODM realization as an additional core developer. Collaboration will be together with other external BI employee.
Contractor will receive the system access and documentation required for his work accordingly.
Solution is based on an already worked out system architecture, suitable technology standards and will be checked by BI before it can be used in production by end users.
The scope of services includes the following tasks, which are independently performed by the external contractor:
- Independent implementation of infrastructure components as part of a gODM system
- Components shall cover resources in AWS (Amazon Web Services) and on premise to store metainformation, large file data and processed data in the different repositories such as Stardog, S3, PostgreSQL or Aurora.
- Independent development (optionally: R, Python, REST endpoints) of individual API (application programming interface) layers to these resources. This is intended as an extension of already existing interfaces.
- Programming of suitable clients to establish largely automated data transfer via the new API endpoints to be created subsequent to user requirement analysis.
- Independent communication with persons responsible for systems whose service interfaces have to be integrated into the gODM system under development (BI personnel & further project partners); communication will be done via MS Teams, E-Mail or in project meetings coordinated by project manager
- Expansion of gODM to meet a more restrictive authorization concept from clinical area. This includes check and expansion of existing data transfer pipelines (e.g. nextflow) so that access is only permitted via the specific authorization groups for the clinical area.
- Independent writing of system documentation (implementation decisions, system architecture, system maintenance) within project Confluence space. This must be accepted by BI as the last instance at the end.