The EMBL-EBI Job Dispatcher framework (https://www.ebi.ac.uk/services) provides free access to Bioinformatics tools and related data via a web browser and via Web Services. In this talk, I would like to demonstrate the ‘Tools as a service’ capabilities of the framework and how to integrate these services into analysis workflows.
EBI Search, a search engine allowing access to data available from EMBL-EBI, provides extended search and results features in addition to text searching; These include bi-directional cross-reference searching, finding similar entries (faceting and more-like-this) and searching applied on sequences analysis results, which will be on the agenda of the talk.
In this talk we will demonstrate using the RNAcentral database to explore non-coding RNA sequences. We will focus on examining a genomic region for ncRNA, searching for ncRNA sequences by name, and finding ncRNA sequences similar to novel sequences. This talk is aimed at anyone interested in ncRNA data
We will present updates to UniProt including integration of clinically relevant data within new tools and services on the UniProt website and as programmatic services to facilitate the integration of the UniProtKB into users’ analysis pipelines. We will also introduce a collection of portable web components to visualise protein knowledge.
In this session we will discuss the infrastructure and requirements for buliding the data pipeline for AI and analytics in context of use cases in bio medicine and R&D.
Research has become increasingly complex and compute intensive. While new tools and analytical processes such as artificial intelligence (AI) and deep hold great promise, they stress the supporting IT infrastructure. Learn how today's storage systems leverage software to deliver the performance, scale, and cost efficiencies for tomorrow's life sciences applications.
The demonstration will be focusing on the utility of our core BioThings APIs (targeting bioinformaticians and researchers as the API users); and the set of Biomedical API development tools like BioThings SDK, BioThings Studio and BioThings Explorer (targeting API developers).
23andMe is a company is built on science. We make evidence-based decisions. In a world where health is poorly understood, we use data to better understand wellness and disease. We bring science to our customers and help them use it in meaningful ways. In 2015 we began a Therapeutics division (https://therapeutics.23andme.com/) to use our genotype and phenotype data to help develop new therapies for diseases and conditions with serious unmet medical need. In this talk I will discuss how we use our large genotype and phenotype data base to drive the discovery of novel therapeutics. I will show some examples where we find known targets to exemplify the approach. I will discuss how different ethnicities and founder populations can provide us with novel insights. I will discuss some of the challenges in going from a genetic signal to a drug target.
In this session, we will showcase eLife’s first computationally reproducible article as part of the Reproducible Document Stack (RDS) project. We will demonstrate how researchers can create reproducible articles with Stencila Desktop, an open-source, user-friendly manuscript editor, and outline our plans to develop RDS into a scalable, production-ready solution.
PDFigCapX and FigSplit - a Pipeline for Extracting Figures, SubFigures and Captions from Biomedical Publications: Supporting Bio-Curation and Discovery
Room: Kairo 1/2 (Ground Floor)
Hagit Shatkay, University of Delaware, United States
Pengyuan Li, University of Delaware, United States
Figures and captions convey essential information in biomedical publications. We introduce an effective pipeline, comprising two systems, toward curation-support of published figures: PDFigCapX for identifying and extracting figures and captions from publications, and FigSplit for splitting extracted compound multi-panel images into their constituent subfigures.
Phyre2 – a web-server to predict protein structure from sequence (http://www.sbg.bio.ic.ac.uk/phyre2 )
PhyreRisk – a dynamic database mapping variants onto experimental and Phyre-predicted structures( http://phyrerisk.bc.ic.ac.uk )
Missense3D – a web server that reports the structural impact of a missense variant on experimental and predicted structures http://www.sbg.bio.ic.ac.uk/~missense3d/ )
Isabl is a plug-and-play open-source framework for scalable bioinformatics operations designed to support automated processing and management of NGS assets and their metadata. Isabl's patient centric model enables advanced integrations with multinomial data types and linkage to institutional information systems such as clinical databases and visualization interfaces.
It takes many years from studying diseases and presuming suitable targets to finally filing a drug for approval by health authorities. In this talk, we will describe the “daily life” of bioinformaticians in Roche and how all steps along the development chain are driven by data.
The geneXplain platform is an online toolbox and workflow management system for a broad range of bioinformatic and systems biology applications that has been used in a number of studies. This demonstration introduces data management and analysis capabilities of the platform highlighting its utility for network analysis and drug discovery.
Fairspace, a new open source project from The Hyve, helps organizations quickly stand up a FAIR compliant metadata, data governance and auditing layer over existing storage and applications. Fairspace is developed in collaboration with Institut Curie (Paris), and aims to contribute to the growing open source ecosystem around FAIR Data.
SWISS-MODEL (swissmodel.expasy.org) is a state-of-the-art computational protein structure modelling workbench with a user-friendly web interface. This talk will present the fully automated SWISS-MODEL workflow to predict high-quality three-dimensional structures of proteins and complexes in their most likely quaternary structure starting from the respective amino acid sequences.
Advances to the GeneWeaver software system for the integration, analysis and visualization of genomics data across species will be presented, including its redevelopment as a suite of RESTful API services. Applications in interpretation of human genetics and navigation of heterogeneous network graphs of multi-species data will be presented.
An update on the tools, methods and datasets available from the CATH protein structure classification database, including the latest developments on clustering functional families (FunFams) within hugely populated evolutionary superfamilies.
MISO is a laboratory information management system designed for eukaryotic sequencing operations, supporting many different -omic sequencing protocols, long and short reads, and microarrays. Prioritizing accuracy, simplified data entry, and metadata integrity, MISO incorporates a wide feature set useful for facilities to track their lab workflows in great detail.