Spanning two decades, the Encyclopaedia of DNA Elements (ENCODE) is a collaborative research project that aims to identify all the functional elements in the human and mouse genomes. To best serve the scientific community, all data generated by the consortium is shared through a web portal (https://www.encodeproject.org/) with no access restrictions. The fourth and final phase of the project added a diverse set of new samples (including those associated with human disease) and a wide range of new assays aimed at the detection, characterization and validation of functional genomic elements. The ENCODE data portal hosts results from over 23,000 functional genomics experiments and over 800 functional element characterization experiments (including in vivo transgenic enhancer assays, reporter assays and CRISPR screens), along with over 60,000 results of computational and integrative analyses (including imputations, predictions and genome annotations). The ENCODE Data Coordination Center (DCC) is responsible for developing and maintaining the data portal, and for implementing and running the ENCODE uniform processing pipelines that generate uniformly processed data. Here we report recent updates to the data portal. Specifically, we have completely redesigned the home page, improved the search interface, added several new pages to highlight collections of biologically related data (deeply profiled cell lines, immune cells, Alzheimer's disease, RNA-protein interactions, degron matrix and a matrix of experiments organised by human donors), added single-cell experiments, and enhanced the cart interface for visualisation and download of user-selected datasets.