OpenEDGAR is an open-source Python framework designed to rapidly construct research databases based on the Electronic Data Gathering, Analysis, and Retrieval (EDGAR) system operated by the US Securities and Exchange Commission (SEC). OpenEDGAR is built on the Django application framework, supports distributed computation across one or more servers, and includes functionality to (i) retrieve and parse index and filing data from EDGAR, (ii) build tables for key metadata such as form type and filer, (iii) retrieve, parse, and update CIK-to-ticker and industry mappings, (iv) extract content and metadata from filing documents, and (v) search filing document contents. OpenEDGAR is designed for use in both academic research and industrial applications, and is distributed under the MIT License at https://github.com/LexPredict/openedgar.
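To make item (i) concrete, the following is a minimal sketch of parsing an EDGAR index file. It is not OpenEDGAR's own API. It assumes the pipe-delimited "master" index layout published by EDGAR (CIK, company name, form type, date filed, file name), and the function name `parse_master_index`, the field names, and the sample rows are hypothetical and made up for illustration.

```python
from collections import Counter
from typing import Dict, Iterator

# Hypothetical field names for the five pipe-delimited columns.
FIELDS = ("cik", "company_name", "form_type", "date_filed", "file_name")


def parse_master_index(text: str) -> Iterator[Dict[str, str]]:
    """Yield one dict per data row of a pipe-delimited master index.

    Assumes the data rows follow a separator line made of dashes.
    """
    in_body = False
    for line in text.splitlines():
        if not in_body:
            # Skip the descriptive header until the dashed separator.
            if line.startswith("-----"):
                in_body = True
            continue
        parts = line.strip().split("|")
        if len(parts) != len(FIELDS):
            # Ignore blank or malformed rows rather than guessing.
            continue
        yield dict(zip(FIELDS, parts))


# Made-up sample in the assumed layout; not real filings.
SAMPLE_INDEX = """\
Description: Master Index of EDGAR Dissemination Feed

CIK|Company Name|Form Type|Date Filed|Filename
--------------------------------------------------------------------------------
1000001|EXAMPLE HOLDINGS INC|10-K|2018-02-09|edgar/data/1000001/0000000000-18-000001.txt
1000002|SAMPLE INDUSTRIES CORP|8-K|2018-02-12|edgar/data/1000002/0000000000-18-000002.txt
1000003|DEMO FINANCIAL LLC|10-K|2018-03-01|edgar/data/1000003/0000000000-18-000003.txt
"""

records = list(parse_master_index(SAMPLE_INDEX))

# A simple form-type tally, in the spirit of the metadata tables in item (ii).
form_counts = Counter(record["form_type"] for record in records)

for form_type, count in sorted(form_counts.items()):
    print(form_type, count)
```

In a research database, records like these would typically be stored in tables keyed by filer and form type. The sketch keeps them in memory and works on an inline string so that it runs without network access.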