NIH Optimizes 300,000 Author Manuscripts for Text Mining and Other Analytical Applications

TAG: NLM Data Mining

2015/12/03     Source: NLM

NIH-supported scientists have made over 300,000 author manuscripts available on PubMed Central (PMC) since 2008. Now, NIH is making these papers accessible to the public in a format that allows robust text analyses.
 
You can download the entire PMC collection of NIH-supported author manuscripts as a package in either XML or plain-text format. The collection will encompass all NIH manuscripts posted to PMC since July 2008. While the public can access the articles' full text and accompanying figures, tables, and multimedia on the PMC website, the newly available article packages include the full text only, in a form that facilitates text mining.
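
As a rough illustration of the kind of analysis a full-text XML package makes possible, the sketch below pulls paragraph text out of one article and counts term frequencies. The sample document and the element names (`article`, `body`, `p`) are simplified assumptions made for this example, not the actual schema or contents of the NIH package, and the helper names `body_text` and `term_counts` are hypothetical.

```python
import re
import xml.etree.ElementTree as ET
from collections import Counter

# Hypothetical, heavily simplified stand-in for one manuscript's XML.
# Real files in the package will be richer and may be structured differently.
SAMPLE_XML = """<article>
  <front><article-meta><title-group>
    <article-title>Example title</article-title>
  </title-group></article-meta></front>
  <body>
    <sec><title>Intro</title>
      <p>Text mining helps researchers. Text mining scales.</p>
    </sec>
  </body>
</article>"""


def body_text(xml_string):
    """Return the text of all <p> elements found under <body>."""
    root = ET.fromstring(xml_string)
    body = root.find("body")
    if body is None:
        return ""
    # itertext() also collects text nested inside inline markup within <p>.
    return " ".join(" ".join(p.itertext()) for p in body.iter("p"))


def term_counts(text):
    """Count lowercase alphabetic tokens in the given text."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


counts = term_counts(body_text(SAMPLE_XML))
print(counts.most_common(2))
```

The same `term_counts` step could be applied directly to the plain-text version of an article, skipping the XML parsing entirely.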