OTMI Repository

From OpenTextMining

Jump to: navigation, search

The OTMI repository (on http://www.nature.com/) currently hosts 2 years (2005, 2006) worth of content for 5 journals:

  • Nature (nature)
  • Nature Genetics (ng)
  • Nature Reviews Drug Discovery (nrd)
  • Nature Structural & Molecular Biology (nsmb)
  • The Pharmacogenomics Journal (tpj)

Directories of available OTMI files are provided using OPML files.

OTMI files are available at issue or article levels:



Note 1: If you are using a command-line tool such as curl or wget you may need to add the following option to preserve the compressed file:

 --header 'Accept-Encoding: compress, gzip'

e.g.

 % curl --header 'Accept-Encoding: compress, gzip' 'http://www.nature.com/tpj/journal/v5/n1/otmi/otmi-contents.tar.gz'

and likewise for the MD5 digest

 % curl --header 'Accept-Encoding: compress, gzip' 'http://www.nature.com/tpj/journal/v5/n1/otmi/otmi-contents.tar.gz.md5'



Note 2: MD5 digests are provided using the md5sum utility. To check the MD5 digests issue the following command which should give the response shown:

 % md5sum -c otmi-contents.tar.gz.md5
 otmi-contents.tar.gz.md5: OK

For some information on using md5sum see e.g. the article Using MD5SUM to Validate the Integrity of (Downloaded) Files.

Personal tools