{"id":522,"date":"2014-02-17T23:11:45","date_gmt":"2014-02-17T18:11:45","guid":{"rendered":"http:\/\/blogs.nd.edu\/emorgan\/?p=522"},"modified":"2014-04-15T20:46:47","modified_gmt":"2014-04-15T15:46:47","slug":"prospect","status":"publish","type":"post","link":"https:\/\/sites.nd.edu\/emorgan\/2014\/02\/prospect\/","title":{"rendered":"CrossRef&#8217;s Prospect API"},"content":{"rendered":"<p>\nThis is the tiniest of blog postings outlining my experiences with a fledgling <a href=\"https:\/\/prospect.crossref.org\" target=\"_blank\">API called Prospect<\/a>.\n<\/p>\n<p>\nProspect is an API being developed by CrossRef. I learned about it through both word-of-mouth as well as a blog posting by <strong>Eileen Clancy<\/strong> called &#8220;<a href=\"https:\/\/dhpraxisf13.commons.gc.cuny.edu\/2014\/01\/15\/easy-access-to-data-for-research-text-mining\/\" target=\"_blank\">Easy access to data for text mining<\/a>&#8220;. In a nutshell, given a CrossRef DOI via content negotiation, the API will return both the DOI&#8217;s bibliographic information as well as URL(s) pointing to the location of full text instances of the article. The purpose of the API is to provide a straight-forward method for acquiring full text content without the need for screen scraping.\n<\/p>\n<p>\nI wrote a simple, almost <a href=\"http:\/\/dh.crc.nd.edu\/sandbox\/prospect\/extractor.pl\" target=\"_blank\">brain-deal Perl subroutine<\/a> implementing the API. For a good time, I put the subroutine into action in a <a href=\"http:\/\/dh.crc.nd.edu\/sandbox\/prospect\/search.cgi\" target=\"_blank\">CGI script<\/a>. Enter a simple query, and the script will search CrossRef for full text articles, and return a list of no more than five (5) titles and their associated URL&#8217;s where you can get them in a number of formats.\n<\/p>\n<p><img src='http:\/\/blogs.nd.edu\/emorgan\/files\/2014\/02\/screen-shot.jpg' alt='screen shot' \/><br \/>\nscreen shot of CrossRef Prospect API in action<\/p>\n<p>\nThe API is pretty straight-forward, but the URLs pointing to the full text are stuffed into a &#8220;Links&#8221; HTTP header, and the value of the header is not as easily parseable as one might desire. Still, this can be put to good use in my slowly growing stock of text mining tools. Get DOI. Feed to one of my tools. Get data. Do analysis.\n<\/p>\n<p>\nFun with HTTP.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>This is the tiniest of blog postings outlining my experiences with a fledgling API called Prospect. Prospect is an API being developed by CrossRef. I learned about it through both word-of-mouth as well as a blog posting by Eileen Clancy called &#8220;Easy access to data for text mining&#8220;. In a nutshell, given a CrossRef DOI [&hellip;]<\/p>\n","protected":false},"author":92,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-522","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/sites.nd.edu\/emorgan\/wp-json\/wp\/v2\/posts\/522","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/sites.nd.edu\/emorgan\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/sites.nd.edu\/emorgan\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/sites.nd.edu\/emorgan\/wp-json\/wp\/v2\/users\/92"}],"replies":[{"embeddable":true,"href":"https:\/\/sites.nd.edu\/emorgan\/wp-json\/wp\/v2\/comments?post=522"}],"version-history":[{"count":3,"href":"https:\/\/sites.nd.edu\/emorgan\/wp-json\/wp\/v2\/posts\/522\/revisions"}],"predecessor-version":[{"id":526,"href":"https:\/\/sites.nd.edu\/emorgan\/wp-json\/wp\/v2\/posts\/522\/revisions\/526"}],"wp:attachment":[{"href":"https:\/\/sites.nd.edu\/emorgan\/wp-json\/wp\/v2\/media?parent=522"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/sites.nd.edu\/emorgan\/wp-json\/wp\/v2\/categories?post=522"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/sites.nd.edu\/emorgan\/wp-json\/wp\/v2\/tags?post=522"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}