{"id":1038,"date":"2021-12-06T00:46:37","date_gmt":"2021-12-05T19:46:37","guid":{"rendered":"https:\/\/sites.nd.edu\/emorgan\/?p=1038"},"modified":"2021-12-06T00:46:39","modified_gmt":"2021-12-05T19:46:39","slug":"the-distant-reader-an-update","status":"publish","type":"post","link":"https:\/\/sites.nd.edu\/emorgan\/2021\/12\/the-distant-reader-an-update\/","title":{"rendered":"The Distant Reader: An update"},"content":{"rendered":"<p><img decoding=\"async\" align=\"center\" alt=\"logo\" src=\"https:\/\/mcusercontent.com\/9f921d14ece40484d800d5f70\/images\/f86331d5-879a-bbe6-20da-f5ff4e1da493.jpg\" width=\"564\" style=\"max-width: 1322px;padding-bottom: 0;vertical-align: bottom;border: 0;height: auto;text-decoration: none\" class=\"mcnImage\"><\/p>\n<p><font face=\"helvetica neue, helvetica, arial, verdana, sans-serif\"><span style=\"font-size:14px\">A number of things have been happening with the <a href=\"https:\/\/distantreader.org\" target=\"_blank\" style=\"color: #007C89;font-weight: normal;text-decoration: underline\" rel=\"noopener\">Distant Reader<\/a>, and I hope to share them here.<\/span><\/font><\/p>\n<ol>\n<li><strong style=\"font-family:helvetica neue,helvetica,arial,verdana,sans-serif;font-size:14px\">Reader in use<\/strong><font face=\"helvetica neue, helvetica, arial, verdana, sans-serif\"><span style=\"font-size:14px\"> &#8211; The Reader is currently being used in a number of ways. For example, it has become a part of a data science class here at Notre Dame. It is being used in project to predict possible violent attacks. I use it on a regular basis and it has helped me understand: 1) the role of small farmers in developing countries, 2) how the Psalms have evolved over time, 3) the degree I can summarize thousands of medical documents, and 4) the similarities and difference between the novels of Jane Austen.<\/span><\/font><\/li>\n<li><strong style=\"font-family:helvetica neue,helvetica,arial,verdana,sans-serif;font-size:14px\">Bibliography<\/strong><font face=\"helvetica neue, helvetica, arial, verdana, sans-serif\"><span style=\"font-size:14px\"> &#8211; Over the past few years, I have written a number of<\/span><\/font><a href=\"https:\/\/bit.ly\/3GclL87\" style=\", helvetica, arial, verdana, sans-serif;font-size: 14px;color: #007C89;font-weight: normal;text-decoration: underline\" target=\"_blank\" rel=\"noopener\"> blog postings describing the Reader<\/a><font face=\"helvetica neue, helvetica, arial, verdana, sans-serif\"><span style=\"font-size:14px\">, and it is linked here in the hopes of providing you with more meaningful messages about: 1) what the Reader is, 2) what it is designed to do, and 3) how to use it.<\/span><\/font><\/li>\n<li><strong style=\"font-family:helvetica neue,helvetica,arial,verdana,sans-serif;font-size:14px\">Reader Toolbox<\/strong><font face=\"helvetica neue, helvetica, arial, verdana, sans-serif\"><span style=\"font-size:14px\"> &#8211; The Distant Reader takes sets of unstructured data as input, applies various text mining techniques against it, and outputs a set of structured data intended for analysis &#8212; &#8220;reading&#8221;. These data sets, called &#8220;study carrels&#8221;, are very amenable to computer processing. Consequently I have developed a thing called the <\/span><\/font><a href=\"https:\/\/bit.ly\/3GfDYSz\" style=\", helvetica, arial, verdana, sans-serif;font-size: 14px;color: #007C89;font-weight: normal;text-decoration: underline\" target=\"_blank\" rel=\"noopener\">Reader Toolbox<\/a><font face=\"helvetica neue, helvetica, arial, verdana, sans-serif\"><span style=\"font-size:14px\"> which makes it easy to do all sorts of feature extraction (ngrams, parts-of-speech, named entities, URLs, etc.), topic modeling, semantic indexing, and full text indexing against study carrels. Give it a try!<\/span><\/font><\/li>\n<li><strong style=\"font-family:helvetica neue,helvetica,arial,verdana,sans-serif;font-size:14px\">Reader Library<\/strong><font face=\"helvetica neue, helvetica, arial, verdana, sans-serif\"><span style=\"font-size:14px\"> &#8211; A fledgling <\/span><\/font><a href=\"https:\/\/bit.ly\/32WZ6yy\" style=\", helvetica, arial, verdana, sans-serif;font-size: 14px;color: #007C89;font-weight: normal;text-decoration: underline\" target=\"_blank\" rel=\"noopener\">collection of previously created study carrels<\/a><font face=\"helvetica neue, helvetica, arial, verdana, sans-serif\"><span style=\"font-size:14px\"> is in the process of being curated. The collection includes about 3,000 carrels on topics ranging from big ideas (love, honor, truth, justice, etc.) to COVID. The content of the carrels comes from places like Project Gutenberg, the HathiTrust, the &#8216;Net in general, and a data set called CORD-19. In the coming months I hope to create various indexes against the collection. Right now the iterface is functional but pretty raw.<\/span><\/font><\/li>\n<li><strong style=\"font-family:helvetica neue,helvetica,arial,verdana,sans-serif;font-size:14px\">Sponsorships<\/strong><font face=\"helvetica neue, helvetica, arial, verdana, sans-serif\"><span style=\"font-size:14px\"> &#8211; The Reader has been supported by a number of groups over the past few years. Most recently, support has come from <\/span><\/font><em style=\"font-family:helvetica neue,helvetica,arial,verdana,sans-serif;font-size:14px\">Microsoft&#8217;s AI for Health<\/em><font face=\"helvetica neue, helvetica, arial, verdana, sans-serif\"><span style=\"font-size:14px\"> initiative as well as the <\/span><\/font><em style=\"font-family:helvetica neue,helvetica,arial,verdana,sans-serif;font-size:14px\">Pittsburgh Supercomputer Center<\/em><font face=\"helvetica neue, helvetica, arial, verdana, sans-serif\"><span style=\"font-size:14px\"> through an organization called <\/span><\/font><em style=\"font-family:helvetica neue,helvetica,arial,verdana,sans-serif;font-size:14px\">XSEDE<\/em><font face=\"helvetica neue, helvetica, arial, verdana, sans-serif\"><span style=\"font-size:14px\">. All good things come to an end, and this&nbsp;will soon be true of their support. <em>Thank you very much!<\/em> I will be looking for a new home for the Distant Reader. Got any ideas?<\/span><\/font><\/li>\n<li><strong style=\"font-family:helvetica neue,helvetica,arial,verdana,sans-serif;font-size:14px\">Just for fun<\/strong><font face=\"helvetica neue, helvetica, arial, verdana, sans-serif\"><span style=\"font-size:14px\"> &#8211; Lastly, you might want to listen to a podcast. Excellently produced by the folks at Lost In The Stacks, it is a <\/span><\/font><a href=\"https:\/\/bit.ly\/31Ll4S3\" style=\", helvetica, arial, verdana, sans-serif;font-size: 14px;color: #007C89;font-weight: normal;text-decoration: underline\" target=\"_blank\" rel=\"noopener\">humorous interview<\/a><font face=\"helvetica neue, helvetica, arial, verdana, sans-serif\"><span style=\"font-size:14px\"> where I describe the Reader.<\/span><\/font><\/li>\n<\/ol>\n<p><font face=\"helvetica neue, helvetica, arial, verdana, sans-serif\"><span style=\"font-size:14px\">Thank you for&#8230; reading.<\/span><\/font><\/p>\n","protected":false},"excerpt":{"rendered":"<p>A number of things have been happening with the Distant Reader, and I hope to share them here. Reader in use &#8211; The Reader is currently being used in a number of ways. For example, it has become a part of a data science class here at Notre Dame. It is being used in project [&hellip;]<\/p>\n","protected":false},"author":92,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[10716],"tags":[],"class_list":["post-1038","post","type-post","status-publish","format-standard","hentry","category-distant-reader"],"_links":{"self":[{"href":"https:\/\/sites.nd.edu\/emorgan\/wp-json\/wp\/v2\/posts\/1038","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/sites.nd.edu\/emorgan\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/sites.nd.edu\/emorgan\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/sites.nd.edu\/emorgan\/wp-json\/wp\/v2\/users\/92"}],"replies":[{"embeddable":true,"href":"https:\/\/sites.nd.edu\/emorgan\/wp-json\/wp\/v2\/comments?post=1038"}],"version-history":[{"count":12,"href":"https:\/\/sites.nd.edu\/emorgan\/wp-json\/wp\/v2\/posts\/1038\/revisions"}],"predecessor-version":[{"id":1050,"href":"https:\/\/sites.nd.edu\/emorgan\/wp-json\/wp\/v2\/posts\/1038\/revisions\/1050"}],"wp:attachment":[{"href":"https:\/\/sites.nd.edu\/emorgan\/wp-json\/wp\/v2\/media?parent=1038"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/sites.nd.edu\/emorgan\/wp-json\/wp\/v2\/categories?post=1038"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/sites.nd.edu\/emorgan\/wp-json\/wp\/v2\/tags?post=1038"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}