

{"id":4968,"date":"2017-02-01T23:54:12","date_gmt":"2017-02-02T04:54:12","guid":{"rendered":"https:\/\/rud.is\/b\/?p=4968"},"modified":"2018-03-10T07:55:44","modified_gmt":"2018-03-10T12:55:44","slug":"exploring-news-coverage-with-newsflash","status":"publish","type":"post","link":"https:\/\/rud.is\/b\/2017\/02\/01\/exploring-news-coverage-with-newsflash\/","title":{"rendered":"Exploring News Coverage With newsflash"},"content":{"rendered":"<p>I was enthused to see a mention of <span class=\"removed_link\" title=\"https:\/\/www.washingtonpost.com\/news\/monkey-cage\/wp\/2017\/02\/01\/how-much-coverage-did-cnn-actually-devote-to-clintons-emails-heres-the-data\/\">this<\/span> on the <a href=\"https:\/\/blog.gdeltproject.org\/much-coverage-cnn-actually-devote-clintons-emails-heres-data\/\">GDELT blog<\/a> since I&#8217;ve been working on an <a href=\"https:\/\/github.com\/hrbrmstr\/newsflash\">R package<\/a> dubbed <code>newsflash<\/code> to work with the API that the form front-ends.<\/p>\n<p>Given the current climate, I feel compelled to note that I&#8217;m neither a Clinton supporter\/defender\/advocate nor a ? supporter\/defender\/advocate) in any way, shape or form. I&#8217;m only using the example for replication and I&#8217;m very glad the article author stayed (pretty much) non-partisan apart from some color commentary about the predictability of network coverage of certain topics.<\/p>\n<p>For now, the <code>newsflash<\/code> package is configured to grab raw count data, not the percent summaries since folks using R to grab this data probably want to do their own work with it. I used the following to try to replicate the author&#8217;s findings:<\/p>\n<pre id=\"nfclinton01\"><code class=\"language-r\">library(newsflash)\r\nlibrary(ggalt) # github version\r\nlibrary(hrbrmisc) # github only\r\nlibrary(tidyverse)\r\nstarts &lt;- seq(as.Date(&quot;2015-01-01&quot;), (as.Date(&quot;2017-01-26&quot;)-30), &quot;30 days&quot;)\r\nends &lt;- as.character(starts + 29)\r\nends[length(ends)] &lt;- &quot;&quot;\r\n\r\npb &lt;- progress_estimated(length(starts))\r\nemails &lt;- map2(starts, ends, function(x, y) {\r\n  pb$tick()$print()\r\n  query_tv(&quot;clinton&quot;, &quot;email,emails,server&quot;, timespan=&quot;custom&quot;, start_date=x, end_date=y)\r\n})\r\n\r\nclinton_timeline &lt;- map_df(emails, &quot;timeline&quot;)\r\n\r\nsum(clinton_timeline$value)\r\n## [1] 34778\r\n\r\ncount(clinton_timeline, station, wt=value, sort=TRUE) %&gt;%\r\n  mutate(pct=n\/sum(n), pct_lab=sprintf(&quot;%s (%s)&quot;, scales::comma(n), scales::percent(pct)),\r\n         station=factor(station, levels=rev(station))) -&gt; timeline_df\r\n\r\ntimeline_df\r\n\r\n## # A tibble: 7 \u00d7 4\r\n##             station     n         pct        pct_lab\r\n##              &lt;fctr&gt; &lt;int&gt;       &lt;dbl&gt;          &lt;chr&gt;\r\n## 1          FOX News 14807 0.425757663 14,807 (42.6%)\r\n## 2      FOX Business  7607 0.218730232  7,607 (21.9%)\r\n## 3               CNN  5434 0.156248203  5,434 (15.6%)\r\n## 4             MSNBC  4413 0.126890563  4,413 (12.7%)\r\n## 5 Aljazeera America  1234 0.035482201   1,234 (3.5%)\r\n## 6         Bloomberg   980 0.028178734     980 (2.8%)\r\n## 7              CNBC   303 0.008712404     303 (0.9%)<\/code><\/pre>\n<p>NOTE: I had to break up the queries since the bulk one across the two dates bump up against the API limits and may be providing helper functions for that before CRAN release.<\/p>\n<p><a href=\"https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2017\/02\/clinton_plot-1-1.png?ssl=1\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" data-attachment-id=\"4973\" data-permalink=\"https:\/\/rud.is\/b\/2017\/02\/01\/exploring-news-coverage-with-newsflash\/clinton_plot-1-2\/\" data-orig-file=\"https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2017\/02\/clinton_plot-1-1.png?fit=1536%2C768&amp;ssl=1\" data-orig-size=\"1536,768\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"clinton_plot-1\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2017\/02\/clinton_plot-1-1.png?fit=510%2C255&amp;ssl=1\" src=\"https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2017\/02\/clinton_plot-1-1.png?resize=510%2C255&#038;ssl=1\" alt=\"\" width=\"510\" height=\"255\" class=\"aligncenter size-full wp-image-4973\" \/><\/a><\/p>\n<p>While my package matches the total from the news article and sample query: 34,778 results my percentages are different since it&#8217;s the percentages across the raw counts for the included stations. &#8220;Percent of Sentences&#8221; (result &#8220;n&#8221; divided by the number of all sentences for each station in the time frame) &mdash; which the author used &mdash; seems to have some utility so I&#8217;ll probably add that as a query parameter or add a new function.<\/p>\n<h3>Tidy news text<\/h3>\n<p>The package also is designed to work with the <code>tidytext<\/code> package (it&#8217;s on CRAN) and provides a <code>top_text()<\/code> function which can return a <code>tidytext<\/code>-ready tibble or a plain character vector for use in other text processing packages. If you were curious as to whether this API has good data behind it, we can take a naive peek with the help of <code>tidytext<\/code>:<\/p>\n<pre id=\"nfclinton02\"><code class=\"language-r\">library(tidytext)\r\n\r\ntops &lt;- map_df(emails, top_text)\r\nanti_join(tops, stop_words) %&gt;% \r\n  filter(!(word %in% c(&quot;clinton&quot;, &quot;hillary&quot;, &quot;server&quot;, &quot;emails&quot;, &quot;mail&quot;, &quot;email&quot;,\r\n                       &quot;mails&quot;, &quot;secretary&quot;, &quot;clinton&#039;s&quot;, &quot;secretary&quot;))) %&gt;% \r\n  count(word, sort=TRUE) %&gt;% \r\n  print(n=20)\r\n\r\n## # A tibble: 26,861 \u00d7 2\r\n##             word     n\r\n##            &lt;chr&gt; &lt;int&gt;\r\n## 1        private 12683\r\n## 2     department  9262\r\n## 3            fbi  7250\r\n## 4       campaign  6790\r\n## 5     classified  6337\r\n## 6          trump  6228\r\n## 7    information  6147\r\n## 8  investigation  5111\r\n## 9         people  5029\r\n## 10          time  4739\r\n## 11      personal  4514\r\n## 12     president  4448\r\n## 13        donald  4011\r\n## 14    foundation  3972\r\n## 15          news  3918\r\n## 16     questions  3043\r\n## 17           top  2862\r\n## 18    government  2799\r\n## 19          bill  2698\r\n## 20      reporter  2684<\/code><\/pre>\n<p>I&#8217;d say the API is doing <em>just fine<\/em>.<\/p>\n<h3>Fin<\/h3>\n<p>The package also has some other bits from the API in it and if this has piqued your interest, please leave all package feature requests or problems as a github issue.<\/p>\n<p>Many thanks to the Internet Archive \/ GDELT for making this API possible. Data like this would be amazing in any time, but is almost invaluable now.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>I was enthused to see a mention of this on the GDELT blog since I&#8217;ve been working on an R package dubbed newsflash to work with the API that the form front-ends. Given the current climate, I feel compelled to note that I&#8217;m neither a Clinton supporter\/defender\/advocate nor a ? supporter\/defender\/advocate) in any way, shape [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":4973,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"activitypub_content_warning":"","activitypub_content_visibility":"","activitypub_max_image_attachments":3,"activitypub_interaction_policy_quote":"anyone","activitypub_status":"","footnotes":""},"categories":[91],"tags":[810],"class_list":["post-4968","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-r","tag-post"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.3 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Exploring News Coverage With newsflash - rud.is<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/rud.is\/b\/2017\/02\/01\/exploring-news-coverage-with-newsflash\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Exploring News Coverage With newsflash - rud.is\" \/>\n<meta property=\"og:description\" content=\"I was enthused to see a mention of this on the GDELT blog since I&#8217;ve been working on an R package dubbed newsflash to work with the API that the form front-ends. Given the current climate, I feel compelled to note that I&#8217;m neither a Clinton supporter\/defender\/advocate nor a ? supporter\/defender\/advocate) in any way, shape [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/rud.is\/b\/2017\/02\/01\/exploring-news-coverage-with-newsflash\/\" \/>\n<meta property=\"og:site_name\" content=\"rud.is\" \/>\n<meta property=\"article:published_time\" content=\"2017-02-02T04:54:12+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2018-03-10T12:55:44+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2017\/02\/clinton_plot-1-1.png?fit=1536%2C768&ssl=1\" \/>\n\t<meta property=\"og:image:width\" content=\"1536\" \/>\n\t<meta property=\"og:image:height\" content=\"768\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"hrbrmstr\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"hrbrmstr\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/rud.is\\\/b\\\/2017\\\/02\\\/01\\\/exploring-news-coverage-with-newsflash\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/rud.is\\\/b\\\/2017\\\/02\\\/01\\\/exploring-news-coverage-with-newsflash\\\/\"},\"author\":{\"name\":\"hrbrmstr\",\"@id\":\"https:\\\/\\\/rud.is\\\/b\\\/#\\\/schema\\\/person\\\/d7cb7487ab0527447f7fda5c423ff886\"},\"headline\":\"Exploring News Coverage With newsflash\",\"datePublished\":\"2017-02-02T04:54:12+00:00\",\"dateModified\":\"2018-03-10T12:55:44+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/rud.is\\\/b\\\/2017\\\/02\\\/01\\\/exploring-news-coverage-with-newsflash\\\/\"},\"wordCount\":393,\"commentCount\":3,\"publisher\":{\"@id\":\"https:\\\/\\\/rud.is\\\/b\\\/#\\\/schema\\\/person\\\/d7cb7487ab0527447f7fda5c423ff886\"},\"image\":{\"@id\":\"https:\\\/\\\/rud.is\\\/b\\\/2017\\\/02\\\/01\\\/exploring-news-coverage-with-newsflash\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/i0.wp.com\\\/rud.is\\\/b\\\/wp-content\\\/uploads\\\/2017\\\/02\\\/clinton_plot-1-1.png?fit=1536%2C768&ssl=1\",\"keywords\":[\"post\"],\"articleSection\":[\"R\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\\\/\\\/rud.is\\\/b\\\/2017\\\/02\\\/01\\\/exploring-news-coverage-with-newsflash\\\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/rud.is\\\/b\\\/2017\\\/02\\\/01\\\/exploring-news-coverage-with-newsflash\\\/\",\"url\":\"https:\\\/\\\/rud.is\\\/b\\\/2017\\\/02\\\/01\\\/exploring-news-coverage-with-newsflash\\\/\",\"name\":\"Exploring News Coverage With newsflash - rud.is\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/rud.is\\\/b\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/rud.is\\\/b\\\/2017\\\/02\\\/01\\\/exploring-news-coverage-with-newsflash\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/rud.is\\\/b\\\/2017\\\/02\\\/01\\\/exploring-news-coverage-with-newsflash\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/i0.wp.com\\\/rud.is\\\/b\\\/wp-content\\\/uploads\\\/2017\\\/02\\\/clinton_plot-1-1.png?fit=1536%2C768&ssl=1\",\"datePublished\":\"2017-02-02T04:54:12+00:00\",\"dateModified\":\"2018-03-10T12:55:44+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/rud.is\\\/b\\\/2017\\\/02\\\/01\\\/exploring-news-coverage-with-newsflash\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/rud.is\\\/b\\\/2017\\\/02\\\/01\\\/exploring-news-coverage-with-newsflash\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/rud.is\\\/b\\\/2017\\\/02\\\/01\\\/exploring-news-coverage-with-newsflash\\\/#primaryimage\",\"url\":\"https:\\\/\\\/i0.wp.com\\\/rud.is\\\/b\\\/wp-content\\\/uploads\\\/2017\\\/02\\\/clinton_plot-1-1.png?fit=1536%2C768&ssl=1\",\"contentUrl\":\"https:\\\/\\\/i0.wp.com\\\/rud.is\\\/b\\\/wp-content\\\/uploads\\\/2017\\\/02\\\/clinton_plot-1-1.png?fit=1536%2C768&ssl=1\",\"width\":1536,\"height\":768},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/rud.is\\\/b\\\/2017\\\/02\\\/01\\\/exploring-news-coverage-with-newsflash\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/rud.is\\\/b\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Exploring News Coverage With newsflash\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/rud.is\\\/b\\\/#website\",\"url\":\"https:\\\/\\\/rud.is\\\/b\\\/\",\"name\":\"rud.is\",\"description\":\"&quot;In God we trust. All others must bring data&quot;\",\"publisher\":{\"@id\":\"https:\\\/\\\/rud.is\\\/b\\\/#\\\/schema\\\/person\\\/d7cb7487ab0527447f7fda5c423ff886\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/rud.is\\\/b\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":[\"Person\",\"Organization\"],\"@id\":\"https:\\\/\\\/rud.is\\\/b\\\/#\\\/schema\\\/person\\\/d7cb7487ab0527447f7fda5c423ff886\",\"name\":\"hrbrmstr\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/i0.wp.com\\\/rud.is\\\/b\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/ukr-shield.png?fit=460%2C460&ssl=1\",\"url\":\"https:\\\/\\\/i0.wp.com\\\/rud.is\\\/b\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/ukr-shield.png?fit=460%2C460&ssl=1\",\"contentUrl\":\"https:\\\/\\\/i0.wp.com\\\/rud.is\\\/b\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/ukr-shield.png?fit=460%2C460&ssl=1\",\"width\":460,\"height\":460,\"caption\":\"hrbrmstr\"},\"logo\":{\"@id\":\"https:\\\/\\\/i0.wp.com\\\/rud.is\\\/b\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/ukr-shield.png?fit=460%2C460&ssl=1\"},\"description\":\"Don't look at me\u2026I do what he does \u2014 just slower. #rstats avuncular \u2022 ?Resistance Fighter \u2022 Cook \u2022 Christian \u2022 [Master] Chef des Donn\u00e9es de S\u00e9curit\u00e9 @ @rapid7\",\"sameAs\":[\"http:\\\/\\\/rud.is\"],\"url\":\"https:\\\/\\\/rud.is\\\/b\\\/author\\\/hrbrmstr\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Exploring News Coverage With newsflash - rud.is","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/rud.is\/b\/2017\/02\/01\/exploring-news-coverage-with-newsflash\/","og_locale":"en_US","og_type":"article","og_title":"Exploring News Coverage With newsflash - rud.is","og_description":"I was enthused to see a mention of this on the GDELT blog since I&#8217;ve been working on an R package dubbed newsflash to work with the API that the form front-ends. Given the current climate, I feel compelled to note that I&#8217;m neither a Clinton supporter\/defender\/advocate nor a ? supporter\/defender\/advocate) in any way, shape [&hellip;]","og_url":"https:\/\/rud.is\/b\/2017\/02\/01\/exploring-news-coverage-with-newsflash\/","og_site_name":"rud.is","article_published_time":"2017-02-02T04:54:12+00:00","article_modified_time":"2018-03-10T12:55:44+00:00","og_image":[{"width":1536,"height":768,"url":"https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2017\/02\/clinton_plot-1-1.png?fit=1536%2C768&ssl=1","type":"image\/png"}],"author":"hrbrmstr","twitter_card":"summary_large_image","twitter_misc":{"Written by":"hrbrmstr","Est. reading time":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/rud.is\/b\/2017\/02\/01\/exploring-news-coverage-with-newsflash\/#article","isPartOf":{"@id":"https:\/\/rud.is\/b\/2017\/02\/01\/exploring-news-coverage-with-newsflash\/"},"author":{"name":"hrbrmstr","@id":"https:\/\/rud.is\/b\/#\/schema\/person\/d7cb7487ab0527447f7fda5c423ff886"},"headline":"Exploring News Coverage With newsflash","datePublished":"2017-02-02T04:54:12+00:00","dateModified":"2018-03-10T12:55:44+00:00","mainEntityOfPage":{"@id":"https:\/\/rud.is\/b\/2017\/02\/01\/exploring-news-coverage-with-newsflash\/"},"wordCount":393,"commentCount":3,"publisher":{"@id":"https:\/\/rud.is\/b\/#\/schema\/person\/d7cb7487ab0527447f7fda5c423ff886"},"image":{"@id":"https:\/\/rud.is\/b\/2017\/02\/01\/exploring-news-coverage-with-newsflash\/#primaryimage"},"thumbnailUrl":"https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2017\/02\/clinton_plot-1-1.png?fit=1536%2C768&ssl=1","keywords":["post"],"articleSection":["R"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/rud.is\/b\/2017\/02\/01\/exploring-news-coverage-with-newsflash\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/rud.is\/b\/2017\/02\/01\/exploring-news-coverage-with-newsflash\/","url":"https:\/\/rud.is\/b\/2017\/02\/01\/exploring-news-coverage-with-newsflash\/","name":"Exploring News Coverage With newsflash - rud.is","isPartOf":{"@id":"https:\/\/rud.is\/b\/#website"},"primaryImageOfPage":{"@id":"https:\/\/rud.is\/b\/2017\/02\/01\/exploring-news-coverage-with-newsflash\/#primaryimage"},"image":{"@id":"https:\/\/rud.is\/b\/2017\/02\/01\/exploring-news-coverage-with-newsflash\/#primaryimage"},"thumbnailUrl":"https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2017\/02\/clinton_plot-1-1.png?fit=1536%2C768&ssl=1","datePublished":"2017-02-02T04:54:12+00:00","dateModified":"2018-03-10T12:55:44+00:00","breadcrumb":{"@id":"https:\/\/rud.is\/b\/2017\/02\/01\/exploring-news-coverage-with-newsflash\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/rud.is\/b\/2017\/02\/01\/exploring-news-coverage-with-newsflash\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/rud.is\/b\/2017\/02\/01\/exploring-news-coverage-with-newsflash\/#primaryimage","url":"https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2017\/02\/clinton_plot-1-1.png?fit=1536%2C768&ssl=1","contentUrl":"https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2017\/02\/clinton_plot-1-1.png?fit=1536%2C768&ssl=1","width":1536,"height":768},{"@type":"BreadcrumbList","@id":"https:\/\/rud.is\/b\/2017\/02\/01\/exploring-news-coverage-with-newsflash\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/rud.is\/b\/"},{"@type":"ListItem","position":2,"name":"Exploring News Coverage With newsflash"}]},{"@type":"WebSite","@id":"https:\/\/rud.is\/b\/#website","url":"https:\/\/rud.is\/b\/","name":"rud.is","description":"&quot;In God we trust. All others must bring data&quot;","publisher":{"@id":"https:\/\/rud.is\/b\/#\/schema\/person\/d7cb7487ab0527447f7fda5c423ff886"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/rud.is\/b\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":["Person","Organization"],"@id":"https:\/\/rud.is\/b\/#\/schema\/person\/d7cb7487ab0527447f7fda5c423ff886","name":"hrbrmstr","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2023\/10\/ukr-shield.png?fit=460%2C460&ssl=1","url":"https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2023\/10\/ukr-shield.png?fit=460%2C460&ssl=1","contentUrl":"https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2023\/10\/ukr-shield.png?fit=460%2C460&ssl=1","width":460,"height":460,"caption":"hrbrmstr"},"logo":{"@id":"https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2023\/10\/ukr-shield.png?fit=460%2C460&ssl=1"},"description":"Don't look at me\u2026I do what he does \u2014 just slower. #rstats avuncular \u2022 ?Resistance Fighter \u2022 Cook \u2022 Christian \u2022 [Master] Chef des Donn\u00e9es de S\u00e9curit\u00e9 @ @rapid7","sameAs":["http:\/\/rud.is"],"url":"https:\/\/rud.is\/b\/author\/hrbrmstr\/"}]}},"jetpack_featured_media_url":"https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2017\/02\/clinton_plot-1-1.png?fit=1536%2C768&ssl=1","jetpack_shortlink":"https:\/\/wp.me\/p23idr-1i8","jetpack_likes_enabled":true,"jetpack-related-posts":[{"id":6558,"url":"https:\/\/rud.is\/b\/2017\/10\/01\/retrieve-process-tv-news-chyrons-with-newsflash\/","url_meta":{"origin":4968,"position":0},"title":"Retrieve &#038; process TV News chyrons with newsflash","author":"hrbrmstr","date":"2017-10-01","format":false,"excerpt":"The Internet Archive recently announced a new service they've dubbed 'Third Eye'. This service scrapes the chyrons that annoyingly scroll across the bottom-third of TV news broadcasts. IA has a vast historical archive of TV news that they'll eventually process, but --- for now --- the more recent broadcasts from\u2026","rel":"","context":"In &quot;R&quot;","block_context":{"text":"R","link":"https:\/\/rud.is\/b\/category\/r\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2017\/10\/chy01.png?fit=1200%2C594&ssl=1&resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2017\/10\/chy01.png?fit=1200%2C594&ssl=1&resize=350%2C200 1x, https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2017\/10\/chy01.png?fit=1200%2C594&ssl=1&resize=525%2C300 1.5x, https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2017\/10\/chy01.png?fit=1200%2C594&ssl=1&resize=700%2C400 2x, https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2017\/10\/chy01.png?fit=1200%2C594&ssl=1&resize=1050%2C600 3x"},"classes":[]},{"id":6226,"url":"https:\/\/rud.is\/b\/2017\/09\/09\/teasing-out-top-daily-topics-with-gdelts-television-explorer\/","url_meta":{"origin":4968,"position":1},"title":"Teasing Out Top Daily Topics with GDELT&#8217;s Television Explorer","author":"hrbrmstr","date":"2017-09-09","format":false,"excerpt":"Earlier this year, the GDELT Project released their Television Explorer that enabled API access to closed-caption tedt from television news broadcasts. They've done an incredible job expanding and stabilizing the API and just recently released \"top trending tables\" which summarise what the \"top\" topics and phrases are across news stations\u2026","rel":"","context":"In &quot;R&quot;","block_context":{"text":"R","link":"https:\/\/rud.is\/b\/category\/r\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2017\/09\/Plot_Zoom.png?fit=1173%2C1200&ssl=1&resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2017\/09\/Plot_Zoom.png?fit=1173%2C1200&ssl=1&resize=350%2C200 1x, https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2017\/09\/Plot_Zoom.png?fit=1173%2C1200&ssl=1&resize=525%2C300 1.5x, https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2017\/09\/Plot_Zoom.png?fit=1173%2C1200&ssl=1&resize=700%2C400 2x, https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2017\/09\/Plot_Zoom.png?fit=1173%2C1200&ssl=1&resize=1050%2C600 3x"},"classes":[]},{"id":11911,"url":"https:\/\/rud.is\/b\/2019\/02\/14\/using-the-ropendata-r-package-to-access-petabytes-of-free-internet-telemetry-data-from-rapid7\/","url_meta":{"origin":4968,"position":2},"title":"Using the ropendata R Package to Access Petabytes of Free Internet Telemetry Data from Rapid7","author":"hrbrmstr","date":"2019-02-14","format":false,"excerpt":"I've got a post up over at $DAYJOB's blog on using the ropendata? package to access the ginormous and ever-increasing amount of internet telemetry (scan) data via the Rapid7 Open Data API. It's super-R-code-heavy but renders surprisingly well in Ghost (the blogging platform we use at work) and covers everything\u2026","rel":"","context":"In &quot;R&quot;","block_context":{"text":"R","link":"https:\/\/rud.is\/b\/category\/r\/"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":4622,"url":"https:\/\/rud.is\/b\/2016\/08\/08\/counting-u-s-expatriation-with-r-a-k-a-a-decade-of-desertion\/","url_meta":{"origin":4968,"position":3},"title":"Counting [U.S.] Expatriation with R (a.k.a. a Decade of Desertion)","author":"hrbrmstr","date":"2016-08-08","format":false,"excerpt":"2016-08-13 UPDATE: Fortune has a story on this and it does seem to be tax-related vs ideology. @thosjleeper suggested something similar as well about a week ago. If you're even remotely following the super insane U.S. 2016 POTUS circus election you've no doubt seen a resurgence of _\"if X gets\u2026","rel":"","context":"In &quot;R&quot;","block_context":{"text":"R","link":"https:\/\/rud.is\/b\/category\/r\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2016\/08\/RStudio-1.png?fit=1200%2C669&ssl=1&resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2016\/08\/RStudio-1.png?fit=1200%2C669&ssl=1&resize=350%2C200 1x, https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2016\/08\/RStudio-1.png?fit=1200%2C669&ssl=1&resize=525%2C300 1.5x, https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2016\/08\/RStudio-1.png?fit=1200%2C669&ssl=1&resize=700%2C400 2x, https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2016\/08\/RStudio-1.png?fit=1200%2C669&ssl=1&resize=1050%2C600 3x"},"classes":[]},{"id":4942,"url":"https:\/\/rud.is\/b\/2017\/01\/26\/one-view-of-the-impact-of-the-new-immigration-ban-freeing-pdf-data-with-tabulizer\/","url_meta":{"origin":4968,"position":4},"title":"One View of the Impact of the New Immigration Ban (+ freeing PDF data with tabulizer)","author":"hrbrmstr","date":"2017-01-26","format":false,"excerpt":"Dear Leader has made good on his campaign promise to \"crack down\" on immigration from \"dangerous\" countries. I wanted to both see one side of the impact of that decree \u2014 how many potential immigrants per year might this be impacting \u2014 and show toss up some code that shows\u2026","rel":"","context":"In &quot;R&quot;","block_context":{"text":"R","link":"https:\/\/rud.is\/b\/category\/r\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2017\/01\/RStudio.png?fit=1200%2C530&ssl=1&resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2017\/01\/RStudio.png?fit=1200%2C530&ssl=1&resize=350%2C200 1x, https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2017\/01\/RStudio.png?fit=1200%2C530&ssl=1&resize=525%2C300 1.5x, https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2017\/01\/RStudio.png?fit=1200%2C530&ssl=1&resize=700%2C400 2x, https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2017\/01\/RStudio.png?fit=1200%2C530&ssl=1&resize=1050%2C600 3x"},"classes":[]},{"id":7803,"url":"https:\/\/rud.is\/b\/2018\/01\/15\/cant-stop-at-21-twitter-recipe-22-tying-up-loose-threads\/","url_meta":{"origin":4968,"position":5},"title":"Can&#8217;t Stop at 21: Twitter Recipe #22 \u2014 Tying Up Loose Threads","author":"hrbrmstr","date":"2018-01-15","format":false,"excerpt":"NOTE: The likelihood of this recipe being added to the recent practice bookdown book is slim, but I'll try to keep the same format for the blog post. Problem You want to collect all the tweets in a Twitter tweet thread Solution Use a few key functions in rtweet to\u2026","rel":"","context":"In &quot;R&quot;","block_context":{"text":"R","link":"https:\/\/rud.is\/b\/category\/r\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2018\/01\/Screen-Shot-2018-01-15-at-3.01.13-PM.png?fit=962%2C1200&ssl=1&resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2018\/01\/Screen-Shot-2018-01-15-at-3.01.13-PM.png?fit=962%2C1200&ssl=1&resize=350%2C200 1x, https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2018\/01\/Screen-Shot-2018-01-15-at-3.01.13-PM.png?fit=962%2C1200&ssl=1&resize=525%2C300 1.5x, https:\/\/i0.wp.com\/rud.is\/b\/wp-content\/uploads\/2018\/01\/Screen-Shot-2018-01-15-at-3.01.13-PM.png?fit=962%2C1200&ssl=1&resize=700%2C400 2x"},"classes":[]}],"jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/rud.is\/b\/wp-json\/wp\/v2\/posts\/4968","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/rud.is\/b\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/rud.is\/b\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/rud.is\/b\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/rud.is\/b\/wp-json\/wp\/v2\/comments?post=4968"}],"version-history":[{"count":0,"href":"https:\/\/rud.is\/b\/wp-json\/wp\/v2\/posts\/4968\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/rud.is\/b\/wp-json\/wp\/v2\/media\/4973"}],"wp:attachment":[{"href":"https:\/\/rud.is\/b\/wp-json\/wp\/v2\/media?parent=4968"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/rud.is\/b\/wp-json\/wp\/v2\/categories?post=4968"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/rud.is\/b\/wp-json\/wp\/v2\/tags?post=4968"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}