Working with the Clinton State Dept Email Dumps in R (Part 1: Graphs)

I put this together after experimenting with ggplot2 and ggnetwork earlier this week. The changes I made added svgPanZoom into the mix. Consequently, it has a widget in it, so it was just easier to embed the full R markdown HTML into an iframe than to try to extract the content piecemeal into WP.

You can bust the iframe via http://rud.is/projects/clinton_emails_01.html.

Read on to see how to grab some JSON, create edge list, do some basic graph stats with igraph and generate an interactive visualization with ggplot2 and svgPanZoom.

Buy on AmazonDDS Blog
DDS PodcastAmazon Author Page

5 Comments Working with the Clinton State Dept Email Dumps in R (Part 1: Graphs)

  1. Pingback: Working with the Clinton State Dept Email Dumps in R (Part 1: Graphs) – Mubashir Qasim

  2. ryan

    I believe the orphaned mini-graphs are indeed cases where Hillary is CC’d. See document C05791081 as an example.

    Reply
    1. hrbrmstr

      Aye. I’m leaning towards CC or BCC for them now. I have the PDFs and converted them to (cleaned) ASCII. Next post will be working with those and prbly a bit more graphs.

      Reply

Leave a Reply