Many thanks to all who attended the talk @jayjacobs & I gave at @Secure360 on Wednesday, May 15, 2013. As promised, here are the slides.
We’ve enumerated quite a bit of non-slide-but-in-presentation information that we wanted to aggregate into a blog post so you can vi[sz] along at home. If you need more of a guided path, I strongly encourage you to take a look at some of the free courses over at Coursera.
For starters, here’s a bit.ly bundle of data analysis & visualization bookmarks that @dseverski & I maintain. We’ve been doing (IMO) a pretty good job adding new resources as they come up and may have some duplicates to the ones below.
People Mentioned
- Stephen Few’s Perceptual Edge blog : Start from the beginning to learn from a giant in information visualization
- Andy Kirk’s Visualising Data blog (@visualisingdata) : Perhaps the quintessential leader in the modern visualization movement.
- Mike Bostock’s blog (@mbostock) : Creator of D3 and producer of amazing, interactive graphics for the @NYTimes
- Edward Tufte’s blog : The father of what we would now identify as our core visualization principles & practices.
- Nathan Yau’s Flowing Data blog : Making visualization accessible, practical and repeatable.
- Data Stories Podcast : Yes, you can learn much about data visualization from an audio podacst (@datastories)
- storytelling with data (@storywithdata) : Extremely practical blog by Cole Nussbaumer that will especially help folks “stuck” in Excel
- Jay’s blog
- My {this} blog
Tools Mentioned
- R : Jay & I probably use this a bit too much as a hammer (i.e. treat every data project as a nail) but it’s just far too flexible and powerful to not use as a go-to resource
- RStudio : An amazing IDE for R. I, personally, usually despise IDEs (yes, I even dislike Xcode), but RStudio truly improves workflow by several orders of magnitude. There are both desktop and server versions of it; the latter gives you the ability to setup a multi-user environment and use the IDE from practically anywhere you are. RStudio also makes generating reproducible research a joy with built-in easy access to tools like kintr.
- iPython : This version of Python takes an already amazing language and kicks it up a few notches. It brings it up to the level of R+RStudio, especially with it’s knitr-like iPython Notebooks for–again–reproducible research.
- SecViz : Security-centric Visualization Site & Tools by @raffaelmarty
- Mondrian : This tool needs far more visibility. It enables extremely quick visualization of even very large data sets. The interface takes a bit of getting used to, but it’s faster then typing R commands or fumbling in Excel.
- Tableau : This tool may be one of the most accessible, fast & flexible ways to explore data sets to get an idea of where you need to/can do further analysis.
- Processing : A tool that was designed from the ground up to help journalists create powerful, interactive data visualizations that you can slipstream directly onto the web via the Processing.js library.
- D3 : The foundation of modern, data-driven visualization on the web.
- Gephi : A very powerful tool when you need to explore networks & create beautiful, publication-worthy visualizations.
- MongoDB : NoSQL database that’s highly & easily scaleable without a steep learning curve.
- CRUSH Tools by Google : Kicks up your command-line data munging.



For those that wanted to play along at home, I’ve cleaned up the text and made the Wait Wait…Don’t Pwn Me! closing segment of SOURCE Boston 2013 



