Confessions of a Data Junkie: Fetching, Parsing, and Visualization

The video titled "Confessions of a Data Junkie: Fetching, Parsing, and Visualization" features Bobby Wilson discussing his passion for data visualization, emphasizing the process of creating meaningful insights from complex data sets. Wilson, who works at a startup named Next, shares his journey in analyzing data sourced from government repositories and code repositories.

Key points discussed in the presentation include:

- Interest in Data Visualization: Bobby highlights the growing importance of data visualization in technology, driven by the abundance of available data.

- Data Sources and Tools: He mentions various tools and libraries, including Grit for interacting with GitHub repositories, and pointed out the challenges of dealing with sparse and unstructured data, particularly from government sources.

- Data Persistence: Wilson emphasizes the need for a persistence layer in data analysis, detailing his use of MongoDB and JSON for easy data management and manipulation of commit history from code repositories.

- Simple Queries: He breaks down the process of data analysis into manageable queries, illustrating how to extract useful insights from repositories with simple MongoDB commands.

- Visualization Libraries: Various visualization libraries are discussed, notably Raphaël and Protovis, with a focus on the syntax and efficiency of creating engaging visualizations. He even introduces RubyViz, which integrates Protovis within Ruby for generating server-side SVG visualizations.

- Commit Analysis: Wilson demonstrates how to analyze commit data over time, showcasing trends in codebase growth and contributor activity through visual representations like bar charts and commit maps.

- Engagement with the Audience: He connects with the audience by contrasting the appeal of visualizations with the complexities of raw government data, making a case for visualization as a means of clearer communication.

- Practical Applications: Bobby concludes by expressing his desire to make visualization tools more accessible, suggesting that future services could automate this process for users trying to visualize their repository's data.

Ultimately, Wilson's talk emphasizes the accessibility of data visualization techniques and tools while advocating for a structured approach to transforming raw data into clear, insightful visual formats.

Confessions of a Data Junkie: Fetching, Parsing, and Visualization
Bobby Wilson • February 17, 2015 • Earth

By, Bobby Wilson
This is not a scientific talk. It's a survey of each step in my process of creating something meaningful out of a pile of data. The talk will be a blend of anecdotes, process, data sources, tools available, and good ol' hacks. We live in a world filled with recorded data. Lots of that data is online, and retrievable in some fashion. Unfortunately, that data is often sparse, poorly labeled, and uninsured. Enter Ruby, with a wide variety of gems and libraries to help us trudge through funky data, store it, sort it out, and spit shine it for the masses.

Help us caption & translate this video!

http://amara.org/v/GJR0/

MountainWest RubyConf 2011