Data Challenge II Results

In April we announced the second annual GitHub data challenge. Since last year, GitHub’s public timeline data on Google BigQuery has grown by over 80 million events, including 3.8 million new repositories, 38 million pushes, and 8 million comments on issues, pull requests, and commits.

After receiving some amazing entries in the previous challenge, we were excited to see what people would discover with another year of data. The results blew us away: we saw many more entrants and novel applications of our data. GitHubbers ranked their favorite entries, and after tallying the votes, we’re happy to announce the top 3 entries for the 2013 GitHub data challenge.

First Place

The Open Source Report Card, by Dan Foreman-Mackey, analyzes a GitHub user’s contributions to produce a “report card” with statistics and automatically generated prose.

Open Source Report Card

Second Place

How often do people use tabs over spaces in Java? How many commits have lines wrapped to 80 characters? Popular Convention by Outsider uses GitHub data to analyze conventions in selected programming languages.

Popular Convention

Third Place

David Fischer’s visualization of open source contributions by location shows the geographic distribution of contributors behind the 200 most active GitHub repositories.

OSS contributions by location


Congratulations to the winning entries, and huge thanks to everyone who submitted an entry! Our top 3 winners will receive gift certificates to the GitHub Shop for $200, $100, and $50, respectively.

We can’t wait to see what the next data challenge will bring!

Have feedback on this post? Let @github know on Twitter.
Need help or found a bug? Contact us.



Discover new ways to build better

Try Marketplace apps free for 14 days

Learn more