Close

Leaderboard Seed Bootstrap List (unofficial)

A project log for Statistics for 2017 Hackaday Prize Entries

BeatifulSoup data based on Web scraping projects with tag "2017HackadayPrize", or submitted to the list "2017prize"

RoGeorgeRoGeorge 05/05/2017 at 08:004 Comments

Seed bootstrap unofficial list extracted from the Leaderboard https://hackaday.io/prize after closure of the First Round (Design your concept).

The spreadsheet has 2 pages, one page with Likes per Owner, and one with Likes per Projects:
https://docs.google.com/spreadsheets/d/1_x0qd08opSIXT9HwLRJijpff8HcpeQwOCAOdRWukZ-g/edit?usp=sharing


:o)

Discussions

RoGeorge wrote 05/06/2017 at 13:02 point

Shit! I wanted to put reply under reply under reply to look nice, but instead, I managed to totally disrupt the order of the messages here
:o)

----------------------------

A Web site page with charts and statistics about the whole 2017 prize was what I intended to do since this project was started. The web part is already made, but it's not yet populated with charts and data: https://rogeorge.github.io/2017HackadayPrize_Stats/index.html



The question above was especially about the "tbz" set of files you uploaded here. Apart from the rest of the statistics that I intend to collect in the future, your archive of 50 leaderboard JSONs needs their own charts and tables because of 2 reasons:
- they only contain 4 columns, "project Name", "project ID", "likes" and "award".
- the "likes" column will never change in the future. All the other statistics will have new data every dey, but the leaderboard will never be updated after the end of round one, except for maybe once, when the official seed money will be announced.



For the rest of the statistics, I already have a work in progress, but for the leaderboard part that was supposed to use your data, I made no plans yet, because the leaderboard data would not change in the future, so it would not make sense to add for the future statistics. It would make sense just for the first round of the 2017 prize.

  Are you sure? yes | no

Yann Guidon / YGDES wrote 05/05/2017 at 21:49 point

so you managed to process the leaderboard and make sense of the awarded likes ?

  Are you sure? yes | no

[deleted]

[this comment has been deleted]

Yann Guidon / YGDES wrote 05/06/2017 at 12:22 point

It would be good to automate the processing.

The most easy and powerful way would be to create a web page, and embed all the JSON data inside, probably as a merged super-JSON.

Hmmm this gives me an idea...

From there, it's possible to graph/chart every project's growth, evolution, even display groups of projects to see how the correlate... Some SVG would do :-D

  Are you sure? yes | no

RoGeorge wrote 05/06/2017 at 12:56 point

The Google Sheet was hand made, starting from a JSON saved at 5
minutes after the end of the first round. A "find and replace" was made a
few times in the JSON file, manually, in order to remove all the
unwanted JSON text, and keep just the data. The end result was a CSV
file. Then, the project owner column was bordered, and a new sheet for
"sum of Likes by Owner" was made. The end result was uploaded to Google.



Let's say the files you attached are imported in a spreadsheet, with columns
fileDate/projectName/projectID/likes/award. What kind of representations
do you think it will be interesting to see, I mean, what exactly would
you like to have in a chart/table prepared from the tbz file?

  Are you sure? yes | no