I am trying to gauge the popularity of a programming language with:
- The number of stars in the repositories combined with ...
- The programming languages โโused in repo and ...
- Total bytes of code in each language (recognizing that some languages โโare more / less detailed)
Conveniently, there is massive Github information provided by Github Archive and hosted by BigQuery. The only problem is that I donโt see the โlanguageโ available in any of the payloads for the various types of events in Github Archive.
Here's the BigQuery query that I ran, trying to find if and where, the language can be populated with Github Archive data:
SELECT *
FROM [githubarchive:month.201612]
WHERE JSON_EXTRACT(payload, "$.repository.language") is null
LIMIT 100
-, , , Github ? ? , BigQuery github_repos, , . - (.. "" , ).
!