Programming Languages On GitHub
Written by Lucy Black   
Saturday, 14 February 2015

What could be better than having a nice interactive display that shows how languages compare? GitHut uses GitHub data to present the state of the languages - on GitHub of course. 

Octocat

There is nothing like contemplating the language choice of others. We spend a lot of time mastering a language and its ecosystem, so it is only reasonable that we have an interest in what else there is.

GitHub has become the main way to host development projects and as such looking at the the number of projects per language is a way to provide information on a language's popularity as judged by its usage. 

The stats give you the number of active repos per language as an overall popularity measure. By looking at total pushes, pushes per repo and new forks per repo, you can get an idea of how active the language sub-GitHub is. You can also see open issues per repo and new watchers per repo, but it is less clear what these indicate. 

As GitHut comments:

"An additional note about the data is about the large amount of records in which the programming language is not specified. This particular characteristic is extremely evident for the Create Events (of repository), therefore it is not possible to visualize the trending language in terms of newly created repositories. For this reason the Activity value (in terms of number of changes pushed) has been considered the best metric for the popularity of programming languages. "

 

 

Click for larger image

githut1

 

If you visit the GitHut website you can select any of the horizontal bars and see the rankings in the other columns. The top language is JavaScript with more repositories and pushes that any other, but notice that the average push per repo and new forks suggest that it isn't quite as active as some other languages. 

Overall the newer languages have smaller numbers of repos, but are much more active in relative terms - which is what you might expect. 

There are also some interesting differences in language behavior. For example, if you look at the statistical language R it doesn't have a lot of repos on GitHub and the number of pushes per repo is comparatively very low but it's number one for new forks - why? Are stats programs more likely to take a whole repo and customize it rather than contribute to the main project?

You can see the same sort of pattern in Apple's new Swift language. 

Of course, no easy stats gathering operation like this is free of error and in this case the big problem is GitHub's categorization of projects based on recognizing file extensions. This is often very wrong. So don't take these results as hard fact. 

 

Octocat

Banner


Meta Launches Five New Professional Certificates on Coursera
23/06/2022

Meta is expanding its training offering on Coursera with five new professional certificates for those wanting to embark on a programming career - for the web, for mobile or as a database engineer. Pre [ ... ]



Rust Adds Source-based Code Coverage
21/06/2022

Rust 1.61 has been released with improvements including custom exit codes from main, along with source-based code coverage. Rust is now safeguarded by the Rust Foundation, a cross-industry body aimed  [ ... ]


More News

 

pythondata

 



 

Comments




or email your comment to: comments@i-programmer.info

Last Updated ( Sunday, 15 February 2015 )