History of GitHub

In the previous chapter, we defined and described the three main categories of version control systems. We documented the history of Git distributed version control system (DVCS) and how the current state of Git is really an outgrowth generally of the history of open source development and specifically of the development of the open source Linux operating system kernel.

Definition: GitHub

GitHub or GitHub.com is a cloud source code management service platform designed to enable scalable, efficient, and secure version controlled collaboration by linking local Git version controlled software development by users. GitHub’s main business footprint is hosting a collection of millions of version controlled code repositories. In addition to being a platform for distributed version control system (DVCS), GitHub’s primary features include code review, project management, continuous integration unit testing, GitHub actions, and associated web page (GitHub pages) and documentation hosting and deployment.

As we described in the last chapter when the development of the Linux kernel separated from using the Bitkeeper distributed version control system (DVCS), Linus Torvalds began work writing his own DVCS software. In the first few years, the Git DVCS software was “self hosting” using a coordinating Linux server. However, it quickly became apparent that a more flexible and scalable central cloud service was needed.

The company, GitHub, Inc., was formed in 2007 by Chris Wanstrath, P. J. Hyett, Tom Preston-Werner and Scott Chacon, and the GitHub.com service was rolled out in February 2008 :cite:`GitHubWiki2020`. Figure 8 below shows the growth in GitHub’s users and repositories from its inception in 2008 to the most recent State of the Octoverse report on the company’s progress as well as the Wikipedia entries for “GitHub:cite:`GitHubWiki2020` and “Timeline of GitHub:cite:`GitHubTimelineWiki2020`. GitHub’s users and repositories have grown exponentially since its inception in 2008.

../../_images/GitHubGrowth.png

Fig. 8 Time series of GitHub growth in users and repositories since 2008. Source: Data come from 2013-2019 State of the Octoverse reports as well as Wikipedia articles “GitHub” :cite:`GitHubWiki2020` and “Timeline of GitHub” :cite:`GitHubTimelineWiki2020`.

Acquisition by Microsoft

The vertical line in Figure 8 at June 4, 2018 highlights the data when Microsoft announced its acquisition of GitHub. Some open source users worried that a large for-profit corporation with a long history of antitrust litigation should control the most important platform in open source software development and collaboration. But by 2018, Microsoft had already become one of the heaviest users of GitHub and was maintaining many of the most actively developed repositories on the platform.

In 2014, Steve Ballmer stepped down as CEO of Microsoft and was replaced by Satya Nadella. Nadella quickly began taking Microsoft in a different direction :cite:`MicrosoftOpenWiki2020`. The focus of the expanded beyond the proprietary Windows operating system and into cloud services (Azure), embedded, and mobline computing. In an October 2014 media event discussing Microsoft’s Azure cloud computing services, Nadella put up a slide that said “Microsoft Loves Linux” and read it to the audience :cite:`McAllister2014`. This statement would have been unthinkable under the previous two Microsoft CEO’s, Steve Ballmer and Bill Gates. But from 2014 to its acquisition of GitHub in 2018, Microsoft had shown a strong commitment to the open source community.

Other GitHub Features

In addition to being a platform for distributed version control system (DVCS), GitHub’s primary features include code review, project management, continuous integration unit testing, GitHub actions, and associated web page (GitHub pages) and documentation hosting and deployment. In fact, this Jupyter Book employs most of these extra features. GitHub’s added features allow it to more easily scale code collaboration and development, both in the open source community and among private enterprise applications.

Competition

Other source code management service platforms exist, but none of them has the size, network, scope, and functionality of GitHub. Bitbucket by Atlassian is GitHub’s closest competitor and has a slightly different pricing strategy than GitHub. And GitLab is also a competitor in this space. But a wide gap separates these other platforms from GitHub.