owned this note
owned this note
Published
Linked with GitHub
---
outline: deep
---
# OSS Data Analytics (1)
## Overview
The China Open Source Annual Report is based on in-depth and comprehensive data insights and is divided into eight major parts. The 1st part, **General Overall Macro Insights**, provides an overview of China's global open-source ecology through an in-depth analysis of essential events, active repositories, active users, open-source licensing, and programming languages. The 2nd part, **OpenRank Rank List**, is the list of open source projects, enterprises, foundations, developers, and collaborative robots in all areas of the world and China, and provides a comprehensive and systematic OpenRank indicator information service for industry. Part 3 and Part 4 contain **Enterprise Insights** and **Foundation Insights**, which illustrate the evolution of global and Chinese enterprises and foundations in the open source area through evolution maps and trend analyses. Part 5 **Technology Sector Insights** provides an in-depth study on the evolution of the Top 10 lists and projects in each area, showing the direction and trends in forward technology. Part 6 **Open Source Project Insights** provides insights into the diversity and innovative directions of different project types, areas, and topics. Part 7 **Open Source Developer Insights** An analysis of developer types, hours of work, geographical distribution, and robotic use shows the diversity and characteristics of the developer community. Part 8, **Case Studies**, provides a series of interesting case analyses that allow readers to glimpse China's exponential ecological boom. Overall, the data page offers a panorama of China's open-source ecology in 2023 through rich data insights and analyses.
### Introduction to indicators
**OpenRank**
The OpenRank indicator is a collaborative network indicator developed by the X-lab Open Laboratory and based on an open source developer-project collaborative relationships network, which not only characterizes the overall development of projects community participation but also introduces elements of open source ecology, which can be well identified and displayed by such entities as projects, people, organizations, etc. in open source ecology. OpenRank is now widely accepted by industry and academia, including the China Institute for Standardization (ISI) series of Open Source Governance Standards, the ICT White Paper on Open Source Governance, the Open Atomic Open Source Foundation Global Open Source Screen, and the Business Open Source Office Governance Toolkit.
For a definition of this indicator, refer to:
[1] [Shengyu Zhao et al.: OpenRank Leaderboard: Motivating Open Source Collections Through Social Network Evaluation in Alibaba. ICSE, 2024] (https\://www\.researchgate.net/publication/3766686121_OpenRank_Leaderboard_Motivating_Open_Source_Collections_Through_Social_Network_Evaluation_in_Alibaba)
[2] [Zhao Honghou: How to evaluate an open source project (iii) value stream, 2021] (https\://blog.frankzhao.cn/how_to_measure_open_source_3)
[3] Institute for Standardization of the Ministry of Industry and Information: Information Technology Open Source Governance Part 3:Community Governance and Operationalisation [T/CESA 1270.3-2023]; Information Technology Open Source Governance Part 5:Evaluation Model for Open Source Contributors" [T/CESA 1270.5-2023], 2023
**Activity**
Activity is a statistical indicator of the level of activity of the X-lab researcher or developer. Developer activity is weighted by the behavior of developers, such as Issue, PR, and Code Review. The project's activity is processed by the sum of the total activity of all developers in the project.
For a definition of this indicator, refer to:
[1] [Xiaoya Xia et al: Exploring activity and contributors on GitHub: Who, what, when, and when. APSEC, 2023](https://ieeexplore.iee.org/abstract/document/10043221)
[2] [Frank Zhao:How to evaluate an open source project (i) - activity,2021](https://blog.frankzhao.cn/how_to_measure_open_source_1)
## 1. Overall Macro Insight
### 1.1 Basic Events
**Basic events** are the database for this data page analysis and refer to a series of event log data generated by developer activity on GitHub, Gitee, and others on the global open-source collaborative platform. A statistical analysis of underlying events provides a macro insight into the dynamics of global ecological development. This annual open-source report covers the collaborative platforms GitHub, Gitee, and GitLink.
#### 1.1.1 Trends in events across GitHub
First, the total number of events logs for statistical analysis across GitHub is shown in the graph below.

<center>Figure 1.1 Trends in GitHub annual events </center>
<br>
The overall activity of global open sources and the number of active warehouses have increased significantly in recent years, reflecting the growth rate in global open-source development.2023 GitHub log data reached 1.4 billion compared to 2022 when it increased by about 10.32 percent. After high growth in 2018-2020, the GitHub platform's annual event growth gradually declined, with a growth rate of about 10% in 2023. However, the 10 percent growth rate, because of its overall volume, continues to highlight open-source technology's dynamic and critical role in the global digital transition.
#### 1.1.2 Comparison of overall events trends in GitHub and Gitee
Because of the size of the events active on the GitHub platform, the subsequent analysis was built on the benchmark of the top 30,000 active warehouses per platform. For ease of comparison, we have selected GitHub for statistical analysis of 8 categories of events of greater relevance to open source participation in Gitee, including CommunityCommentEvent, ForkEvent, IssueCommentEvent, IssuesEvent, FullRequestEvent, FullRequestReviewCommentEvent, PushEvent, and WatchEvent.

<center>Figure 1.2 GitHub and Gitee Active Repository Events </center>
<br>
The Gitee platform showed a more pronounced growth trend. Even since 2021, the number of incidents in the top 30,000 active warehouses has surpassed GitHub, highlighting the outbreak of active open-source projects in the country. Domestic developers' active participation and contribution to open-source communities have injected new dynamism into technological innovation and knowledge sharing.
However, it must be emphasized that data on the first 30,000 active projects alone does not fully reveal the reality of the global GitHub platform, as the long-end effects are still evident globally. Subsequent analyses will reflect this more clearly, especially in the broad and diverse nature of the GitHub platform as the world's leading open-source community. In the future, with the evolution of technology and the promotion of an open-source culture, the Chinese open-source community can be expected to continue to flourish globally.
Further to the analysis of disaggregated data on underlying events, the results are shown in the figure below.
[1-3](/image/data/chapter_1/1-3.png)
<center>Figure 1.3 GitHub vs. Gitee Active Repository Event Types </center>
<br>
Can be seen from the analytics results:
The most frequent event type on the GitHub platform is the Push event, while Pull Request events and Issue Comment events rank 2nd and 3rd, respectively. The occurrence rates of each event type have remained relatively stable, reflecting a trend towards a stable ecosystem in GitHub's open-source community.
On the Gitee platform, event data grew significantly in 2020, initially focusing on Watch events. But after 2020, Pull Request and Review Events grew rapidly, becoming the largest event type in 2022 and growing steadily in 2023. The structural changes in Gitee event data reflect a significant shift in the role of domestic developers from a watchdog to a contributor, which is consistent with observations worldwide.
#### 1.1.3 GitLink Events Analysis
For the GitLink platform, we have also selected the top 30,000 active repositories as benchmarks. Given the limitations of the data, only data covering the six types of events—CommunityCommentEvent, ForkEvent, IssueCommentEvent, IssuesEvent, FullRequestEvent, and WatchEvent—were selected for analysis.

<center>Data analysis of events on the GitLink platform </center>
<br>
While the number of active repository events on GitLink still lags behind platforms like GitHub and Gitee, it exhibits a notable upward trend. On the GitLink platform, Issues events and CommitComment events constitute the vast majority of active repository events.
### 1.2 Active Repository
#### 1.2.1 Trends in GitHub total number of active warehouses
The following figure shows the statistical analysis of the overall activity trends of GitHub and Gitee active repositories.

<center>Figure 1.5 Trends in the number of GitHub annual active repositories </center>
<br>
According to overall data for 2023, the total number of active repositories worldwide reached 87.92 million, marking a 4.06% increase from the previous year; this aligns with the overall trend in events, which has been declining annually since experiencing high growth from 2018 to 2020. This decline could stem from the COVID-19 pandemic and global economic developments.
Because of the gap in the number of GitHub and Gitee warehouses, the following analytical work is also based on 30,000 active repositories in front of each platform.
#### 1.2.2 Comparison of the overall activity of GitHub and Gitee
The graph below shows the statistical analysis of GitHub and Gitee's overall activity in the repositories.

<center>Figure 1.6 GitHub vs. Gitee active repository activity </center>
<br>
Looking at the activity data of the top 30,000 active repositories from each platform, the overall activity on the Gitee platform grew rapidly from 2019 onwards. By 2022, it surpassed GitHub and maintained this high-growth trend, revealing the enormous vitality of open-source development in China during this period.

<center>Figure 1.7 GitHub compared to Gitee active repository activity </center>
<br>
Furthermore, the detailed analysis of the composition of the activity reveals the following:
On the GitHub platform, the activity stemming from "Create PR" events comprises nearly half of the total activity, while "Merge PR" events contribute to approximately one-fourth. Reviewing PRs contributes around 10% of the activity, while the combined activity from issue creation and comments nearly matches, accounting for 7%.
On the Gitee platform, the highest activity contribution comes from reviewing PRs, constituting two-thirds of the total activity. Similarly to GitHub, "Merge PR" events follow closely behind in activity contribution, with a proportion comparable to that on the GitHub platform. A surprising finding is that while "Create PR" events contribute the highest proportion of activity on GitHub, they contribute the least on the Gitee platform, accounting for only 2% of the total activity events.
#### 1.2.3 GitHub and Gitee overall active repository OpenRank trends vs.
The graph below shows the statistical analysis of GitHub and Gitee's active repository, OpenRank trends.

<center>Figure 1.8 GitHub vs. Gitee Active Repository OpenRank </center>
<br>
Although the activity of the top 30,000 repositories on Gitee briefly surpassed that of GitHub in 2022, the influence gap measured by OpenRank remains significant (approximately 5:2). Not only is the gap considerable but there also seems to be no indication of it narrowing in terms of trends. This is particularly noteworthy and underscores a key area of focus for future open-source development in China.
### 1.3 Active users
#### 1.3.1 Trends in the total number of active users on GitHub
The following figure presents a statistical analysis of the overall active user count on GitHub.

<center>Figure 1.9 Trends in GitHub annual active users </center>
<br>
In 2023, the total number of active developers in the field reached 21.93 million, an increase of 8.88 percent over the previous year. Like the GitHub active warehouse, after nearly five years of high growth, the growth rate began to decline in 2020. The growth of active users on the GitHub platform began to slow (although the GitHub official announced at the beginning of 2023 that the overall number of users of its platform surpassed 100 million), there was also some correlation with changes in the global situation and the rise of a platform like Gitee.
#### 1.3.2 Active user geographical distribution and ranking
The annual report can include detailed geo-location data analysis for GitHub developers as a contribution to the award-winning game of the OpenDigger Open Source Software Ecological Data Analysis Dredging Platform ([OpenSODA](https://github.com/ECNU/OpenSODA)).
The following analysis is based on approximately 2 million developers who have correctly filled in their geographical location information out of the 10 million active developers on GitHub in 2023. Considering the total registered users on GitHub to be 100 million, the sampling ratio is approximately 2%.
**1. Geographical distribution of global developers**
First, analyze developers' geographical distribution worldwide, as shown in the following chart.

<center> Figure 1.10 Global geographical distribution of developers </center>
<br>
<center> Table 1.1 Global Developer Distribution by Country/Region (Top 15) </center>
<br>
| Ranking | States | Total Number | Percentage | Annual Activity | Active rate |
| :-----: | :------------: | :----------: | :--------: | :-------------: | :---------: |
| 1 | United States | 408983 | 21.09% | 236899 | 57.92% |
| 2 | India | 177669 | 9.16% | 107066 | 60.26% |
| 3 | China | 171039 | 8.82% | 126238 | 73.81% |
| 4 | Brazil | 114855 | 5.92% | 83932 | 73.08% |
| 5 | Germany | 88767 | 4.58% | 64836 | 73.04% |
| 6 | United Kingdom | 83245 | 4.29% | 55175 | 66.28% |
| 7 | Canada | 65241 | 3.36% | 42238 | 64.74% |
| 8 | France | 57480 | 2.96% | 40341 | 70.18% |
| 9 | Russia | 47213 | 2.43% | 31534 | 66.79% |
| 10 | Australia | 31638 | 1.63% | 20512 | 64.83% |
| 11 | Poland | 31469 | 1.62% | 21792 | 69.25% |
| 12 | Japan | 30873 | 1.59% | 21942 | 71.07% |
| 13 | Netherlands | 30617 | 1.58% | 21685 | 70.83% |
| 14 | Spain | 28928 | 1.49% | 19509 | 67.44% |
| 15 | South Korea | 28325 | 1.46% | 21811 | 77.00% |
Overall, developers from various countries are continuously increasing:
- The United States ranks first due to its early involvement in the open-source domain and its advantage in technology talent.
- Based on the calculated total number of developers from the United States in the table (409,000), the actual number of developers from the United States on GitHub is estimated to be around 21.01 million, with a deviation of approximately 4% from the official data released by GitHub (22 million).
- India, China, and Brazil, with their large population bases, rank second, third, and fourth in terms of the number of developers. However, based on the activity rate (annual active users/total users), China has the highest rate among the top four.
- Developers from European countries also constitute a significant force in the open-source community, collectively ranking second in volume.
- According to the official data released by GitHub and Gitee (both around 12 million), the total number of global open-source developers from China is likely to exceed 20 million, roughly equivalent to the number from the United States in quantity alone.
**2. Geographical distribution of Chinese developers**
Further analysis shows the geographical distribution of Chinese developers, as shown in the graph below\.Of these, the data sources are almost 150,000 developers of “China” users who correctly fill out provincial information.

<center> Figure 1.11 Geographical distribution of Chinese developers </center>
<br>
According to data from GitHub 2023 Q3 quarter, the total number of Chinese developers is approximately 18.8 million, which can be estimated on the basis of proportion to the total actual developers in each province.
<center> Table 1.2 Distribution of Chinese Developers (Top 15) </center>
<br>
| Ranking | Provinces | Total Number | National percentage | Actual Total |
| :-----: | :-------: | :----------: | :-----------------: | :-------------: |
| 1 | Beijing | 32982 | 22.04% | 262.25 million |
| 2 | Sengah | 24581 | 16.43% | 1955.45 million |
| 3 | Guangdong | 21684 | 14.49% | 172.41 000 |
| 4 | Zhejiang | 14256 | 9.53% | 113.35 million |
| 5 | Taiwan | 12173 | 8.13% | 96.79 million |
| 6 | Jiangsu | 7335 | 4.90% | 58.32 million |
| 7 | Chechen | 7012 | 4.69% | 55.75 million |
| 8 | Hong Kong | 4678 | 3.13% | 37.19 million |
| 9 | Hubei | 4415 | 2.95% | 35.1 million |
| 10 | Shaanxi | 2815 | 1.88% | 22.38 000 |
| 11 | Fujian | 2405 | 1.61% | 19.12 million |
| 12 | Shandong | 2035 | 1.36% | 16.18 million |
| 13 | Hunan | 1858 | 1.24% | 14.77 000 |
| 14 | Chongqing | 1833 | 1.22% | 1457 000 |
| 15 | Annah | 1487 | 0.99% | 11.82 million |
Ranking and data in the above table reveal the relevance of Chinese open-source developers and regional economic development levels:
- The number of open source developers in the North, Upper and Zhej's four major cities has surpassed one million classes, particularly in Beijing;
- The fifth and eighth places respectively of Taiwan and Hong Kong, highlighting the importance of Hong Kong and the Taiwan Strait;
- The open source developer in the Long Triangle (Jijjiang Zhejushu) region has reached almost 38.8 million;
- The central western regions, such as Sichuan, Hubei and Shaanxi, have also shown good performance, particularly in Sichuan, which has attracted a large number of developers through their suitable, fast-growing software industries.
### 1.4 Open source licenses
#### 1.4.1 Number of warehouses using open-source licenses
The graph below shows the number of open-source licenses that GitHub's active repository uses.
<center>
<img src="/image/data/chapter_1/1-11.png" alt="1-11" width="450px"/>
</center>
<center> Figure 1.12 Number of warehouses using open source licenses </center>
<br>
The analysis revealed that the most used open-source licenses are currently available, including MIT licenses, Apache licenses v2.0, GNU General Public Licence v3.0, and BSD 3-Clause licenses. Of these, MIT licenses rank first to reach 60%. The MIT license is named after the Massachusetts Institute of Technology. The simplicity and flexibility of MIT licenses have made it one of the licenses chosen by many developers and have provided the least legal restrictions to encourage developers to use and disseminate software freely.
#### 1.4.2 Trends in Open-Source Licensing Types
Statistical analysis has been conducted on the trends of open-source license types, as shown in the following figures.

<center> Figure 1.13 Trends in the Number of Open Source License Types </center>
<br>
Overall, the number of open-source license types has continuously increased since 2017. Introducing licenses such as the Eclipse Public License 2.0, the European Union Public License 1.2, and others contributed to the growth observed between 2017 and 2018. Subsequently, the growth rate of open-source license types slowed down. Between 2021 and 2022, a new batch of open-source licenses, such as the Mulan Series Licenses and the CERN License v2, began to emerge. Following this, the development trend stabilized, and currently, the mainstream license types on GitHub have remained steady at 46 types for two years.
### 1.4.3 Trends in the Number of Repositories Using Open Source Licenses
According to Github's log data, in 2023, nearly 7.7 million active repositories used various open-source licenses, accounting for 8.76% of all active repositories. We present the MIT License's data separately due to its significant influence.
**1. Trends in the Number of Repositories Using the MIT License**
Statistical analysis of the trends in the number of repositories using the MIT License is shown in the following figure.

<center> Figure 1.14: Trends in the Number of Repositories Using the MIT License </center>
<br>
Observations:
- The MIT License is currently the most popular open-source license, with 1.58 million active repositories in 2023.
- The trends in the number of repositories using the MIT License are similar to those of the total repository count, with significant growth observed. However, the growth rate slowed down in 2022 and 2023, which correlates with the overall slowdown in project growth.
**2. Trends in the Number of Repositories Using Other Top Five Open Source Licenses**
The following figure shows a statistical analysis of the trends in the number of repositories using other top-five open-source licenses.

<center> Figure 1.15: Trends in the Number of Repositories Using Other Licenses </center>
<br>
Observations:
- The number of open-source licenses is growing, with MIT, Apache, and GNU licenses remaining the top choices.
- Differences between niche and popular open-source licenses still exist.
- Since 2022, the usage of GNU General Public License (GPL) versions 2 and 3 has been declining overall, while GNU Affero General Public License version 3 has been increasing yearly.
#### 1.4.3 Trends in the Number of Repositories Using the Mulan Series Licenses
The following figure shows a statistical analysis of the trends in the number of repositories using the Mulan Series Licenses.

<center> Figure 1.16 Accumulative Trends in the Number of Repositories Using the Mulan Series Licenses </center>
<br>
The Mulan Series Licenses (including the Mulan Permissive Software License and the Mulan Public License, among others) are drafted, revised, and released by Peking University, with the support of the National Standardization Technical Committee on Cloud Computing and the China Open Source Cloud Alliance. As the first open-source software agreement recognized by the Open Source Initiative (OSI) in China, the Mulan Permissive Software License (Mulan PSL) holds significant influence.
Observations indicate a growth in repositories utilizing the Mulan licenses starting September 2022. By December 2023, there were 220 such active repositories, showcasing the increasing influence of Mulan open-source licenses.
### 1.5 Programming Languages
#### 1.5.1 Top Programming Languages Used by Developers in 2023
The popularity of programming languages is of great interest to developers. The analysis below presents the most popular programming languages among developers in 2023, as shown in the following table.
<center> Table 1.3: Top 15 Programming Languages Used by Developers </center>
<br>
| Rank | Programming Language | Number of Developers Using | Number of Repositories Using |
|:-------:|:-----------------------:|:-------------------------------:|:--------------------------------:|
| 1 | JavaScript | 765,589 | 1,806,477 |
| 2 | Python | 629,423 | 653,025 |
| 3 | HTML | 564,121 | 676,364 |
| 4 | TypeScript | 462,729 | 886,453 |
| 5 | Java | 368,795 | 463,660 |
| 6 | CSS | 190,480 | 239,187 |
| 7 | C++ | 177,905 | 135,330 |
| 8 | C# | 158,159 | 180,537 |
| 9 | Go | 143,433 | 165,367 |
| 10 | PHP | 128,186 | 272,980 |
| 11 | Jupyter Notebook | 122,475 | 102,708 |
| 12 | Shell | 122,456 | 108,209 |
| 13 | C | 107,918 | 80,159 |
| 14 | Rust | 69,370 | 72,778 |
| 15 | Ruby | 66,857 | 374,835 |
| 16 | Kotlin | 64,307 | 62,709 |
| 17 | Vue | 56,099 | 170,639 |
| 18 | SCSS | 50,526 | 44,672 |
| 19 | Dart | 46,143 | 43,006 |
| 20 | Swift | 33,839 | 35,978 |
From the table above:
- The top five programming languages most used by developers are JavaScript, Python, HTML, TypeScript, and Java, which represent the leading programming languages developers use. Starting from the sixth-ranked CSS, the number of users decreased by nearly half compared to Java, the fifth-ranked language.
#### 1.5.2 Trends in Programming Language Usage from 2019 to 2023
Statistical analysis of developers' programming language usage trends from 2019 to 2023 is depicted in the following figure.

<center>Figure 1.17: Trends in Programming Language Usage from 2019 to 2023</center>
<br>
Observations from the figure:
- JavaScript, Python, HTML, TypeScript, and Java are the leading programming languages developers use.
- Python and TypeScript have shown rapid growth compared to the other three primary languages and have maintained a consistently rapid growth trend over the past five years.
- TypeScript, in particular, has experienced rapid growth in the number of users over the past five years. In 2021, it significantly surpassed other programming languages, becoming one of the main programming languages developers use. Perhaps by 2024, the number of developers using it will be comparable to the number of developers using HTML, which is ranked third.
## 2. OpenRank Rankings
**Rankings** are a popular form of presenting analysis results.
The 2023 China Open Source Annual Report separates the rankings into a dedicated section for centralized display. This is partly to showcase better the development trends of various entities (repositories/projects, countries/regions, enterprises, foundations, developers, etc.) in the open source ecosystem, and another important reason is the maturation of the OpenRank indicators and the completeness of global data.
With the addition of global data from both GitHub and Gitee this year, we are able to take a global perspective with China's open source as the starting point, allowing the world to see the joint efforts and contributions of Chinese enterprises, foundations, developers, and other entities in developing the global open-source ecosystem, which is not available in other reports on the market.
### 2.1 Global Open Source Repository OpenRank Rankings

<center> Figure 2.1 Global Open Source Project OpenRank Rankings (Top 20) </center>
### 2.2 China Open Source Project OpenRank Rankings

<center> Figure 2.2 China Open Source Project OpenRank Rankings (Top 20) </center>
<br>
> Chinese open-source projects are based on data from the OpenDigger project tags, and a single project may include multiple organizations or repositories on GitHub or Gitee platforms.
### 2.3 Global Enterprise OpenRank Rankings

<center> Figure 2.3 Global Enterprise OpenRank Rankings (Top 20) </center>
<br>
> Enterprise rankings are based on data from OpenDigger project tags, meaning the sum of all open source projects initiated by a certain enterprise's OpenRank, including projects donated to foundations.
### 2.4 China Enterprise OpenRank Rankings

<center> Figure 2.4 China Enterprise OpenRank Rankings (Top 20) </center>
### 2.5 Global Foundation OpenRank Rankings

<center> Figure 2.5 Global Foundation OpenRank Rankings (Top 10) </center>
### 2.6 Country and Region OpenRank Rankings

<center> Figure 2.6 Country and Region OpenRank Rankings (Top 20) </center>
<br>
> Country and region data is based on location information filled in by GitHub developers, with a sample size of the top 10 million OpenRank users globally.
### 2.7 Global Developer OpenRank Rankings

<center> Figure 2.7 Global Developer OpenRank Rankings (Top 30) </center>
### 2.8 China Developer OpenRank Rankings

<center> Figure 2.8 China Developer OpenRank Rankings (Top 30) </center>
<br>
> Chinese developer accounts are based on OpenDigger tag data.
## 3. Enterprise Insights
Enterprises are the core force driving the development of the global open-source ecosystem. They are initiators, as well as developers and maintainers, at the forefront of the development and commercial exploration of open-source projects.
### 3.1 Evolution of Global Enterprise OpenRank Over the Past 10 Years


<center> Figure 3.1 Changes in China Enterprise OpenRank Rankings </center>
<br>
Observations on the global impact of enterprise open source are as follows:
- Microsoft began laying out open source over a decade ago (in 2008) and reached the pinnacle of global open source influence in 2016, a position it has held unchallenged to this day.
- Since being officially sanctioned by the United States in 2019, Huawei has made open source a strategic priority. It has been soaring ever since and surpassed Google and Amazon this year.
- Alibaba has been a leader in domestic open source until 2021 and has maintained its sixth position globally.
- Ant Group's performance in the past three years has been remarkable, and it officially entered the top ten in the world in 2023.
- Baidu, the fourth largest player in domestic open source, has fallen to 12th globally due to rapid changes in the domestic open source landscape.
- According to the [OpenLeaderboard](https://open-leaderboard.x-lab.info/), Chinese enterprises entering the top 30 globally also include ByteDance (18), PingCAP (19), Feizhiyun (24), Deepin (25), Tencent (26), and Espressif (27).
### 3.2 Evolution of China Enterprise OpenRank Over the Past 10 Years

<center> Figure 3.2 Changes in China Enterprise OpenRank Rankings </center>
<br>
This chart effectively demonstrates the open-source strategies of domestic companies and their changing trends:
Huawei began to make efforts in 2019 and, in just two years, achieved first place in China and second place globally.
As traditional domestic leaders in open source, Alibaba and Ant have shown stable performance.
- Baidu has slipped to fourth place due to competition from the first three.
- ByteDance has made visible and rapid progress in recent years.
- Espressif (Espressif Systems) is a relatively low-profile semiconductor open-source leader in China.
- Fit2Cloud is another low-key but pragmatic open-source enterprise, with several open-source software under its belt being highly favored by developers.
- Tencent, PingCAP, JD, and TAOS have shown a slight downward trend in the past two years, indicating that competition in the post-pandemic era will intensify.
### 3.3 Proportion of China Enterprises' OpenRank on GitHub/Gitee Platforms
<div align="center">
<img src="/image/data/chapter_3/3-3.png" alt="3-3" width="300px"/>
<img src="/image/data/chapter_3/3-4.png" alt="3-4" width="400px"/>
</div>
<center> Figure 3.3 Proportion of China Enterprises' OpenRank among Global Enterprises (Left) and Comparison of OpenRank between Chinese and American Enterprises at the Project Level (Right) </center>
<br>
The left chart shows the trend of increasing influence of Chinese enterprises in the global open source ecosystem, while the right chart reflects the trend of ups and downs between China and the United States in the post-trade war era, especially after the pandemic. The influence of Chinese open source has risen significantly, as has the influence of companies like Huawei. However, it can also be seen that the gap between Chinese and American enterprises in overall open source influence is still significant (about 3 times the difference). Still, this momentum is very promising for the future.
## 4. Foundations Insights
This section examines the development of open-source ecology from a foundation perspective. Foundations are non-profit organizations that play a crucial role in organizing, developing, and innovating open-source projects and communities. They provide comprehensive support in technology, operations, and law to incubate open-source software and guide the building and operation of open-source communities. Foundations act as incubators and accelerators and are essential organizers of the open-source ecosystem. This year, we have included a separate section on insights from open-source foundations, where we can see the global impact of China's open-source foundations.
### 4.1 Global Foundation OpenRank trend analysis
<div align=center>
<img src="/image/data/chapter_4/4-1.png" width="700px">
</div>
<center> Figure 4.1 Global Foundation OpenRank Overall Trend </center>
<br>
The following trends can be seen in:
- The Apache Foundation's #1 ranking has evolved at a mature and steady pace, and today it remains the first choice for many companies to develop globalization projects;
- OpenAtom Open Source Foundation was founded more than three years ago, the rapid development of its projects, and the total impact of its projects beyond the Linux Foundation's sub-foundations, ranked second only after the Apache Foundation;
- LF AI & Data ranked third, outpacing CNCF in cloud-native due to advancements in AI.;
- The development of the other (sub)foundations has generally been relatively stable..
### 4.2 Global Foundation project OpenRank trend analysis
<div align=center>
<img src="/image/data/chapter_4/4-2.png" width="700px">
</div>
<center> Figure 4.2 Global Foundation Project OpenRank Trends </center>
<br>
In terms of open source projects under the Global Foundation:
- Kubernetes continues to rank first, but influence declines every year, giving way to projects in emerging areas;
- Doris, an open source real-time data warehouse initiated by Baidu under the Apache Foundation, has grown rapidly in recent years and ranks second;
- OpenHarmony, a project of OpenAtom Open Source Foundation, and its various sub-repositories are a close second. If combined, they would rank #1.
### 4.3 Analysis of Trends in OpenRank Projects under Foundation in China
<div align=center>
<img src="/image/data/chapter_4/4-3.png" width="700px">
</div>
<center> Figure 4.3 Trends in OpenRank Projects under Foundation in China </center>
<br>
Chinese projects under various foundations are examined separately:
- Doris and OpenHarmony are developing most noticeably;
- The Milvus Vector Database has experienced rapid growth due to demand in the AIGC domain;
- Projects like Flink and ShardingSphere are relatively stable.
### 4.4 Analysis of Trends in OpenRank Projects under the Open Atom Foundation
<div align=center>
<img src="/image/data/chapter_4/4-4.png" width="700px">
</div>
<center> Figure 4.4 Trends in OpenRank Projects under the Open Atom Foundation </center>
<br>
This year marks the first time we can observe the development of projects under the Open Atom Flag:
- The top three are OpenHarmony, openEuler, and Anolis, representing the absolute status of the operating system, especially OpenHarmony, which is developing the fastest;
- Other listed projects are developing steadily, and we look forward to their progress in the new year.