10 Distributions
150,617 Total Packages
2,816,373 Releases
2,885 Upstream Packages
Welcome
OpenSourceWatershed is a project aimed at understanding the relationship between distributions (downstream) and the individual software components (upstream). It is the basis for a larger study of distributions and their evolution. It is distrology. In the future, more distro oriented statistics will be available. More details are below. For now search in the top right for your favorite package to see how up to date the different distributions. Or look at the right to see what new releases happened in the last 24 hours.

The aggregate analysis is done over twenty packages including firefox, gcc and openssh. The full package list is in the OSCON slides. In the future, users will be able to set custom groups. The three forms of analysis are percent obsolete, the average number of newer releases per package and the time since the oldest new release. In other words the lag is the amount of time a distro had to move to a newer package.

There are errors in the database which you can help fix. Just email me if you find one. In the future, you will be able to fix it yourself. For more information about the process behind this analysis please read my senior thesis or email me.
Current Distros
Rank Distro Codename % Obsolete Avg # New Rels Avg Lag
1 arch 40.0% 1.60 9w
2 ubuntu precise 55.0% 5.35 21w
3 fedora 16 65.0% 4.15 23w
4 freebsd 9 70.58% 10.71 66w
5 gentoo 75.0% 7.40 21w
6 funtoo 75.0% 8.35 22w
7 sabayon 5 80.0% 11.75 36w
8 opensuse 12.1 85.0% 13.20 41w
9 slackware 13.37 88.23% 15.71 55w
10 debian squeeze 90.0% 34.10 84w
Future Distros
Rank Distro Codename % Obsolete Avg # New Rels Avg Lag
1 arch 35.0% 1.45 9w
2 gentoo 40.0% 1.15 5w
3 funtoo 40.0% 1.00 2w
4 opensuse 11.4 45.0% 3.50 13w
5 ubuntu quantal 50.0% 9.95 55w
6 debian wheezy 55.0% 3.90 20w
7 fedora 17 65.0% 3.75 15w
8 slackware current 66.66% 4.83 23w
9 freebsd 10 68.75% 9.94 42w
10 sabayon 5 75.0% 11.30 26w
Future
Much more work will happen to OSWatershed in the future. The most urgent addition is distro pages which will feature data on multiple branches. User accounts are also coming in the future. Users will be able to add more data into the database and configure their own package groups to use for analysis. Ideally, spreading the work over a larger number of users (crowd-sourcing) will make the data scope more manageable. To get Scott working on these new feature email him and let him know you are waiting!

©2009 Scott Shawcroft | CC-by 3.0 US - contact - git - trac