> Why does my component have a low rate of reuse ?


There can be several reasons for this. Most often it’s because

> your project is an end-user application which is not likely to be embedded, and is not explicitly reused as a result, or

> the uploaded file is not characteristic of your project or that release.

We also observed that projects which tend to release several patched versions will see low reuse rating at the release level – the reuse will be diluted over several releases. In this case, displaying all measures in point form will fix this problem. Likewise, integrating a new release into a project can take several weeks, or may not happen at all.

> Why provide this service to the community?


The measure of reputation, based on monitoring the rate of reuse within a large set of open source projects, is a byproduct of our R&D program. As a frequent user of open source libraries and applications, and in the belief that it could be useful to the community, we thought it natural to provide this service for free.

> Are Antelink products open source?


Antelink is a software vendor. Our market position leaves little room for developing service/consulting activities. As a result, we didn’t opt for an open source model. As users of open source libraries and solutions, we felt it was important to give something back to open source communities. Part of this will involve developing products  and services that can be used for free by open source communities.

> Why a new metric?


Many classifications rank open source projects according to the number of developers, forum activity, or the number of downloads. One metric alone is always questionable and subject to interpretation. Combining several, however, provides a ‘big picture’. New metrics are useful in that they make it possible to characterize new uses.

> How do you define this metric?


We take a file and measure the amount of reuse in a large database of open source projects. Reuse can be direct, involving the explicit integration of the file, or indirect via calls. We improve our algorithms based on the scenarios we encounter, so feel free to share your experiences sending us feed-back.

> Is it use or reuse?


We do measure reuse, and naturally focus on libraries and forked projects. Open source projects dedicated to end-user type applications will not be highly rated.

> Which open source projects are you considering?


Initially, we wanted to start with a representative sample. In terms of the number of hosted projects, the two main forges are Sourceforge and GoogleCode. We are currently downloading content from 162,608 projects – an interesting start in our view. In practice, we use other sources as well such as savannah, RubyForge and OW2.

> Are your statistics updated on a regular basis?


Our engine continually picks up new contents and updates data on projects which are already catalogued. The frequency of updates varies between a few days and a few weeks based on empirical rules.

> How robust is your metric?


We focus part of our R&D work on ensuring the robustness of our metric. This involves taking into account uses which may distort results by counting the same reuse several times. For example, we automatically identify duplicate projects found on both SourceForce and GoogleCode. They are only counted once.

> What kinds of files can be uploaded?


In principle, any type of binary file (jar, tar, png, jpg, etc) or text (code source, readme, licence, etc.). The measure is not used in exactly the same way for every case of course. Special attention must be paid to content associated with reuse (.jar, .dll, .h, .css, .png, etc). The only restriction is on file size (100B – 100Mb max.).

 
 
 
 
 
 
© 2008-2010 antelink. All rights reserved.