Extending bts-link beyond debbugs (starting with mantis)

We want to be able to extend bts-link (or -like services) so that bugs linked between different bugtrackers can be linked to each-other and the status changes monitored, as part of our tasks in Helios (see LinkedBugsMonitoring).

We’ll probably try and work, until July, on connecting to Mantis as the “distribution/downstream” bugtracker instead of debbugs, and see what architectural changes would be required to test that for the Helios platform specific needs.

I’d like to be able to have a working prototype for July, which would also demonstrate the LinkedData and Semantic Web approach to navigating the bugs of the open source ecosystem, so that we can discuss it at the Debconf. I’ve filed a proposal for a paper at DebConf for this purpose : towards more semantic web into Debian servers (UDD and likes) (see previous post on that matter also).

I suppose that our Mandriva colleagues will be able to move on on bugzilla for SWIM/Mephisto so that we can quickly have very interesting prototypes.

I miss time to describe all that in more details (and will appreciate the coming holidays week 😉 ).

Comments welcome, of course.

UDD export as RDF data through D2R and a corresponding mapping definition

(this is a copy of a post I made on the debian-qa list)

I’ve been pursuing previous ideas about the use of Semantic Web standards for publication of facts about Debian using the UDD database as a source.

FYI, I have a running prototype (unfortunately only on my laptop at the moment, hopefully on the Web some day soon) which uses D2R, a server which can publish RDBMS tables as RDF documents, using a mapping description.
Continue reading “UDD export as RDF data through D2R and a corresponding mapping definition”

UDD, SWIM, Flossmetrics : facts databases about libre software distributions… going Semantic ?

I’ve attended the recent FOSDEM 2009 (great as always), where a number of presentations triggered a lot of my interest.

First @DebianRoom where Lucas presented UDD, the Universal Debian Database. This database groups facts about the Debian project, to ease the creation of queries on what’s happening in the Distribution. This is for instance very helpful for QA tasks, like counting bugs with certain characteristics, or comparing packages in various ways.
Note that a complementary presentation by Enrico was very interesting, on DDE : Debian Data Export (edit: see also his post on DDE), showing ways to offer services to query UDD.

Another presentation, @CrossDesktopRoom introduced the Flossmetrics database, which is collected out of many libre software projects, by extracting contents of the project data from the hosting forges. Very much interesting, in particular since the data becomes available, and a large number of projects allow researchers to compare them in many ways.

Maybe Flossmetrics could benefit from data coming from the Debian UDD… or vice versa ? I think contacts have been taken to think about potential future interchange between the 2.

A general criticism I could make on these two databases is that their schema (the tables & columns layout, as well as the eventual relations), and the code of the data “harvesters” is the only way to understand the real meaning of these data. There’s not so much semantics. Sometimes for known reasons, because, as explained by the UDD developers, there’s actually much incoherence in some of the Debian tools already, and it still it happens to deliver 😉

I’m thinking of a way to produce similar databases of facts (results of queries on these) with Semantic Web standards, to try and convey some bits of commonly agreed semantics, hence fostering interoperability of these databases, and maybe allow comparison of facts relating to different projects. Edit: actually, this idea has been in the air for a year, as readers of a provious post may remember. See in particular the paper by James Howison presented last year at WOPDASD 2008.

It happens that Mandriva, as a followup of the Nepomuk project is indeed trying to setup such a database (called SWIM at the moment. Edit: it has been renamed to MEPHISTO. See my other post on it.) with the use of RDF ontologies, to store facts and annotations about its distribution (more details here). In the HELIOS project, we’ll certainly try and investigate the use of such techniques to try and manipulate such data, like bugs for instance.

I’m thinking about providing an access to UDD with the use of a SWIM-like service, so maybe we can imagine things like more linking of facts about packages, people, bugs and such between Mandriva and Debian, for instance.

Note that at the FOSDEM there were also interesting presentations relating to these kinds of semantic techniques, both relating to outcomes of the Nepomuk project : one about the integration of KDE 4.2 in Debian, where tools like Soprano were mentioned, and another about Tracker in Gnome (which I haven’t attended) about the same kind of techno on the Gnome side.

The future seems semantic, somehow… and we have then a lot of work ahead of us. More to come.

Introduction to bts-link slides

bts-link is a very useful tool which helps keep track of bug status changes when a Debian bug has been marked as linked to another bug in an other (upstream) bugtracker.

I’ve prepared some slides of introduction for our partners in Helios to introduce them to that tool :

These slides can also be found on the helios site (ODP source).

Also, I’ve started contributing to bts-link in the frame of our work on Helios, and I’m glad my contributions have been integrated, although learning git in the way was kind of a pain 😉

More details about bts-link at : http://bts-link.alioth.debian.org/

Is GForge really GForge ? clarification needed

Gforge 4.5 is the FLOSS version of GForge, that most of us can download, install, for instance as Debian packages.

But for some users looking for “the” latest, most up-to-date GForge, there’s a risk to go test (and buy) a non-free version of GForge AS, the “proprietary” version made by the main initial author of GForge, Tim Perdue (through his company GForge Group). While having a new product developed by a company which contributed to the liberation of GForge is not necessarily bad, the problem is that this confusion hinders the momentum for contributions to the free code base.

There seem to be some movement in the GForge community these days, towards more dynamics on the development of this leading FLOSS project, which hosts so many interesting libre projects.

Update 2009/01/25 :Expect some announcement in the forthcoming days on this front. Roland Mas has just announced what I was more or less announcing in this teaser : FusionForge has forked been announced.