08.21.08

The Power of Free Content

Posted in Free Culture, Wikis at 2:02 am by Erik

David Shankbone, who has contributed countless photos to Wikimedia Commons (including many very hard to obtain shots of celebrities), has written a very interesting blog post about how his photos get used throughout the universe of Wikimedia languages and projects: The global reach of just one photo. If you want to see where a photo you’ve uploaded is used, you can use the CheckUsage tool. This kind of global usage is a true testament to what’s possible when content is shared with few copyright restrictions.

07.29.08

LibregameWiki

Posted in Mindshare, Wikis at 1:58 am by Erik

Can open source games be any good? Most open source games are relatively small and simple, but the open source movement has produced some gems over the years, such as Battle for Wesnoth and Neverball. Blender’s Project Apricot is an example of a game developed using a new model of collaborative funding (DVD pre-orders) which was previously successfully used for open source 3D movie productions.

If you’re interested in keeping track of the progress of open source gaming, LibregameWiki is a great place to go. Sadly, many of the open source games it reports on aren’t considered notable enough for Wikipedia. In addition to the games themselves and the people who make them, LibregameWiki also writes about game development contests like RubyWeekend, which have recently become a source of lots of nice, open source mini-games.

06.02.08

Towards A True Wikimedia Commons

Posted in Free Culture, Tools, Wikis at 9:09 am by Erik

Wikimedia Foundation CTO Brion Vibber recently added a very neat feature to the development version of MediaWiki. In order to enable it, all you need is a snippet of code in your LocalSettings.php configuration file:

$wgForeignFileRepos[] = array(
   ‘class’            => ‘ForeignAPIRepo’,
   ‘name’             => ’shared’,
   ‘apibase’          => ‘http://commons.wikimedia.org/w/api.php’,
   ‘fetchDescription’ => true, // Optional
);

Your wiki installation will now have full access to Wikimedia Commons in the same way any Wikimedia wiki does. You can embed image thumbnails of any size, and they will be automatically generated and loaded from Commons. You can click images and see the file description (including the wiki description page, file history and EXIF metadata) loaded from Commons. I haven’t tried to make the embeddable video/audio player work yet, but any file type will be accessible.

This is wonderful, because it makes the nearly 3 million freely licensed files in Commons easily accessible to potentially thousands of wiki users, while retaining the critical licensing information. This implementation does not cache the data in the local wiki, so is not yet suitable for large scale installations. Caching the data intelligently is a significant challenge, as it could be a vector for denial of service attacks and also raises the questions how/when cached files should expire, etc. I wrote a proposal called “InstantCommons” a couple of years ago which included some notes on the issue. After an incomplete first implementation of InstantCommons, I’m glad that we now have a working, simple mechanism for third party use of Commons media. Given that the foreign repository can be any MediaWiki installation, it will also be interesting to see what other wiki-to-wiki exchanges might result from it.

03.20.08

Spam test

Posted in Uncategorized at 7:33 pm by Erik

Even though I’m using the latest WordPress version now, my blog has been hammered by some nasty spam-bots, going so far to overwrite the actual content of posts with spam. I’ve disabled all suspicious user accounts & reset passwords; this is a test to see if a random new post will still get spam injections.

(Compare the relevant WordPress forum thread.)

03.18.08

R.I.P. Arthur C. Clarke

Posted in Thoughts at 11:51 pm by Erik

Thanks for helping us imagine.

03.04.08

The Rise of the Trash Blogs

Posted in Free Culture, Wikis at 7:46 pm by Erik

Wikipedia founder Jimmy Wales is being dragged through the mud by a new type of trash blog. Tabloid journalism has always existed, but the new marketplace of ideas online allows for the rapid evolution of the most effective types of trashy pseudo-journalism: Those blogs that find the right tone — that combination of malicious insinuations, salaciousness, and half-truths — will develop a large audience; those that are just boring (like Kelly Martin rambling about Jimmy’s mortgage) will whither and die.

The recent attacks against Jimmy allege that he had some huge conflict of interest when it comes to editing the “Rachel Marsden” article. Jimmy routinely asks people on our ticket-system, OTRS, to look into articles that are one-sided when people ask him to. He had a conflict
of interest here, which, for obvious reasons, he didn’t want to disclose - so he suggested a smaller conflict of interest in his e-mail to OTRS as a reason for recusing himself from editing it. It would have been better to stay away from the matter entirely, obviously, but I don’t see it as a big deal. I’ m more worried that the community will now massively push it in the other direction as a reaction against perceived bias.

As for his personal life, it is just that: his personal life.

There are two real stories here, IMHO:

* a destructive, trashy kind of pseudo-journalism that invades people’s personal lives under the pretense of a real story;

* the destructiveness and maliciousness on the fringes of our own community.

Jimmy not only created an extraordinary project — he decided to base it on the principles of the open source / free software movement, and turned it over to a non-profit organization. This was, by no means, the obvious thing to do: Had events played out a little differently, Wikipedia would today be a dot-com with ads, probably a subsidiary of Google, Microsoft or Yahoo.

As community leader, Jimmy has developed and emphasized the values that we cherish: the assumption of good faith, the importance of neutrality in open collaboration, and the belief in a shared purpose. When he talks about bringing education to those who cannot afford it, he’s not just trying to impress. Anyone who spends 5 minutes with him will understand that this is his personal life goal.

Moreover, Jimmy stepped back graciously as Chair of the Foundation when he no longer could dedicate as much time as needed to the role. He’s helped us connect with philanthropists here in the Bay Area — donations like the recent 500,000 dollars from the last fundraiser were only possible because of his outreach efforts. His international network of contacts has helped us to build our Advisory Board, really smart people who have supported us on many occasions. In short, he’s been humble and helpful, and has always acted in the best interests of the organization.

When people try to create a malicious caricature of the man, then please remind yourself that actions speak louder than words. And also ask what the self-interest is of those who make the attacks. Whether it’s for personal reasons (15 minutes of fame, a vendetta, or simply an inherently destructive nature) or because of advertising revenue — spreading lies, insinuations and half-truths is in many people’s interest. The emergence of the new open media also means that we, as readers, have a greater responsibility to distinguish the bottom of the barrel crap — designed to stir up trouble — from honest inquiry.

Not too long ago, Jimmy and Tim O’Reilly proposed a Blogger’s Code of Conduct that we, bloggers, would voluntarily adopt. I am increasingly agreeing with him: If we want the emergence of an honest, responsible, but dynamic and healthy new media sphere, we will need a new code of ethics and a set of principles to subscribe to. We need to indicate that we are different from the trash blogs. As such, I am hereby adopting the following modules from the proposed Code of Conduct:

1. Responsibility for our own words

2. Nothing we wouldn’t say in person

3. Connect privately first

4. Take action against attacks

6. Ignore the trolls

8. Keep our sources private

9. Discretion to delete comments

10. Do no harm

I encourage you to do the same.

10.19.07

Of Gutsy Gibbons

Posted in Free Culture, Tools at 11:29 am by Erik

The upgrade of my desktop from Ubuntu Linux “Feisty Fawn” to “Gutsy Gibbon” was moderately sucky. Aside from some odd package dependency issues that I had to resolve forcefully, the upgrade messed up my X server configuration, resulting in an unusable desktop. After some fiddling and some searching, the culprit turned out to be the xserver-xgl package, which doesn’t work with my ATI graphics chipset. ATI’s poor Linux support is hardly an excuse for leaving the user’s system in a worse state than it was before. As long as basic upgrade procedures can lead to such results, we can’t seriously hope to make inroads on the desktop.

10.09.07

Open Source Mesh Networking

Posted in Tools at 7:39 pm by Erik

Looks like Meraki is getting some open source competition. An open source mesh networking platform could be exciting, especially when combined with a mini-webserver built into the platform to host content even without Internet access.

10.04.07

Stable version demo

Posted in Tools, Wikis at 8:43 pm by Erik

Yesterday I set up a first unofficial demo of stable versions in the configuration which I would like to see used on the English Wikipedia. It’s based on the FlaggedRevs extension by Aaron Schulz & Jörg Baach. Unofficial, because Brion is currently reviewing the code for security and scalability. I’m still slightly worried that we might hit a snag, as the extension goes pretty deeply into the way MediaWiki serves pages, and of course Wikipedia can’t afford to run something that causes major slowdowns.

Still, if you want to get a feeling for the kind of configuration that I, Jimmy Wales and Florence Devouard have expressed support for, play with it. The main new thing you’ll notice is a top right corner icon which indicates the status of the page you’re looking at. On selected pages, the last revision shown to unregistered users is the most recently vandalism-patrolled one, while the rest of the wiki behaves as before. Essentially, in this configuration, it accomplishes four things:

  • Allows us to open up semi-protected pages to editing, since edits have to be reviewed and vandalism does not affect what the general public sees;
  • Allows us, as a consequence, to also use this kind of “quality protection” more widely — e.g. when an article has reached a very high quality, and is unlikely to be substantially be improved by massively collaborative editing
  • Improves our vandalism patrolling abilities, since vandalism doesn’t have to be repeatedly checked: we record who has looked at which changes. Also, changes by trusted users don’t have to be reviewed. The tagging system ties into “recent changes patrolling”, an old feature that has never scaled well.
  • Allows us to make processes like “Featured article candidates” and “Good article candidates” revision-based rather than page-based. Thus we can more easily track when an article reached a certain quality stage, and can better examine whether changes past that point have increased or decreased its quality.

Some wikis would like to use more restrictive configurations than the one shown here. For example, a group in the German Wikipedia community is advocating to let all edits by unregistered users be reviewed before applying them, rather than only doing so on selected pages. I’ve argued against this at some length here. I think the current configuration strikes a good balance of openness, quality, and transparency to the reader. Of course there’s already a big wishlist, some of which should be addressed before taking this feature live.

10.03.07

Managing wiki data

Posted in Ideas, Wikis at 8:04 am by Erik

Volapük is one of those lovely constructed languages which has made it into the 100K list on our multilingual Wikipedia portal. Of course, this is due to some bot imports of census and map information. It’s all fair - the English Wikipedia also bot-generated tens of thousands of “articles” in its early history. It does raise the question whether a freetext wiki is really a good way to maintain such data.

Over at OmegaWiki, we’re experimenting with adding structured data to the multilingual concept entities we call “DefinedMeanings”. A DefinedMeaning can be accessed using any synonym or translation associated with it. An example is the DefinedMeaning Denmark. If you expand the “Annotation” section, you’ll see information such as the countries Denmark borders on, its capital, or currency.

But there’s more to it than meets the eye: the fact that a country can border on another one is expressed by giving a DefinedMeaning a class membership. The class “state” defines what relationships an entity of this type can posses. In this way we define an ontology.

Note that if you log in and change your UI language, all the names will be automatically translated, if translations are available, as each relation type is a DefinedMeaning in its own right. OmegaWiki also supports free-text annotation and hyperlinks. We’ll add more attribute types as we need them.

What does that have to do with the Volapük Wikipedia? Well, OmegaWiki is the kind of technology you could build upon to maintain a central repository of the almanac data that was imported. Then, with some additional development, you could generate infoboxes with fully translated labels directly from OmegaWiki. And, within the free-text Wikipedia, you could focus on writing actual encyclopedia articles that reference this data.

Note that OmegaWiki is not alone in the semantic wiki space. Semantic MediaWiki adds data structure by enriching the wiki syntax itself. Freebase is a closed-source free-content “database of everything.” I’m not interested in closed source technology, but I like a lot of what I see in Semantic MediaWiki, and I expect that the technologies will converge in the end, if only conceptually.

« Previous entries