Multilingual Websites
From BarCamp Switzerland
Wikipedia is multilingual.
Three levels:
- Pioneers develop something new (programming language, wikipedia) using their first language (eg. English around this time)
- Early adopters are multilingual (eg. German and English) but stick to the language of the pioneers.
- Main stream users using their first language which is not English.
There's a mysterious transition between step 2 and 3, here: How did the first non-English Wikipedia get started?
Currently users of non-English languages have a problem:
- Search Portuguese Wikipedia for Oddmuse and you'll find no matches.
- Search German Wikipedia for Oddmuse and you'll find a related page.
- Search English Wikipedia for Oddmuse and you'll find the page!
Compare with Emacs Wiki, where all pages are in the same "namespace":
- Search for Brainfuck using the German Interface on Emacs Wiki. You'll find English pages!
- Search for Rekursiv Ersetzen. You'll find German pages, too.
Compare with Stephanie Booth's blog in English and French, where all multilingual features are conventions: She manually addes French and English abstracts to her posts.
Between total separation (example Wikipedia) and no separation (example Emacs Wiki).
For group 3 above, we want to filter languages in searches, feeds (or Recent Changes).
Sister Sites introduces automation when browsing: Show pages with the same name on a few known related sites.
Near Links introduces more automation when linking: Switch linking targets between sister sites and the local site.
Near Search extends the same mechanism to searching: Since the list of page titles is available from sister sites, these are always searched, and upon request, searches are run on sister sites and the results are merged with the local search results.
There's more background on WikiNameSpace. When reading that page, just keep in mind that essential we're talking about the same topic: It doesn't matter whether a sister site differs in topic, community, or language used.
Eventually automatic translation might solve the problem. Right now translations are "poetic": Orginial blog post in German compared to the automatic translation by Google.

