Main Page

From rallar

Jump to: navigation, search

Articles and theories about social netsites such as Wikipedia. Pages will be made available on Norwegian and English, but usually only as an abstact on the opposite language.
At present it is fairly messy here after some articles has been moved from Wikipedia. Remember that this is a wiki and under contineous reconstruction!


Your continued support is highly appreciated!


Virtual tourism and Wikipedia should fit together as hand in a glove, why then are the presentation of several obvious tourist targets so extremly bad on Wikipedia? I do not believe Wikipedia is to blame for this as such presentations can be both neutral and verifiable, the more likely cause of the problem is that the tourist industry is very traditional and because of this has a hard time adapting to new technologies. Abstract only…


Number of pageviews in Wikipedia has to be solved through embedding of statistics from a secondary service because the central service lacks the necessary data material to produce statistics, and because caching would break down if statistics are stored on the pages. Typically the statistics will change within hours or even minutes, given how parameters are set, while caching of pages could be measured in days. It is therefore strongly advisable to embed the statistics in such a way that the pages can still be cached effectively. Abstract only…


Classified vocabulary for vandalism detection describes how rollbacks of vandalism can be used to train a classifier for a vocabulary to be used on vandalism. The test results for the classifier can be used as a feature for a more advanced classifier such as BACTUS. Testing of contributions against such a vocabulary should be done as a server feature, mainly because downloading large datasets like this will take a long time compared to uploading proposed changes for testing. Read more…


Attribution when reworking of an existing article that has been previously published other places, especially those based upon distict versions from other wikis, is complex and it is difficult to do it in a fully satifying way. If there is no common technological platform then it is likely the only viable solution is to manually write down each main author and all other coauthors, note netsites and other sources. If there is a common technological platform then some parts of this can be automatic. The most common technological platform is in this case Mediawiki, and the most important problem to solve is reuse of content between projects. Abstract only…


Automatic censorship in MMC sites describes methods for automatic censoring, that be statistical methods, text analysis or more ad hoc filtering. The background for this is that MMC sites like Wikipedia is based upon editors that themselves writes and publishes, and without any clear legal subject in those occasions where there are some irregularities. A typical situation is harassment related to publishing certain information about a person, or rumors about this person. Abstract only…

Personal tools