Monday, October 26, 2009

Open PhD Positions in Semantic Multimedia Retrieval Project

OPEN Ph.D. POSITIONS at Hasso-Plattner-Institute (HPI), Potsdam (Germany) starting on the fourth quarter of 2009

Hasso-Plattner-Institute (HPI) is a privately financed institute affiliated with the University of Potsdam, Germany. The Institute's founder and benefactor Professor Hasso Plattner, who is also co-founder and chairman of the supervisory board of SAP AG, has created an opportunity for students to experience a unique education in IT systems engineering in a professional research environment with a strong practice orientation.
(for more information on HPI, c.f. http://www.hpi.uni-potsdam.de/ )

Project Description:
MEDIAGLOBE is part of the THESEUS research program initiated by the German Federal Ministry of Economy and Technology (BMWi), with the goal of developing a new Internet-based infrastructure in order to better use and utilize the knowledge available on the Internet. The focus of the research program is on semantic technologies, which determine contents (words, images, sounds, and videos) not through conventional methods (e.g., combinations of letters) but which are able to recognize and place the meaning of a content in its proper context. MEDIAGLOBE deals with digitalization, analysis, and semantic retrieval of historical, documentary audiovisual content. (for more information on MEDIAGLOBE, c.f. http://theseus-programm.de/theseus-mittelstand-2009/ )

The ideal candidate holds a MS degree in Computer Science or related field and is able to consider both theoretical and practical/implementation aspects in her/his work. Fluent english communication and programming skills are fundamental requirements. Since we are working on a multimedia repository with resources in German language, German language skills are welcome! Preferably the candidate has a background in one of the following
fields:
• semantic web technologies
• knowledge representations and ontology engineering
• audiovisual retrieval and analysis
• semantic search
• innovative web development
• user interface design for audiovisual content

The position starts as soon as possible and is full-time (40h/week) for the duration of the project until Oct 2011. Review of applications will begin immediately and will continue until the position is filled. The successful candidate will tightly work with international partners and has the possibility to pursue PhD work within the scope of the project.

How to apply:
Excellent candidates are invited to apply with:
• Curriculum vitae and copies of degree certificates/transcripts,
• Writing samples/copies of relevant scientific papers (e.g. thesis, etc.),
• Letters of recommendation.

Please send your application in PDF format indicating in the subject 'Application for PhD position‘ via email or via traditional mail to the following contact.

Contact and application:
Harald Sack
Hasso-Plattner-Institut für Softwaresystemtechnik GmbH
Universität Potsdam
Prof.-Dr.-Helmert-Str. 2-3
D-14482 Potsdam, Germany
phone: +49 (0)331-5509-527
fax:
+49 (0)331-5509-325
email:
harald.sack@hpi.uni-potsdam.de
web:
http://www.hpi.uni-potsdam.de/meinel/persons/sack.html

Thursday, October 15, 2009

Opening of the German/Austrian W3C-Office / Teaching the Web at FH Potsdam

Today, I'm visiting the Opening of the German/Austrian W3C-Office at FH Potsdam (only 20 minutes away from HPI with public transportation), which is entitled "Teaching the Web".

Short Opening address by Prof. Johannes Vielhaber, rector of FH Potsdam and by Felix Sasaki, followed by the first speaker.
Klaus Birkenbihl on "W3C and W3C Offices - an Overview", who gives some general information about W3C and some overview about the worldwide W3C offices and their duties. Today, there are 18 W3C offices and on of their main tasks is the recruiting of local stakeholders to become W3C (paying) members. But also have to be mentioned better synergies with local hosts for community building, acquisition of local projects, and fostering new cooperations.

Andrew Vande Moere from the University of Sidney on "Visualization for the Web", author of the information aesthetics blog. He starts with giving an overview on visualization technology ranging from simple data graphs to information art (e.g. the Web2DNA website for visualizing your website as DNA-Sequence...). The main message....ok, open up your data and make your data publicly available (that's my point! Go one step further and make it Linked Open Data!). Then, there are lots of new possibilities for intelligent data mashups (as e.g. the 'the city is the future web'), making use of the data in a completely new way. Two interesting examples for web online visualizations are Google's visualization API and IBM's Many Eyes ('the youtube of visualization'), not to forget tha ultimate data mining &visualization application on your personal data (only for 'the data addicted'), http://your.flowingdata.com/.

Malgorzata Machol from FU Berlin (instead of the announced Prof. Tolksdorf...) on "Why Semantic Web " and the "Semantic Technology Institute". Sorry, but I can't stand those web2.0/3.0/xxx timelines any more. I've seen better motivations for the Semantic Web (at least with better/newer examples). Unfortunately, in this talk the Semantic web is motivated with a rather coarse conception of 'semantics' and (human/machine) 'understanding'. There is too much of human perception (and complex human understanding) mixed up with formal semantics and technology working on formal semantics (and what can be achieved with it w.r.t where are its limits). Please try to achieve more and better differentiation. The second part of the talk is about the STI in Germany and its activities around the semantic web.

After the lunch break, the event continues with Lambert Heller from TIB/UB Hannover on 'Library 2.0 – how the web has (and is) changing education of librarians?'. Today, the library catalogue is nothing but highly structured data, but the problem of uniquely identifying strings (symbols) with subjects (catalogue entries) is only solved on the surface and in a superficial way. There are commonly used (and elaborated) data sources such as the 'Schlagwortnormdatei' or the 'Personennormdatei' that contain reliable structured information about persons (authors) or keywords. But, today these datasets are not publicly available (while it is also not certain, who really owns the copyright, and what kind of copyright at all...). Let's get all these data, triplify it and make it Linked Open Data!!!
Finally, come and visit 3rd BibCamp in Hannover in May 2010:


Patrick H. Lauke from (web evangelist at) Opera Software on 'Standards education - what students need to know about web standards and accessibility'. It', about telling the big picture to the students. Not really the complete design specification or code implementation. ('Standards are Code....and Designers don't care about code') Make clear why...not necessarely how. On the other hand, there's accessibility. Web accessibility is more than blind users and screen readers. accessibility doesn't have to be hard. In fact, a lot of accessibility is just usability! Opera accordingly is invoved in outreach and educational activities on 'Teaching the Web' (Opera: We want to make the Web a better place!).

Petra Rauschenbach from Bundesarchiv on 'Conversion, Digitisation and Internet Gateways. Strategies and their implementation at the Federal Archives of Germany'. Starting with an overview on overview of their stocks and a lot of German archival vocabulary, which I cannot translate into English. Unfortunately, Mrs. Rauschenbach is 'reading' the talk and not talking freely...Also the term 'Findbuch' (i.e. concordance, index) sounds somehow very 'retro' for computer scientiest like me. But anyhow, the content of the Bundesarchiv (Federal Archive) is available on the web....why not make it Linked Open Data??

Henry S. Thompson from University of Edinburgh on 'Teaching Web Architecture'.
students:WWW::fish:water i.e. the relationship between students and the WWW is similar to the relationship of fish and water. So, why teaching a fish about the water? Learn to think about something which usually is invisible (i.e. the technology we are using everyday). So how should we teach web standards? Basic didactic principles like 'Analysis' (decomposing concepts), Contradiction (to something taken for granted), Analogies (being offered for confounding expectations) are presented. I like the last one: E.g. 'Standards come from official standards bodies', but consider the following: IEEE (semi-private bodies) / IETF (bunch of volunteers, left-overs of a hippie community) / W3C (2 private Universities, one semi-private body and companies paying some fee)...all of them have NO legal authority. More on this can also be here: 'Identity, URIs and Semantic Web'.
Conclusion: Teaching needs to draw on theatre as well as educational theory, 'Keeping people engaged is the core of education.'

Jens Meiert from Google Inc. on 'Modern Web development – a view on the future of HTML, CSS and development practices'. As expected, we are starting in the past, 1990 HTML 1.0 by Tim Berners-Lee. What about development practices? Problems in the past ranged from technological limitations, support limitations over low output quality up to bad user experience. In the Present, more and more we are facing a separation between behavior, structure, and presentation.

Erik Wilde from School of Information, UC Berkeley on 'Information Engineering' (BTW, my very first on lecture on web technology back at FSU Jena was based on Erik Wilde's book, this was before I had written my own ;-). He's using Google sidewiki for his lecture handouts and presentations, which can be annotated by the students simply by installing the Google toolbar in your browser.
Information Engineering is bigger than the web. It includes high level skills such as information and service modelling, or knowledge about complementary architectures. The Web of Things is nothing but applied Information Engineering. Engineering can be defined as constrained-based design and implementation.

[to be continued]

Tuesday, September 15, 2009

Corporate Semantic Web Workshop in Berlin, 15.09.2009


Heute findet im Rahmen der XInnovations 2009 der Corporate Semantic Web Workshop an der HU Berlin statt, der heute abend im 3. Semantic Web Meetup seinen Abschluss finden wird.

9 Uhr morgens, noch ist alles ruhig. Kaffee allerdings scheint am morgen ein Fremdwort zu sein. Zumindest reicht es 30 Minuten vor Beginn der Veranstaltung lediglich für ein anscheinend vom Vorabend "geplündertes" Buffet mit zwei leeren Thermoskannen und ein Paar verlorener und zum Teil gebrauchter Kaffeetassen. Da lob ich mir doch Graz und die I-Semantics, bei der wir vor gut einer Woche rund um die Uhr mit leckerem illy-Kaffee (Ja! ich würde mich über eine Product-Placement-Vergütung sehr freuen...:) und Espresso versorgt wurden....

Prof. Adrian Paschke von der FU Berlin eröffnet den Corporate Semantic Web Workshop mit einer kurzen vorstellung der 'Vision' des Corporate Semantic Web Projekts, das vom BMBF seit 2008 gefördert wird. Für mich natürlich interessant der Unterpunkt und Forschungsbereich 'Corporate Semantic Search', also bin ich auf den später geplanten Vortrag zum Thema gespannt.

Gökhan Coskun aus Adrian Paschkes Forschungsgruppe schließt an mit einem Vortrag über 'Effiziente Verwaltung von Unternehmenswissen - Corporate Ontology Management'. Wozu braucht man Ontologien in einem Unternehmen? Ganz einfach, zur Steigerung der Produktivität und der Effizienz der Informationsverarbeitung (typische Wirtschaftsinformatikerantwort...). Als Hinderungsgrund für den Einsatz im Unternehmen identifiziert Coskun die 'Akademische Orientierung' vorhandener Werkzeuge. Er sieht Ontologien als normativ und allgemeingültig, was einen weiteren Hinderungsgrund bzgl. deren Einführung im Unternehmen darstelle. Dem möchte ich widersprechen, da Ontologien stets auf einer 'gemeinschaftlichen Vereinbarung' beruhen (explicit, formal specification of a shared conceptualization), die den jeweiligen Blickwinkel der Beteiligten abbildet. Allgemeingültigkeit wäre eine Eigenschaft, die gar nicht erreicht werden soll. Wir betreiben ja schließlich Informatik und nicht Metaphysik, d.h. unser Ziel ist nicht die normative und allgemeingültige Beschreibung der gesamten Welt, da eine Ontologie stets der Interpretation des Benutzers, seinem Kontext und seiner Pragmatik unterliegt.

Nach der viel zu kurz geratenen Kaffeepause geht es weiter mit Olga Streibel von der FU Berlin und dem Thema 'Semantische Suche: Tagging und Wissensgewinnung'. Als 'Extreme Tagging' werden jetzt Tags eingeführt, die selbst als Objekte für das Tagging hergenommen werden können, d.h. die eigentliche Tag-Relation lässt sich taggen. Was gewinnt man dadurch? Streibels Erklärung, dass man 'durch Tags Assoziationen bildet, mit denen man Ontologien erzeugt', hilft mir hier nicht besonders weiter. Leider wurde meine diesbezügliche Frage am Ende mit der Bemerkung, dass wir das offline diskutieren sollten, etwas unschön abgebügelt. Dabei wäre es meines Erachtens für den Vortrag von zentraler Bedeutung, eben diesen Vorteil des Ansatzes herauszustellen und gegenüber einem einfachem (individuellem) Konzept-Mapping abzugrenzen.

Ralf Heese von der FU Berlin referiert als nächstes über 'Einfach Verlinken in Wikis / Experten mittels Wikis finden'. Es geht beim ersten Thema dabei darum, den Benutzer mit Hilfe von Hintergrundwissen beim Setzen von Links durch entsprechende Vorschläge zu unterstützen. Das zweite Thema widmet sich der Frage, wie sich aus den History-Logdaten eines Wikis Experten zu bestimmten Themen bestimmen lassen.

Gleich Mittagspause....! Leider nur mit Gulaschsuppe/Möhrensuppe und Rundgang über den 'Büchermarkt' im Hof vor dem HU-Hauptgebäude.

Wieder einmal ein paar Minuten zu spät bei Richard Hubers (FIZ Chemie GmbH) Kurzvortrag über 'ChemgaPedia - virtuelle Forschungsumgebung'. Und schon wieder eine Wortneuhülse: 'Wissenscloud' - ohne sich darüber im Klaren zu sein, was exakt damit gemeint ist.

Änderung im Programm, 'Semantic Profiles in Universal Plug and Play AV', Vortrag eines Diplomanden zur semantischen Anreicherungen von UPnP AV Daten.

Weiter geht es mit Johannes Krug von x:hibit, der das Berliner Museumsportal (finanziert durch entsprechende E-Commerce Anteile, z.B. E-Ticketing, u.a.) vorstellt, gefolgt von Radoslav Oldakowski von der FU Berlin mit dem Thema 'Semantische Datenintegration und Suche im Museumsportal Berlin'. Semantisch unterstützte Suche, die Suchbegriffe um verwandte Begriffe ergänzt...mal sehen, ob sie dies auch (1) intelligent und (2) visuell ansprechend tut.

Sebastian Hellmann gibt mit 'DBPedia Live Extraction' als erstes eine kurze Einführung in das zentrale Hub der Linked Open Data Cloud, der DBPedia, gefolgt von diversen Anwendungen rund um die DBPedia. Übrigens nutzt yovisto.com (including semantic features) ebenfalls DBPedia-Daten zur Implementierung einer echten explorativen (semantischen) Suche.

[...to be continued @ Semantic Web Meetup ]

Monday, September 14, 2009

W3C-Tag an der HU Berlin, 14.09.2009

Es ist mal wieder soweit: W3C-Tag an der HU Berlin im Rahmen der XInnovations 2009, diesmal mit Prof. Felix Sasaki als Vertreter des lokalen W3C-Büros an der Uni Potsdam....und es geht auch schon gut los. Prof. Robert Tolksdorf, der die Veranstaltung eröffnen sollte, steckt im Berliner Verkehr fest und so warten wir erst einmal 20 Minuten bis hier irgendetwas heute morgen passiert.

Atemlos kommt Herr Tolksdorf mit 20 minütiger Verspätung an, entschuldigt sich kurz (ohne dabei nicht auch einen kurzen Hinweis auf die Situation der Berliner Verkehrsbetriebe zu geben und auf seine Solidarität mit den Berliner S-Bahn-Fahrern hinzuweisen) und stellt das STI (Semantic Technology Institute) vor.

Interessanter wird es jetzt schon mit dem ersten, dem 'Semantic Web' gewidmeten Vortrag, genauer geht es dem Titel entsprechend über das 'Rule Interchange Format' (RIF) und das neue OWL 2, von Prof. Adrian Paschke von der FU Berlin. OWL 2 verwirft die übliche Dreiteilung in OWL-Light, OWL DL und OWL full und definiert verschiedene OWL DL Sprachprofile bzgl. ihrer 'worst case' Berechnungskomplexität. Dabei lässt sich OWL EL sogar in Polynomialzeit berechnen (daneben existieren noch OWL QL und OWL RL). RIF als Austauschformat für Regeldialekte bringt ebenfalls wieder eine Unmenge an neuen Syntaxvarianten. In diesem Zusammenhang wird auch auf ein Handbuch hingewiesen (Handbook of Research on Emerging Rule-Based Languages and Technologies, IGI Global), das sich allerdings nicht gerade durch seinen Preis (> 300 Euro) empfiehlt...

Prof. Felix Sasaki vom deutsch-österreichischen W3C-Office stellt als nächstes die aktuellen Entwicklungen rund um den neuen HTML5 Standard vor (hier ein Link auf die im Vortrag gezeigten Beispiele). Warum eigentlich jetzt HTML 5, nachdem bereits 2000 XHTML 1.0 veröffentlicht wurde und die Entwicklung von XHTML 2.0 auf vollen Touren lief? Nun, die Entwicklung von XHTML 2.0 wurde abgebrochen, der Anspruch ein 'sortenreines' XML im Browser einzuführen (eine 'Revolution') ist gescheitert. Nach einer von Opera durchgeführten Studie 2008, ist lediglich 4.13% des gesamten Web-Codes tatsächlich valide. Daher versucht man mit HTML 5 eine (fehlertolerante) 'Evolution' des alten Standards zu realisieren.
Eigentlich ist HTML5 genau genommen sogar ein Rückschritt, da es auf eine dezentrale Erweiterung über einzubindende Namensräume zugunsten eines eindeutig zu interpretierenden DOM-Baumes verzichtet. Dahingehend sind Probleme mit semantischen Erweiterungen, wie z.B. RDFa oder Microformats vorprogrammiert!

Mittagspause im Café Chagall mit Bliny und saurer Sahne mit anschließendem Rundgang zu Dussmanns Kulturtempel...

Thomas Caspers spricht zunächst einmal über Barrierefreiheit (Die deutsche Übersetzung der WCAG 2.0 a.k.a. Web Content Accessibility Guidelines). 'POUR' steht für die vier Grundprinzipien der Richtlinien für barrierefreie Webanwendungen: Perceivable, Operable, Understandable und Robust. Allerdings konnte ich die beschriebenen Übersetzungsprobleme ('programmatically determined' usw.) nicht nachvollziehen...vielleicht hätte man mal einen Informatiker fragen sollen....

Mit gut 20 minütiger Verspätung (dank der Ausdauer des Vorredners) beginnt der Vortrag von Joachim Neuberth über SKOS (Simple Knowledge Organization Systems). Warum müssen manche Vortragende nur immer so leise sprechen. Das Verfolgen des Vortrags gestaltet sich nicht wirklich einfach (was nicht etwa in der Komplexität des Themas begründet liegt). Zugegebenermaßen ist aber auch die Entwicklung von SKOS und das dahinter liegenden Datenmodell nicht so besonders spannend. Besser wird es erst, als unterschiedliche Ressourcen, wie z.B. die Library of Congress Headings oder die französische Nationalbibliothek aufgezeigt werden und am Ende dann doch Linked Open Data angesprochen wird.

Prof. Felix Sasaki widmet sich als nächstes dem Thema Metadaten für Multimedia und berichtet von der W3C Metadata Annotations Group. Das Ziel der Metadata Annotation Group kann als "DublinCore + X" paraphrasiert werden, also ein minimales Multimediametadatenschemas mit Mapping zu bereits existierenden Formaten. Der zweite Teil des Vortrags beschäftigt sich mit XProc (XML Pipeline Language) zur Modellierung und Beschreibung von XML-Verarbeitungsketten, die eine Pipeline-artige Verarbeitung von wechselnden Validierungs- und Transformationsschritten unter Ausnutzung einer rudimentären Programmlogik (konditionale Verarbeitung, Iterationen, Selektive Verarbeitung, Ausnahmebehandlung, u.a.) mit heterogenen XML-Daten erlaubt.

[Nach der noch folgenden Diskussion, weiter mit "Brezeln und Wein" im Foyer..... :)]

Links:

Friday, September 04, 2009

i-Semantics 2009 in Graz (Day 03)


The second day of i-Semantics ended with party and dancing to live music performed by 'Egon 7', and of course with a lot of interesting talks with interesting and nice people :) This morning at breakfast, Jörg really looked as if he had not really had got enough sleep (he continued to party after the official ending somewhere downtown :)

Keynote
Peter Kropsch (Austrian Press Agency): When technologies are drivers, integrated concepts are needed for success,
talking about scenarios of future media convergence, the development of the information technologies and the possibilities opened up by them, esp. about the expectations of APA what to get out of semantic technologies. In general, keynotes without some slides (to get hold of the information structure of this 60 minutes talking) are rather difficult to follow, if the speaker is not able to awake sufficient enthusiasm in the audiences...

The Role of Semantic Technologies in Future Internet Track,
Klaus Tochtermann (Know-Center, TU Graz): The Role of Semantic Technologies in the Future Internet
explained, why the vision of the future internet is not only 'old wine in some new bottles'. Although all the components that consttute the Future Internet (Content, Services, Security, People, etc.) is already around, the main contribution of FI is the integrated view and interaction of thes components.

Marco Pistore (FBK): Highlights of the Future Internet Conference Berlin
In parallel to i-Know/i-Semantics in Berlin the 2nd Future Internet Symposium took place...

Jan Reichelt (Mendeley): Mendeley - A Last.fm for Research?
Mendeley wants to help researchers to manage their resources. Research is inherently social. Mendeley offers a desktop tool, similar to last.fm audioscrobbler that (a) analyzes your search papers and enables full-text search and (2) smart metadata extraction and generation. The mendeley web service offers an online backup of your own research library and enables on-click import from google scholar and other citation services.
Interesting stuff...I have just registered and download my desktop client for Mac OSX...

Paolo Rosso (Universidad Politecnica de Valencia): Geographical Information Retrieval and Toponym Disambiguation
Geographical filtering of information retrieval results by expanding query strings with semantically related geographical information (...like that stuff!).

Thursday, September 03, 2009

I-Semantics 2009 in Graz (Day 02)


The first day of i-Semantics ended with a guided tour through the old section of the town (that we already had enjoyed the day before), and a welcoming party located in the Kunsthaus Graz. The Kunsthaus Graz has some extraordinary architecture and reminds me more of some strange submarine lifeform than of a building. Unfortunately we had to leave earlier to prepare our talk for today....

The second day of i-semantics 2009 started in a very conveniant way - again great espresso, fresh fruits, and nice talking.

Linked Data Track
Harald Sack (HPI): How can Software Developers benefit from Linked Data Vocabulary,
Well, the title is misleading somehow. Of course software developers do benefit from the use of linked data, but.....the problem is the feasibility of APIs and automated tools for mapping knowledge representations to object-oriented data structures. I had to deliver the talk (instead of Matthias, who actually was the main contributor, but is STIL on vacations) and I'm afraid that I have overstressed my audience with lots of software engineering details (bad for an introductory first talk of a session on linked data...).

Jörg Waitelonis (HPI): How to augment Video Search with Linked Data,
Jörg introduced yovisto.com and its connection to the Linked Open Data cloud, featuring our 'exploratory search' widget, suggesting additional search results to the user and providing seredipenditios findings.

Atif Latif (Know-Center, TU Graz): The Linked Data Value Chain: A Lightweight Model for Business Engineers
First, Atif stated significant differences between the aims of the scientific community and enterprise level business. He introduced the concept of Linked Data Value Chain, where the most valuable output consists out of an increase of the (code/data) readability for the human engineer, thus also serving the same duty as Matthias work in making semantic web software engineering very, very simple. So.....let's make Semantic web a No-Brainer (finally) ;-)

I had to skip parts of the afternoon sessions (Poster/Short-Paper and Student Sessions) due to interesting talks and reviewing work (deadline today...unfortunately). But I'm looking forward to more interesting contacts, lots of delicious espresso, maybe again some cake and pastries, and then of course the Gala Diner in the evening...

The Gala Diner startet with an interesting performance of Austrian cultural heritage. In particular, we listened to a musical performance of 4 original "Wurzelhorns" (Wurzelhorn is a variation of the well known Alphorn).

Wednesday, September 02, 2009

i-Semantics 2009 in Graz (Day 01)


Although already having been at the PC of i-Semantics some years ago, this year it's the very first time for me being in Graz. We've already arrived yesterday and spent an beautiful day with sight seeing in the baroque historic section of the town (including the Schlossberg with 260 steps in the midday sun).

The conference is starting right now with greetings by Hermann Maurer (TU Graz) and Klaus Tochtermann (Know-Center, TU Graz). I will report about all upcoming the conference highlights...

Keynote 01,
Paolo Traverso: Towards a future Internet of Services and Content
Pointing out differences between real worl services and software services (duration, accessibility, time contraints, etc.) towards the proposition of a 'Future Internet', where research is focussed on modeling, compositing, and monitoring real world services...

Social Semantic Web Track,
Liana Razmerita: Towards a New Generation of Social Networks: Merging Social Web with Semantic Web
Conclusion: Semantic Web Technologies will enable the development of a new generation of social networking applications...nothing new up to now..

Web 2.0 und Neue Medien Track,
Watraud Wiedermann (APA Defactor): Semantische Suche in Medienarchiven
Demonstration of search application on newspapers, deploying synonym search and search for significant co-occurrences.

Knowledge Visualization Track,
Sven Havemann (TU Graz): Patterns of shape Design
Raising the question 'What's the point on semantic enrichment of images?' One of the problems in computer graphics is that different communities use slightly different vocabularies with inherent ambiguities. Furthermore, there is also a 'semantic gap' in the way that (correct) CG-algorithms often do not deliver the expected result, due to the lack of a proper vocabulary. This leads Haveman to the development of a conceptual reference model (like CIDOC-CRM) for shapes.


Innovative Funktionen für E-Learning Track,
Viktoria Pammer (TU Graz): Intelligente Ad-hoc Erstellung von Lerninhalten mit semantischen Technologien
Presentation of the APOSDLE system that proposes learning material to the learner according to his current skills and needs. Seems to be an Intranet (non web based) system...

...and now on for some espresso and sone 'real Austrian' cake and pastries ;-)

Corporate Semantic Web Track,
Fan Bai (Uni Duisburg/Essen): Exchanging Knowledge in Concise Bounded Descriptions. An Approach to Support Collaborative Ontology Development in an Distributed Environment
Interesting variant of versioning of ontologies (for distributed ontology development). Using simple cvs is not suitable, simply because most ontologies simply consist out of one huge file...

Usage and Case-Studies Track,
Sanja Vranes: Maturity and Applicability Assessment of Semantic Web Technologies


ok...ready for 'Guided Tour from Conference Venue to Welcome Receiption' at Kunsthaus Graz

[see you again tomorrow...]

Sunday, June 21, 2009

Digitale Kommunikation

Am 21. Mai 2009 ist unser Neues Buch 'Ch. Meinel, H. Sack: Digitale Kommunikation' bei Springer erschienen, das ich hier heute vorstellen möchte. Hervorgegangen ist das Buch aus dem Absicht, unserem 2003 erschienenen Buch 'WWW - Kommunikation, Internetworking, Web-Technologien' eine zweite Auflage folgen zu lassen. Dies allerdings erwies sich als schwierig. 1200 Seiten in einer Disziplin, die sich so rasant weiterentwickelt, dass sich in den mehr als 5 Jahren, die seither vergangen sind eine Stofffülle angesammelt hat, die in einem Band einfach nicht mehr ausreichend behandelt werden kann.

Daher unternahmen wir Absprache mit dem Verlag das Wagnis, den Band in seine drei Grundbestandteile zu zerlegen und diese separat als einzelne Bände einer Trilogie zu veröffentlichen. Deren erster Band, die 'Digitale Kommunikation' liegt nunmehr vor. Die beiden Folgebände 'Internetworking' und 'Web-Technologien' sind in Vorbereitung und werden bald erscheinen.

Worum geht es im ersten Band der WWW-Trilogie? Wie schon im ersten Teil des 2003 erschienenen WWW-Buches dreht es sich in diesem Band um die Grundlagen der Rechnerkommunikation, die durch eine ausführliche historische Betrachtung eingeleitet werden und insbesondere die Gebiete der Kodierungstheorie und der Multimedia-Kodierung und -Komprimierung, sowie die Grundlagen der Kryptografie abdecken.

Hier das Inhaltsverzeichnis:

DIGITALE KOMMUNIKATION

(1) Prolog
(2) Geschichtlicher Rückblick
(3) Grundlagen der Kommunikation in Rechnernetzen
(4) Multimediale Daten und ihre Kodierung
(5) Digitale Sicherheit
(6) Epilog

In den Anhängen befindet sich ein ausführliches Personenregister, das vom ägyptischen Pharao Ramses II. und seiner 'ersten' Bibliothek bis hin zum 1970 geborenen Vincent Rijmen, dem Miterfinder des AES-Verschlüsselungsverfahren reicht. Das Buch stellt auf gut 430 Seiten mit zahlreichen Abbildungen die fundamentalen Grundlagen der digitalen Kommunikation dar. 17 einzelne Exkurse vertiefen dabei wichtige Themengebiete, die vielleicht nicht für jeden Leser gleichermaßen von Interesse sind. Jedes Kapitel ist mit einem ausführlichen Glossar abgeschlossen und über 250 Literaturverweise und Referenzen regen zum Weiterlesen an.

Weitere Informationen:

Sunday, May 31, 2009

ESWC 2009 at Heraklion, Greece, Day #01

Perfect loction for a summer holiday....as well as for the European Semantic Web Conference 2009. Arriving yesterday afternoon at a beautiful sea resort, blue skies, blue sea, white beaches...Kind of an environment that makes it hard for speakers and presenters at a scientific conference keeping their audience interested ;-)

First Day starts with several Tutorials and a Workshop. I have chosen the "Evaluation of Semantic Technologies" workshop for the morning. I cam a little bit late, thus the first presentation I fully attended was Ulrich Küster's ''evaluation of Semantic Web Service technology", which is not exactly my research topic. Interesting was the discussion whether a binary measurement for ranking results of service matching is appropriate or not (it's not...).

In the following presentation, Jerome Euzenat is talking about 'Ontology matching Evaluation". This topic for me is much more interesting (although not being my research topic).

Unfortunately the internet connection was so bad at ESWC 2009 that 'Live Blogging' was almost not possible. Sorry for that! But, at last I switched to twitter to give some 'live' updates. Just search for the hashtag #eswc2009 at twitter to get a timeline of tweets related to the conference program of ESWC 2009.

Chris Bizer from FU Berlin is the last presenter in the tutorial with a talk about 'Evaluation of Semantic Storage Systems'.

Monday, March 23, 2009

Artifacts of modern information society

Artifact, i.e. an error or misrepresentation introduced by a technique and/or technology. Most obvious are artifacts in lossy image compression techniques such as, e.g. the jpeg compression algorithm. For jpeg, the entire image is divided into 8x8 pixel squares (of course for all three color channels... but not RGB. For jpeg the RGB picture is first transposed into the YCrCb color space, i.e. Y for luminance and Cr, Cb for chrominance being subdivided into red and blue. BTW, there is also subsampling, i.e. luminance is sampled with higher accuracy than chrominance w.r.t the human sensory perceiption). Then these 8x8 squares of intensity values are transformed into the frequency domain via a discrete cosine transformation. Up to now, no artefacts, no data loss.....

But, next comes a quantification algorithm that rounds the frequency values within the 8x8 squares. If this is done with high accuracy, often one doesn't even notice it within the picture. But, increasing quantification also results in higher data compression, which also results in more data loss, which creates ... artifacts. You can see the typical jpeg raster effect, whenever using high jpeg compression.Here's a cool animation of an image being jpeg compressed 600 times (in a 20 second short movie). The simple algorithm goes:
Open the last saved jpeg image
Save it as a new jpeg image with slightly more compression
Repeat 600 times.



Generation Loss from hadto on Vimeo.

Monday, March 09, 2009

CeBIT 2009 - Aftermath

So, how was this year's CeBIT computer fair? The media have been rather pessimistic beforehand. 25% less exhibitors, 20% less visitors, but despite the worlwide economic crisis you'll find confidence everywhere ... at least now.

This year, I visited CeBIT only for a single day as exhibitor as well as a visitor. On friday, March 6th 2009, there was 'startup day' at the HPI booth in hall 9, B.10., and we presented our video search engine yovisto (well, it was almost only Jörg who did the presentation, while I was playing 'visitor'...). As every other year, for me hall 9, the 'future parc', is the most interesting hall of all 26 (!) halls. BTW, did you now that the Hannover fair area is the largest in the world? In hall 9 you could find the latest research of universities and research institutions (as well as some government and public administration...).

One thing that was pretty obvious was that hall 9 was not as crowded with exhibitors as the years before (at least in the rear areas this could be realized pretty heavily). I also was a little bit disappointed about our former booth at the joint exhibition stand of the universities Thüringen, Sachsen, and Sachsen-Anhalt (=Mitteldeutschland), because there was nothing really interesting to see, and most exhibitors were busily sitting in front of their computer screens.....presenting their 'backside' to the visitors and avoiding contact...not really inviting at all. No wonder, if they don't make it in the run for excellence...


But, on the other hand, I really liked the exhibition stands of the Fraunhofergesellschaft and the THESEUS research programme. Esp., at THESEUS (user scenario CONTENTUS) they demonstrated the workflow for digitalization, restauration, and indexing audiovisual media (...just my subject). While digitalization and restauration work pretty well, the indexing capabilities are still rather limited. You might search for 'similar' motives, but identifying individual persons is still subject to ongoing research.



After all, I was really happy to stay just for one day. Some of our co-workers had to stay for the entire week (my sincere condolences!). Was it worth while? For one single day, of course yes it was!

P.S. Alas, there really was a Latvian company that showed 'brains' in various colors.... ;-)

Wednesday, February 18, 2009

Radio Fritz in der Semantic Web Vorlesung

Vor einigen Wochen saß ein tatsächlich Radioreporter in meiner "Semantic Web" Vorlesung hier am HPI in Potsdam. Allerdings - so steht zu befürchtet - hatte er nicht unbedingt viel Spaß am Thema der aktuellen Stunde, denn es stand die "Semantik von OWL" auf dem Programm, die ohne jegliches Vorwissen genossen, doch zu erheblichen Nebenwirkungen (z.B. Zweifel am eigenen Selbst- und Weltverständnis, Aufmerksamkeitsdefizit oder Schlafstörungen) führen kann ;-)

Der Grund für den Besuch lag (leider) nicht am Thema der Vorlesung als vielmehr an der Aufzeichnung derselben und dem aktuellen Angebot an HPI-Lehrveranstaltungen auf itunesU (von dem hier schon berichtet wurde). Radio Fritz hat ein kurzes "special" produziert, in dem über dieses Angebot berichtet wird und auch einige (unserer) Studenten -- und am Ende auch ich -- zu Wort kommen. Nette Sache ... aber ich zweifle daran, dass der Reporter tatsächlich, wie von ihm behauptet, erst "nach 10 Minuten" nichts mehr verstanden hat ;-)

-> Link zum podcast

 

Wednesday, January 14, 2009

HPI Vorlesungen auf iTunesU

Seit gestern (Montag, 13.1.2009) bieten neben dem Hasso-Plattner-Institut der Universität Potsdam auch die Albert-Ludwigs-Universität Freiburg, die RWTH Aachen und die LMU München Vorlesungsaufzeichnungen und Lehrangebote via iTunesU als Video/Audio-Podcasts an. Das Ganze startete mit gehörigem Presse-Rummel (siehe Links) und ich bin schon einmal auf die Nutzungszahlen gespannt, über die ich hier natürlich wieder berichten werde....

Natürlich ist das Thema Vorlesungsaufzeichnungen und deren Distribution via Podcasts nichts Neues. Bislang war der "exklusive" Distributionsweg über Apple iTunes' separate "Bildungs-Channels" nur englischsprachigen -- also vorwiegend US-amerikanischen -- Universitäten vorbehalten. Nichtsdestotrotz findet man aber auch im regulären iTunes Angebot den ein oder anderen Bildungshappen, vielleicht auch von einer nicht ganz so exklusiven Universität.

Und wenn man nicht unbedingt zum proprietären "Marktführer" Apple gehen möchte gibt es da natürlich auch noch das universelle akademische Videoportal yovisto mit seinem unschlagbaren Vorlesungsangebot von mehr als 6000 Vorlesungen und wissenschaftlichen Vorträgen, das zudem auch noch inhaltlich durchsucht werden kann...

Links:

Thursday, November 13, 2008

3. tele-TASK Symposium am HPI in Potsdam

Heute und morgen (13./14. November 2008) findet am Hasso-Plattner-Institut in Potsdam das 3. tele-Task Symposium statt. Ich freue mich auf ein spannendes Programm als auch auf interessante Gäste (unter anderem von der ETH Zürich mit dem Projekt REPLAY, Andreas Nürnberger von der Uni Magdeburg, das Fraunhoher IDM aus Illmenau und viele mehr...).

Natürlich werde ich selbst auch im Programm vertreten sein zum Thema "Semantisch unterstützteu Suche und Navigation in audiovisuellen Datenbeständen" (Slides gibt es später hier via slideshare).

Tuesday, November 04, 2008

Trendseminar "Semantische Technologien" in Stuttgart am 5. Nov. 2008

Morgen werde ich in Stuttgart die Moderation durch das Seminar "Semantische Technologien - Wissen intelligent und gezielt nutzen", initiiert durch die MFG Fazit Forschung - Informations- und Medientechnologien in Baden-Würtemberg. Dabei geht es insbesondere um die Anwendung von Semantic-Web Technologien in Unternehmen.

Auf der Gästeliste stehen unter anderem Prof. Rudi Studer vom AIFB Karlsruhe und Tassilo Pellegrini von der Semantic Web Company aus Wien, und ich freue mich schon auf die abschließende Podiumsdiskussion, in der ich von den Gästen ihre Meinung zum aktuellen Marktpotenzial der semantischen Technologien, den Chancen für mittelständische Unternehmen und die Zukunft des Semantic Web erfragen werde....

Wednesday, September 24, 2008

W3C-Day, Berlin, 2008 (at XInnovations 2008)

Today, I'm going to visit the annual W3C-Day being connected with the XInnovations 2008 in Berlin. According to the program the sessions were supposed to start at 9 am (For this reason I got up rather early thismorning....) but also the first speaker seemd to think that 9 am is a little bit early ;-) So we started with a 20 minutes delay. In the introducing talk, Klaus Birkenbihl from W3C Semantic Web Activity Group is giving an introductory talk on the Semantic Web, solving the question, how to explain Semantic Web to an (more or less' ordinary web user. Not a simple task, but the best you can do ist to explain it via examples, showing that integrating information in the web today is a rather tedious and extensive manual work. Of course, with semantic web technologies, automated integration of heterogeneous data might soon be possible...

Unfortunately, the speaker for the second talk did not show up. Moreover, it should have been a presentation about the semantic search engine ConWeaver, which I know very well and I was rather curious about its progress. Therefore, the session continues with some kind of RDF tutorial being presented by Lars Bröker from Fraunhofer IAIS. Next, a short introduction in SPARQL is given by Thomas Tikwinski from Fraunhofer IAIS und W3C DE/AT.

[to be continued]

Tuesday, September 23, 2008

XInnovations 2008, Berlin, Day 02 - Sept. 23, 2008

Today, I'm going to visit the 'Corporate Semantic Web' workshop at XInnovations 2008 in Berlin. At least it seams that semantic technology has reached industry and corporations. "There is no market for semantic technology", as Christoph Tempich from Detecom Int. quotes a former oracle statement in his talk "Analytics drive the Corporate Semantic Web". Therefore, you just have to provide another label, which is 'Enterprise Information Management' with semantic web technology as underlying technology.

During the coffee break I followed a discussion on the planning of an 'Asocial Semantic Web Workshop' for the next WWW conference or ESWC conference. The goal o fthe workshop should be to show in which way the semantic web is vulnerble by SPAM or other offensive techniques, as e.g. denial of service by providing a deadly RDF-sequence that causes temporary data to grow exponentially.....sounds rather intriguing. I'm looking forward wo contribute ;-)

The 2nd session this morning starts with a presentation from Markus Luczak-Rösch from FU Berlin on 'Corporate Ontology Engineering'. Next, Holger Seubert from IBM is presenting 'Enriched Content Browsing', i.e. during page load in the traditional web, the web page is enriched with additional content. The text of the web page is analysed and terms of interest (info spots) are selected and linked with additional contextual information (from the web, from corporate data bases, etc. in another frame of the same window) without leaving the current context.

We had lunch in a small cafe underneath the nearby public railway with russian dishes (Cafe Chagall, Georgenstr. 4, 10117 Berlin). The pelmeni was really delicious!

The afternoon session starts with Thomas Hoppe from Ontonym with a presentation on 'Corporate Semantic Web'. According to his interpretation, the general 'Semantic Web' concept of Tim Berners Lee cannot be simply transported into the corporation as it is. Inside the corporation, it's a different world compared to the outside. All users are employees, vocabulary is (most times) strictly controlled, there are strict access restrictions, services have to be integrated in portals and corporations have to support corporate processes. The session continues with a presentation by Ralf Heese from FU Berlin on 'Corporate Semantic Collaboration'. He introduces the simple text-annotation tool loomp which has the purpose to enable nonexpert users to provide semantic annotations.

[to be continued tomorrow, W3C-Day, XInnovations 2008, Berlin, Day 03...]

Monday, September 22, 2008

XInnovations 2008, Berlin, Day 01, Sept. 22, 2008

For the third time, I'm attending the XInnovations (formerly known as XML-Days) 2008 in Berlin. Although I was often rather dissapointed about the quality of the conference program (you might refer to my previous posts about XML-Tage Berlin here), I decided to give it another try (simply because I can reach Humboldt University in Berlin with local public transportation in about 45 minutes). Also this time, there seemes to be some emphasis on semantic web technology (at least considering the program, there are Semantic Wikis in the Corporate Wiki track and anoteher Corporate Semantic Web Workshop, not to forget the Semantic Web topic in the PhD-Forum).

I started the first day of the coference with participating the Coprporate Wiki Infotag. Denny Vrandecic is talking about the "Semantic Media Wiki" from AIFB Karlsruhe. Denny is starting his talk with some historical facts about WIkipedia. Interesting thing to mention, according to a study from Aaron Swartz, only 2% of all Wikipedia users (it come up to 1.400 people) are primarily responsible for all article changes. This contradicts the commonly assumed opinion that wikipedia is written by millions of users. Then he was introducing the semantic web in general, by stating that the semantic web is nothing but things (nodes, concepts) being connected by certain relationships, forming graphlike structures that can again be related to each other. According to his definition, a semantic wiki is nothing but graphs being created from wiki data.

The next talk I'm attending is in the Ph.D workshop. Olaf Hartig is talking about 'Trustworthiness of Data on the Web'. With the Semantic Web more and more software agents are taking decisions based on (RDF-based) data on the web. But how can we trust those data? Olaf is developing an RDF trust model as a basis for trust assesment and trust-aware data acces. He suggests a scale from [1-;1], where -1 represents 'absolute distrust' and +1 'absolute trust' for a statement. Now, all relationships in an RDF-graph can be weighted with according trust values ranging from [-1] to [+1]. Trust into a set of statements can be expressed with aggregated trust functions ranging from a cautious (conservative) minimum to a slightly optimistic median. The formal trust vocabulary can be found here. Next, criteria for trust assessment are collected and three different trust assessment strategies are defined: user-based (ask the user on his/her opinion about the trustworthiness), provenance-based (taking into account the trustworthiness of the referring users), and opinion-based (recommendations by other users according to their own trustworthiness).

The afternoon session started with Nils Barnickel from Fraunhofer IOCS with a talk on 'Semantic Mediation between Loosely-Coupled Information Models in Service Oriented Architectures'. Semantic descriptions of Web Services are supposed to enable data and service interoperability. One problem being addresses ist the lack of efficient ontology mapping options in current existing onlology languages (although OWL does have a differentFrom or sameAs operator, complex mappings deploying concepts with totally different subgraphs or 1:n, n:m mappings are missing).

Ok, now it's definitely time for a coffee break. after that, I will be joining the 'World Cafe' session, where I will participate in the discussions instead of writing blog.

[to be continued tomorrow, XInnovations Day 02, Corporate Semantic Web Workshop]

Tuesday, July 29, 2008

Happy 50th Birthday NASA

Well, NASA, the National Aeronautics and Space Administration, celebrates its 50th birtday. On July 29th, 1958 US-President Eisenhower signed the „National Aeronautics and Space Act“ and NASA started to work on October 1st, 1958. Just a few months earlier in autumn 1957, the Soviet Union launched the very first artificial satellite SPUTNIK 1, resulting in the so called "Sputnik shock", paraphrasing that the western world was shocked that the USSR was really able to do this...and by doing so the entire western world (esp. USA) was commited to an atomic thread. We all know the story of the Cold War.... (BTW, for this reason Eisenhower also founded ARPA, the "mother" of the Internet).
The race began and now, 50 years later, Russian and US-american astronauts are working together in the International Space Station ISS.

Although I'm quite younger than NASA, NASA made a big impression on my childhood days. Remember the Apollo program and the first man on the moon. As almost every child I wanted to become an astronaut - or at least a scientist (got it!). Belief it or not, my very first memories of television are pre-launch cuntdowns of the Apollo Program (I really don't know which mission, but obviously one of the later). I remember the countdown was stopped several times and I was very angry, because I had to go to bed and could not watch the lift-off. I dimly remember even the Skylab program (as well as its early "re-entry" in 1979) and of course the first launch of the Space Shuttle in 1981 (and again with delayed countdown..., at Google Video you may watch the lift-off video of STS-1 Columbia).

But, the most interesting thing for me always have been NASA's planetary missions, giving us wonderful pictures of Jupiter or Saturn (Pioneer and Voyager) and the other planets.

NASA has opend up its Picture Archive with tons of pictures for free use. Nicely organized you may find pictures from Hubble, planetary missions, the space program, and many more....(But beware, today their servers have to keep up with an intense workload because of their birthday event).

Monday, July 28, 2008

Google findet mehr als 1 Billion(!) Web-Adressen...

Ich hatte es ja schon immer gewusst, dass der Gogle Suchindex "ziemlich" gross ist. Die letzten "offiziellen" Zahlenangaben, die mehr oder minder indirekt gemacht wurden, besagten, dass Google im Jahr 2005 einen Datenbestand von 24 Milliarden Webseiten im Index verwaltet [1]. Aber die Zeit bleibt ja nicht stehen und das Web wächst beständig....und jetzt schreibt der "official Google Blog" am Wochenende, das der Google-Suchindex die magische Marke von 1 Billion (!) Webseiten überschritten hätte.....[2]

Natürlich muss man bei US-amerikanischen Zahlenangaben stets vorsichtig sein. "One billion" steht ja lediglich für unsere "Milliarde". "Eine Billion" dagegen sind tatsächlich 10^12 (1.000.000.000.000), im Englischen "one trillion", also eine ganze Menge. Jetzt stellen wir uns einmal vor, wir haben diese Billion Indexeinträge, die zudem noch untereinander verlinkt sind. Würde man diese Datenstruktur klassischerweise als Matrix speichern, bräuchte man 10^24 Einträge, von denen die allermeisten ja leer wären. Also speichert man eine derartige Datenstruktur doch besser auf effizientere Weise. Allerdings muss man dabei bedenken, dass der Zugriff auf Links immer noch sehr schnell erfolgen muss, da die iterative PageRank-Berechnung ja auch nicht ohne ist [3]. Ich wäre wirklich einmal daran interessiert, wie lange jetzt eigentlich eine komplette Berechnung des PageRanks für den Gesamt-Datenbestand heute dauert....

[NACHTRAG:]
Tja...man soll ja den Tag nicht vor dem Abend loben...
Der San Francisco Chronicle setzte heute einen Nachtrag zu o.a. Google Meldung, in der es hieß, dass der GoogleBot zwar mehr als 1 Billion Webseiten gefunden hätte, von diesen aber lediglich 30 - 50 Milliarden im Google-Suchindex verwaltet werden [4]. Naja, immerhin haben wir jetzt einen Anhaltspunkt, wie groß das WWW sein könnte.....und dass tatsächlich auch nicht alles bei Google gefunden werden kann.


References:
[1] TNL Blog: Google has 24 billion items index, considers MSN search nearest competitor, September 2005.
[2] The Official Google Blog: We knew the Web was Big....., Juli 25, 2008.
[3] Sergey Brin and Lawrence Page: The anatomy of a large-scale hypertextual Web search engine, Computer Networks and ISDN Systems30(1-7):107--117(1998).
[4] SFGate: New Search Enging challenges Google, July 28, 2008.

Friday, July 11, 2008

Ceci n'est pas Stockholm.....


...or even more "ceci n'est pas Google?"

Very nice allusion to Google's image search found at blogoscoped.com. I hope we will soon find more of that kind :)

Tuesday, July 08, 2008

Das magische Medium....

...unter diesem Titel stand ein Artikel von Christian Schmidt-Häuer im ZEIT-Bildungskanon am 19. Juni 2008 (...jaja, ich weiss, ich bin z.Z. wieder einmal etwas hinterher, was das Lesen meiner ZEIT betrifft). Aber schön, dass sich hier einmal ein Artikel findet, der die Ursprünge unserer Basiskulturtechnik - die Schrift - zum Thema hat, zu dem ich ja auch schon publiziert habe (siehe unten). Ebenso war das Thema Schriftentwicklung auch schon in einigen meiner Vorlesungen mit von der Partie.

Daher habe ich den ZEIT-Artikel mit großem Vergnügen gelesen, insbesondere da als Aufhänger der Geschichte sogenannte Quipus auftauchen, präkolumbianische Knotenschnüre, also weniger ein echtes Schriftsystem als vielmehr eine Art komplexer Merkzettel oder - um mit heutigen Hilfsmitteln zu sprechen - eine Art frühzeitliche Excell-Tabelle.

Interessante Bemerkung am Rande: Quipus sind auch eines der wenigen Schriftsysteme, die nicht Einzug in den universellen Unicode-Standard genommen haben - was wohl auch an der schwierigen Darstellungsform inkl. Farbe und Dreidimensionalität liegt (zumindest wurde mir dies in einem lange schon zurückliegenden Vortrag eines Vertreters des Unicode-Konsortiums so erläutert...).

Links:

Friday, June 27, 2008

Adaptive Multimedia Retrieval 2008 in Berlin, June 26-27, 2008 - Day 02

After the dinner cruise along the river Spree, the second day of Adaptive Multimedia Retrieval 2008 again starts with an interesting invited talk on the European answer to Google search engine technology - THESEUS.

Karsten Müller from Fraunhofer Heinrich-Hertz-Institute is presenting on "THESEUS Project - Applications and Core Technologies for the Semantic Web". First, Karsten makes clear, that THESEUS doesn't want to be Google ;-) THESEUS is a research program for a new internetbased knowledge infrastructure....which from my point of view means nothing else but "the semantic web"....
One part of the THESEUS project is ALEXANDRIA, the virtual library, being lead by Yahoo! with the objective of semantic processing of different forms of content to enable faster access to relevant content, which again means an increase in information quality. Concepts such as an automated tagging framework (including language error correction, synonym & tag merging, and topic focussing, identification of semantic relations), innovative navigation (by presenting thematically related contents) and interaction concepts are involved.
Another part is ORDO, which deals with "Organizing your digital life" with the goal to unify various data formats, multilingual information, structured and unstructured data on the web to enable homogeneous information sources.Problems such as separating important from unimportant, ordering information instead of searching, priorization, identification and visualization of interrelations are addressed.
TEXO is another part with the objective of "Realizing the internet of services" (being lead by SAP Research), offering personalized customized services, community involvement to improve services, as well as a smooth & seamless (userfriendly) adaption and integration of services.
PROCESSUS deals with the "Optimization of business processes" aiming for the objective of anytime providing the user with theright information at any stage of the business process.
MEDICO is another subproject dealing with "Towards Scalable Semantic Image Search in Medicine" and being lead by Siemens.
CONTENTUS, as being the last Use case "Content access and generation from cultural institutions is lead by the Deutsche Nationalbibliothek. Being part of CONTENTUS are tasks such as Digitizing books as well as audiovisual material (including the German Music Archive in Berlin) protecting the cultural heritage. The goal is the semantically interlinked collection of content to achieve a next generation multimedia library.
.....impressive and ambitious project!

The upcoming section this morning is on "Image Tagging" and Marius Renn (at least I hope so) from TU Kaiserslautern is givig a presentation on "Automatic Image Tagging using Community-Driven Online Image Database". Automatic image tagging requires a lot of training data....and flickr is delivering tons of tags per day...but are these flickr data really good candidates for learning? So, in the end, unfiltered community image sets directly do not provide satisfying results. Alas, these databases at least allow large scale image aggregation...
The next talk in this session is given by Christian Hentschel from Fraunhofer HHI Berlin about "Automatic Image Annotation Refinement using Object Co-Occurences". Again, flickr is the target image set with its huge collection of more than 2 billion images, growing by 3 million photos every day. Objects always appear and are perceived in a semantic context.

The following session is on "Symbolic Music Retrieval" and starts with Rainer Typke from Austrian Research Institute for Artificial Intelligence (ÖFAI), but I had to skip this talk. Anyway, the samples of the reduced MIDI files were quite interesting (although I'm not a fan of the Scorpions!). OK, I had to ask afterwards about the usefulness and application of his approach. In music retrieval it can be used to reduce the index size down to 30% of the original index. Also QBE-processing will become much easier while on the other hand you might connect this MIDI-collection to real music files.
The last talk of the morning session is given by Giancarlo Vercellesi from University of Milan on "Automatic synchronization between audio and partial music score presentation". He presents the ParSi architecture, which perfoms an alignment of PCM signal and partial MIDI scores.

The afternoon session is simply entitled with "Systems". Fernando Lopéz from Madrid is giving a presentation on "Towards a fully MPEG-21 compliant adaption engine: complementary description tools and architectural models". Within the MPEG-21 framework several aspects of metadata-driven adaption is not clearly covered. He introduces CAIN, a tool for adapting Digital Items e.g. to different output devices.
The session continues with a presentation on "Mobile museum guide based on fast SIFT recognition" with the objective to identify paintings in galleries simply with the help of mobile pattern recognition without any extra installation on site. The SIFT (Scale Invariant Feature Transform) method is a rather cool algorithm for detecting local features within images that are used to map photographs taken with your PDA or mobile phone in the image gallery with reference pictures from a given database. And actually the live demo did work :)
I guess, we will also use the SIFT-algorithm in yovisto for synchronization of ppt/pdf-slides with the lecture video.

For the last session - "Structuring of Image Collections" - only one speaker showed up. Marc Gelgon is presenting on "Geo-temporal structuring of a personal image database with two-level variational Bayes mixture estimation".

[to be continued...]

Thursday, June 26, 2008

Adaptive Multimedia Retrieval 2008 in Berlin, June 26-27, 2008

The next two days, we are attending the Berlin Adaptve Multimedia Retrieval 2008 Workshop at the Heinrich Hertz Institute being located in downtown Berlin. So, it's pretty close to home and the only travelling involved was by S-Bahn :)

The first speaker is Francois Pachett from Sony CSL giving a keynote entitled "What are our audio features worth?"
The fundamental questions are "What makes objects what they are?", ""What are the features of subjectivity?", "How do we perceive objects and how can we transfer this to a machine?" Pachet's research is concerned with the classification of musical objects based on the so called polyphonic timbre that describes the sum of all features of a music object. Interesting thing is the identification of hubs, i.e. songs that are pretty close to every other song. Hubs in general seem to be mere artefacts of static models.
Interesting fact ist that there are companies now, predicting if your song is going to be a hit. Their judgement also relies on feature analysis and they even give recommendations how your song can be improvent to become a hit. Of course you have to pay for that service...but does it really work??

After the coffee break, there's a session on User-Adaptive Music Retrieval. The first talak is presented by Kay Wolter from Fraunhofer IDMT Ilmenau on "Adaptive User-Modelling for Content-Based Music Retrieval". They are adapting a content-based music retrieval system (CBMR) according to user preferences that are determined by acceptances and rejections of recommended songs by the user, which is furthermore used to improve the quality of music recommendations....Reminds me somehow to Pandora or last.fm...
The second talk is presented by Sebastian Stober from Otto-von-Guericke-Universität Magdeburg on "Towards User-Adaptive Structuring and Organization of Music Collections". So, wouldn't it be nice to structure your music collection automatically...but not in the way the software tells you, but the way you like it? The presented system is based on an general adaption approach using self-organizing maps that can be adapted by user interaction.

The first afternoon session is on "User-adaptive Web Retrieval" and starts with a presentation of Florian König from Johannes-Kepler-Universität Linz on "Using thematic ontologies for user- and group-based adaptive personalization in web searching". He introduces Prospector, which is a generic meta-search layer for Google, not constrained only to web search, based on re-ranking of search results and deploying user modells based on Open Directory Project (ODP) taxonomies. As far as I have understood, the applcation is based on the carrot2 framework for open source search engine result clustering.
Next, David Zellhöfer from BTU Cottbus presents on "A Poset Based Approach for Condition Weighting". Similarity search can be determined according to different conditions w.r.t. the search query. Esp. different people have different expectations if it comes to similarity. So, condition weights have to be determined by psychological experiments.

The second afternoon session is about "Music Tracking and Tumbnailing" and starts with a presentation of Tim Pohle from Johannes-Kepler-Universität Linz on "An Approach to Automatically Tracking Music Preference on Mobile Players". Ok, so the basic problem is, someday you will get bored by the music selection on your ipod. Therefore, the goal is to remove songs that you don't like anymore and replace them with new songs that you probably will like. How do you achieve this? Well, with according user feedback, i.e. by tracking the user's decision on choosing or skipping tracks. Tracks that have recently been skipped often will be dropped and replaced by tracks that are similar (according to some feature analyses) to the remaning tracks.
Next, Björn Schuller from Technische Universität Münschen is presenting on "One Day in Half an Hour: Music Thumbnailing Incorporating Harmony- and Rythm Structure". Music thumbnailing is some really cool feature, Just imagine, your sitting in your car and you are looking for another track to hear, but your player always starts songs at the beginning and they have long and boring intros. Therefore, getting to the most interesting (or significant) part of the song immediately would really be something...

The sessions close with an invited talk given by Stefan Weinzierl and Sascha Spors on "The Future of Audio Reproduction. Technology - Formats - Applications". Promissing title, let's see.... We start with a brief history of audio recording and reproduction technology starting from the very first phonograph to modern multichannel spatial surround sound systems. So, the future seems to be real sound field synthesis (wavefield synthesis, WFS) instead of relying on psycho-acustic effects as in today's stereo. Here, an array of loudspeakers reproduces exactly the wave front of the original sound source. For transmitting signals like this, no single channels are recorded anymore, but the original sound signal (without spatial characteristics of the room where it has been recorded, because this would interfere with the characteristics of the room, where it is reproduced) including movement and position of the sound source. Besides existing VRML and MPEG-4 Audio BIFS that focus more on visual scene description than on audio scene descriptions, there is the proposal of a new modeling language for high resolution spatial sound events called ASDF (Audio Scene Description Format).

[...to be continued in Adaptive Multimedia Retrieval 2008 in Berlin, June 26-27, 2008 - Day 02]

Wednesday, June 25, 2008

Visualization of large document data sets

Of course, there are a lot of books at amazon. To find a specific book, you have several possibilities. Either you try the 'search'-frame (be careful to write your keywords in the correct way as they do appear e.g. in the title) or you try to follow the categories and the proposed selections (Top10 lists, etc.) of books there. A great feature of course is the similarity based search or the search based on recommendations. This is the only way to discover something new by serendipity, something you did'nt even know to exist . In real life this way to find a book comes equal to finding by recommendation of friends or by your local librarian or book seller.

But, what about good old window-shopping. If I come to a bookstore or to a library, I love to wander around the book shelves and to look here and there and to spend (sometimes to waist...) lots of time.

Something that comes visually close to this experience is zoomii.com. It's an amazon add-on for browsing (a considerably large set of) books (they say its about 20.000 books right now) with the look and feel of book shelves, ordered by categories, with (right sized) book covers in the shelves. You can walk around, zoom in and out, and of course you can browse by categorie or search via keyword.

It's really some nice way to visualize large sets of documents. I would like to see an API for visualizing documents on the web that way!!

[via netbib.log]

Thursday, June 19, 2008

Smithsonian's Photographic Archive at flickr!

The Smithsonian's photographic Archive (at least parts of it) is available at flickr! The pictures are in high-resolution und published under Flickr commons copyright regulations, i.e. copyright-free.

Most interesting is the Folder 'Portraits of Scientists and Inventors' including many famous scientists from the 19th century (including G. Marconi from the upper left corner...). The pictures are from the Smithsonian Dibner Library of the History of Science and Technology that has a collection of more than a thousand portraits of scientists and inventors through the centuries. Only a small sampling of 144 pictures of the collection is available at flickr and gives you an idea of the range of the collection. Visit “Scientific Identity: Portraits from the Dibner Library” to see the entire collection.

Other interesting folders include 'Portraits of Artists' or 'People and the Post' from the Smithsonian's National Post Museum.

[via boing boing]

Tuesday, June 17, 2008

to boldly go....

For sure, there are a lot of nice little visualisation tools for flickr and even more has been written about tagging, folksonomies, and web 2.0 search (including the 'flickr!' article in this blog...). Nevertheless, I found a new flash-based application for flickr tag / search visualisation that is worth while taking a look: The Tag Galaxy.

Tag Galaxy visualises the search process in a bold star trek manner. Your search keywords (tags) are shown in a central star/planet with related tags spinning around it. you can refine your search by clicking on one of the small tag planets. This is only a one step ahead search...why not showing related tags of related tags....with according interrelationships...? Sounds weird? Should be worth a shot. Clicking on the central star/planet guides you to another view, where the search results (images) are shown on an animated globe. Really neat....and interesting what comes out if people have enough time to play around with flash ;-)

Anyway, the trouble with neat search visualisations often is that the visualisation is nice to look at but it gets boring after a few tries. Real good visualisation gives you more information than without the visualisation. Only if its worth while, you will keep on using it. Just think of Google Maps. Your search related with geographical information always improves your search results...and thus you will also use it next time....

Tuesday, June 10, 2008

SPAM - Vermüllte Briefkästen und die Märkische Allgemeine

Kaum hatte ich mich in meinem letzten Artikel über das Medienecho hier am HPI ausgelassen, gibt es doch noch einen weiteren Artikel in der Märkischen Allgemeinen Zeitung, in dem ich zitiert werde. in "INFORMATIONSTECHNIK: Vermüllte Briefkästen
Die Flut sogenannter Spam-Mails kostet Zeit, Geld und Energie
" von Ulrich Nettelstroth werde ich zitiert mit dem Aufruf, "Porto für E-Mails" einzuführen, um der tagtäglichen SPAM-Flut Herr zu werden. Eigentlich hatte ich erwartet, dass ein Aufschrei durch die Nation gehen würde und ich als kapitalistischer Handlanger der Globalisierungs-Aktivisten gebrandmarkt werden würde, da ich damit doch die Grundprinzipien des des ökosozial-anarchischen Internets "verrate". Aber keiner hat es gemerkt ;-)

Was steckt dahinter? Nein, es geht nicht darum, dem Verbraucher noch mehr Geld aus der Tasche zu ziehen. Stellen wir uns doch nur einmal vor, in den monatlichen Kosten unseres Internetzugangs wären 10 Cent als Portoäquivalent für ein Freikontingent von 3000 E-Mails mit enthalten. 99,99% aller "normalen" Nutzer fallen derzeit mit Sicherheit in diese Kategorie (entspricht das doch 100 versandten E-Mails pro Tag inklusive Sonn- und Feiertage, Urlaub, etc.). Keinem fällt es auf, niemand wird durch diesen Obulus vom Internet ferngehalten, da die üblichen Verbindungs- und Bereitstellungskosten jeglicher Art von Datenkommunikation ein Vielfaches dieses Betrages darstellen.

Die typischen SPAM-Versender aber, versenden Ihre Kauf-Aufrufe üblicherweise an 10 Millionen und mehr " Kunden" pro Mailing-Aktion. Hochgerechnet ergeben sich dabei so etwa 300 Euro für 10 Millionen versendete E-Mails. Ich denke, damit wäre man schon an der Rentabilitätsgrenze der SPAM-Versender angekommen. Wenn nicht, muss man den Betrag noch etwas nach oben "tunen". Heute - da der Versand von SPAM E-Mails nahezu kostenlos erfolgt, rentiert es sich für den SPAM-Versender bereits, wenn pro Mailingaktion 100 Euro hereinkommen......Tun sie das? Ich weiss nicht, ob es hierüber verlässliche Studien gibt (falls jemand dazu eine Quelle bekannt sein sollte, ich wäre für Hinweise dankbar!).

Allerdings verlangt ein Porto für E-Mails auch ein entsprechend manipulationssicheres Accounting-Verfahren, d.h. mehr Aufwand und mehr Ressourceneinsatz wären notwendig. Wenn auf der anderen Seite aber der SPAM-Datenverkehr abnehmen würde, sinkt auch die aktuelle Belastung der Datenkommunikations-Infrastruktur....und damit auch der zugehörige Energieverbrauch. Somit könnte das "Porto für E-Mails" auch für eine günstigere weltweite CO2-Bilanz sorgen ;-)

Friday, June 06, 2008

In den Medien....

Das Medien-Echo, das mir entgegenschlägt, seit ich hier am Hasso Plattner Institut in Potsdam bin, ist schon etwas anderes als das in meinen Jenaer Zeiten. So auch der hier gefundene Artikel zu der gerade in Arbeit befindlichen Neuauflage unseres Buches "WWW - Kommunikation, Internetworking, Web-Technologien". Die Märkische Allgemeine Zeitung kündigte diese bereits in einem Artikel "Vom Homo sapiens zum Homo surfiens" am 29. Mai 2008 an.

Um also die Fan-Gemeinde auf dem Laufenden zu halten: Die Arbeiten gehen voran. Neue Kapitel, wie z.B. Suchmaschinen, und Web 2.0 sind bereits geschrieben, alle Kapitel werden sorgfältig überarbeitet und auf den neusten Stand gebracht (ich kämpfe gerade mit den Varianten der Videokomprimerung....) und weitere neue Kapitel, wie z.B. Semantic Web und SOA werden folgen. Eine Aussage über den geplanten Umfang der Neuauflage kann ich noch nicht geben, befürchte aber, wir werden die 1500-Seiten-Grenze erreichen und damit könnte es dann doch noch ein zweibändiges Werk werden....


Ch. Meinel, H. Sack: WWW: Kommunikation, Internetworking, Web-Technologien, Springer, Heidelberg, 2004.

Wednesday, June 04, 2008

Semantic Web Podcast Interview

Gestern wurde ich von meinem guten alten Bekannten und Kollegen Steffen Büffel im Rahmen des 3. Dresdner Future Forums zum Thema Semantic Web interviewt. Ich war zugegebenermaßen ein wenig unvorbereitet und via skype geführte Interviews klingen immer etwas hölzern (eher blechern...), aber immerhin, hier ist es nun, mein erstes Podcast-Interview....und natürlich meinen besten Dank an Steffen.


[Link zum Original-Artikel in media-ocean]

Tuesday, May 27, 2008

Classical Mechanics made popular by Walter Lewin

As you might know, my lectures are recorded and archived ever since 2004. I really do know, how difficult it is, to fit the lecture into the exact 90 minutes frame, to develop a dramaturgy to get the student's attention. But, looking at MIT Prof. Walter Lewin's lectures on Classical Mechanics (MIT 8.01) you really can become frustrated -- well at least as being a lecturer myself. From the listener's perspective, Walter Lewin's lectures are really great! Each single lecture follows an escalating dramaturgy leading into some astonishing experiment - often with Prof. Lewin being part of the experiment himself.

Let it be a life threatening pendulum experiment (here you can jump directely to the experiment in the video), or his funny safari costume for 'shooting the monkey'. The videos are profesisonally produced - no wonder, MIT invested 100.000 $ per lecture series. Each hour of Lewin's lectures requires 40 hours of preparation -- at least if you want to keep his quality standard.

Of course, simply by watching lectures at youtube or yovisto you don't learn. This fact holds since the times of Sokrates. You need practical experience under the guidance of a teacher. But hey, it's also pure fun to listen to Prof. Lewin and there's a lot to learn, esp. for teachers....

Yovisto has indexed the entire 35 lectures of 8.01 Classical Mechanics. By registering at yovisto, you can place tags and comments to every single point of time within Lewin's videos as well as you can bookmark only the highlights out of his lectures.

Friday, May 16, 2008

bibcamp in Potsdam, 15.-17.5.2008

Bibliothek 2.0 - von der Theorie zur Praxis....
Unter diesem Motto steht das erste deutsche bibcamp, das heute hier in den 'interessanten' Räumlichkeiten (i.e. das 'Schaufenster') der FH Potsdam stattfindet. Geografisch und strategisch günstig in der Nähe des Hauptbahnhofs gelegen präsentiert sich das Gebäude-Ensemble der FH-Potsdam, das sich in direkter Nachbarschaft zur klassizistischen Nikolai-Kirche befindet (die mit ihrer riesigen Kuppel die Skyline von Potsdam dominiert), mit dem diskreten Charme des mir aus Weimar wohlbekannten 'Gauforums' (einem der größten Betonhässlichkeiten, die uns der Bauwahn des dritten Reichs hinterlassen hat...). In einer guten halben Stunde geht es los und ich werde heute vom ersten Tag des bibcamps berichten....

Wie immer beginnt die Konferenz mit der kollektiven WLAN-Suche, die sich hier für manchen Benutzer etwas schwieriger gestaltet, da das WLAN der FH-Potsdam nur in Verbindung mit einem VPN-Tunnel genutzt werden kann...insbesondere mussten wohl Benutzer eines älteren Windows eine hohe Frustrationstoleranz an den Tag legen (zumindest schaute Jörg quasi als 'Guter Geist' dem ein oder anderen über die Schulter und versuchte zu helfen).

Das bibcamp hat sogar ein eigenes Maskottchen, das allerdings noch nicht eingetroffen ist. Die Maskottchen-Idee ist über eine Google-Suche entstanden. Gab man im Vorfeld der Veranstaltung den Suchbegriff 'bibcamp' ein, wurde man von der Suchmaschine vorsichtshalber gefragt, ob man nicht doch den Begriff 'bibercamp' gemeint hatte. Was lag also näher, als den Biber als offizielles bibcamp-Maskottchen zu küren. Foto folgt...


Ok...Vorstellungsrunde. Nicht die Teilnehmer, sondern die Beiträge sind als erstes dran. 'Den Mutigen gehört die Welt' und ich bin mal wieder der erste, der sich vortraut und stelle unser Thema 'yovisto' vor. Ach ja...ich hab jetzt auch ein yovisto-T-Shirt. Foto folgt....

Interessante Beobachtung am Rande: für eine barcamp-Veranstaltung ist hier alles ein klein wenig langsam. Auch war nicht jeder 'gezwungen', sich selbst und seine Motivation vorzustellen. Diejenigen, die es taten, waren dafür aber 'sehr' ausführlich...(also nicht die berühmten drei(!) Tags).

Jetzt wird's langsam interessant. Die Diskussion entspannt sich über das bibliothekarische Selbstverständnis ('Bibliothekare haben ja viel mit Büchern zu tun') über den Bibliothek 2.0 Begriff. Vom Schlagwort zum Schlachtruf, dem Für und Wider der damit verbundenen Technologie und dem Ausgeliefertsein gegenüber der Akzeptanz der Nutzer. Nebenbei bemerkt weiss ich jetzt, dass 'Archivar' ein cooler Beruf ist, ohne den die Realisierung des Disney-Films 'Ratatouille' kaum möglich gewesen wäre....

So, nachdem wir (Jörg und ich) noch für ein Videointerview herhalten mussten beginnt jetzt das Abendprogramm mit Powerpoint-Karaoke (inklusive einem 'Chicken-Chicken'-Vortrag...). Das kalte Buffet (inklusive Schmalzbrote) war gar nicht schlecht. Der Rest des Abends spielte sich vornehmlich 'draußen vor der Tür' ab, die gut gemeinte musikalische Untermalung spielte mutterseelenalleine drinnen vor sich hin und berieselte die Reste des kalten Buffets, während sich draußen die interessanten Diskussionen entspannen. Tatsächlich endete der Abend auch 'fast' in einem Fiasko (2.0)..., als das Maskottchen (heist es nun 'Horst' oder nicht?) vorgeblich entführt wurde....

Zum bibcamp wird ja auch eine Menge getwittert....

Thursday, May 08, 2008

1st German IPv6 Summit at HPI, Potsdam (Day 02)

The first session today is about IPv6 status and deployment from ISP's (Internet Service Providers) and Operator's point of view, starting with Henning Grote from Deutsche Telekom (DT). DT is one of the big players in Germany, if it comes to IPv6. With more than 10 million end customers the transition from IPv4 to IPv6 really becomes some challenge...Thus, DT was going to present an overall strategy for their IPv6 migration....unfortunately, without giving en explicit timeframe.

Next, Yes Poppe from Tata Communications started his talk with telling the story how IPv6 came into life more than 10 years ago. Certainly interesting for someone like me, who was first involved in IPv6 at about 2000/01. Furthermore, he was referring to global IPv6 connectivity, stating that finally now you can show that there will still be Internet alhough if you completely shut off IPv4.

Wolfgang Fritsche from IABG as being the next speaker was referring to IPv6 deployment in mobile application environments with regards to results being achieved by the EU-project ENABLE. IPv6 offers special benefits for mobile networks such as a sufficiently large address space or an efficient autoconfiguration capability.

Lutz Donnerhacke from IKS Jena as being the next speaker was providing insights into IPv6 deployment from the viewpoint of an ordinary regional ISP, which are obviously of different nature compared to one of the big players such as DT or others.

In the second session -- IPv6 Applications & Developments - Carsten Hatzig from BWB/WTD81 was speaking about IPv6 deployment in military environments. 'The long way to IPv6' for the German Forces was depicted, but among the 5 phases (ranging from general preparations up to the shutdown of IPv4) we are almost only at the beginning...

Wednesday, May 07, 2008

1st German IPv6 Summit at Hasso-Plattner-Institute, Potsdam, May, 7-8, 2008

Today, I'm writing about a conference, which I had to organize during the last months: The 1st German IPv6 Summit at Hasso Plattner-Institute (HPI) in Potsdam, May 7-8, 2008. Of course I was supported by a rather commited team here at HPI and - at least up to now - everything works out fine.

The first day is reserved for almost political talks only. Although not so interesting from a technical point of view, political statements about the deployment and status of IPv6 are rather important and might help to progress with the great upcoming transition from the old IPv4 to the future internet IPv6. So, the summit started with two important keynotes by Vivane Reding, EU-Commissioner for Media and Information Society (video link to Viviane Reding's keynote) and Vinton Cerf, Chief Evangelist and Vice President of Google -- and of course 'father of the Internet' (video link to Vinton Cerf's keynote).

Other interesting talks were given by Martin Schallbruch, CIO of the Bundesministerium des Inneren about deployment of IPv6 in the public sector, Prof. Lutz Heuser, Vice President of SAP Research, on the Future Internet, and Detlef Eckert from EU Commision of Media and Information Society about the EU strategy on IPv6 deployment.

In the afternood session, internationale experiences with IPv6 deployment were reported by representatives of the Japanese IPv6 Promotion Council, the IPv6 Forum Malaysia, IPv6 Forum Korea, and Telecom Bretagne.

Anyway, we all know that we are running out of IPv4 address space soon. Although, not everybody takes this warning serious, we should face the possibility that there will be no more IPv4 addresses left after 2012. Of course there will still be the internet there, and of course you will receive also your email....but maybe not necessarely on all your mobile devices (at least if you take into account that 2008 there will be more than 3 billion mobile phones....and most of them are able to access mobile internet...well technically speaking...).

And I almost forgot to mention that yovisto.com is also exhibiting at the IPv6 summit. We succeeded in connecting our search engine to the IPv6 network (we have an IPv6 connection from FSU Jena being provided by University of Potsdam...)

IPv6 summit in the media:

Wednesday, April 30, 2008

Internet lectures now also on itunes


This semester, all recordings of my lecture 'Rechnernetze und Internettechnologie' (Computer Networks and Internet Technology, the lecture is given in German) at the Friedrich-Schiller-Universität Jena, are available via yovisto.com and also via itunes (for download). Here you will find the first lecture of the series at yovisto.com and here ist the itunes link to the lecture series.

Due to some deficiencies of the former real media recordings, we decided to encode all recordings via flash (in a much more fashionable style...see for yourself) now. The itunes version is of course mp4 with h.264 encoding.

Friday, April 25, 2008

WWW in Chinese ....

Hej, on Tuesday I received the very first copy of the Chinese translation of our book "WWW - Kommunikation, Internetworking, Web Technologien". Contrarywise to the German original it's a Chinese paperback and therefore also less expensive. Wow.....it's amazing to know that there are potentially more than 1 billion readers ;-)
Finally, I know how to spell my name in Chinese (although I can't read a word of the book...):

(德)萨克(Sack,H.)

Anyway, if you believe it or not, translating the book took more time than writing. I remember that it took me about 15 months to write these 1200 pages. But, the translation itself took more than 2 years. One of the major difficulties was that, although we gave the LaTeX source code to the translator, text being embedded in images and figures was hard to deal with (maybe because I wrote the entire book on a Linux system with xfig for drawing graphics and figures....).

All in all, the translators did a great job and I would like to thank everybody being responsible for the work being done, especially Long Wang from HPI Potsdam for the tedious work of endless proof reading and of course Christoph Meinel for establishing ties to the Chinese publisher Science Press.

Here (at dandang.com) you may find a Chinese advertisement for the book (being already) on discount for only 51.60 Yuan (instead of 69 Yuan...).

Wednesday, April 23, 2008

bibcamp 2008 in Potsdam / Berlin ahead......


After the succesful educamp 2008 in Ilmenau, I would like to draw your attention to the upcoming bibcamp in Potsdam and Berlin, on May 16-17, 2008. Like educamp 2008, bibcamp is being organized as a barcamp. Up to now, we have 45 participants, and more to come...

If you don't know what a barcamp is all about, here are the rules (....thinking about fight club):
1st Rule: You do talk about Bar Camp.
2nd Rule: You do blog about Bar Camp.
3rd Rule: If you want to present, you must write your topic and name in a presentation slot.
4th Rule: Only three word intros.
5th Rule: As many presentations at a time as facilities allow for.
6th Rule: No pre-scheduled presentations, no tourists.
7th Rule: Presentations will go on as long as they have to or until they run into another presentation slot.
8th Rule: If this is your first time at BarCamp, you HAVE to present. (Ok, you don’t really HAVE to, but try to find someone to present with, or at least ask questions and be an interactive participant.)

If you want to know more about barcamps, you might have a look at the following video being produced by the Hochschule für Medien in Stuttgart:

Link: sevenload.com

Saturday, April 19, 2008

educamp 2008 in Ilmenau

NEW: The video recording from the first day podium discussion of educamp 2008 is now available via yovisto.com.

Today, we are visiting educamp 2008 in Ilmenau (April 18-20, 2008) on behalf of FSU Jena and yovisto.com. Educam 2008 is organized as barcamp, i.e. it is almost unorganized. But, with more than 150 visitors the e-learning focus of the 'unconference' obviously has attracted a lot of people.
The day started with a short introduction of everybody with 'name + 3 tags', which really is not so simple. Just try to focus your interests in 3 tags, where at least one tag is reserved for your affiliation, one for your profession, and the last tag for your interests. Oh, but wait. The day did not really start because of a lack of coffee (which is essential for any kind of scientific working...).

For yovisto, we have managed to organize a session in the afternoon (1.30pm, 'Geistesblitz'). Therefore, we have some time for visiting other sessions, connecting to people, and public relations for yovisto... ;-).

ok. 1.30 pm. Our session is streamed live into the internet. You may find the live-stream at mogulus. From my point of view our presentation went very well. We had a lot of interesting questions including comments and advice how to improve the services of yovisto.

Unfortunately, we are a little bit too late for the upcoming session 'slidestar'. Concerning content and targets, slidestar for sure is one of our competitors. They offer an entire framework for authoring, recording, and searching lecture presentations....but only in a proprietary manner. One of the problems of slidestar seems to be the accessibility of slidestar-presentations for search-engines. slidestar is a flash-based application and therefore, linking and indexing the content is not possible for search engines like google. Anyway, they have won the 'Deutscher Bildungspreis 2008'...

Now, we are following a session on 'Exzellenz & Kompetenz' for universities and lecturers given by two speakers from meinprof.de. One of the questions being discussed is the motivation for students, which university to choose. Based on the experience of the participants, regional factors seem to be rather important, i.e. students are chosing a university near their hometown or where their friends are. Thus, is 'excellency' really the main criterion for choosing where to study...?

All in all, I really like those barcamp conferences. The athmosphere is much more liberal compared to 'hardcore' scientific conferences, because there is no strict line between gurus, experts, nobel price winners, and new commers. There's discussion everywhere and a lot of new ideas do come up. It's all about networking....and I guess also the 'hardcode' scientific conferences can learn from barcamp.
At least, there have been no conference fees. Everything (lunch, dinner, refreshments, etc.) was for free, thanks to the sponsors. I'm looking forward to my next barcamp, which will be bibcamp in Potsdam/Berlin on May, 17-18, 2008. Hope to see you there!

Tuesday, April 15, 2008

German IPv6 Summit in Potsdam


The German IPv6 summit will take place at Hasso-Plattner-Institute at the University of Potsdam (Germany) on May, 7-8, 2008. On behalf of the German IPv6 Council I am pleased to take the opportunity to invite you all to visit this important event.

The German IPv6 Summit provides two days of International, European, and German experts to address IPv6 technology at work. The first day includes talks about IPv6 strategy and deployment status worldwide, with a focus on Europe, China, Japan, USA, India, and Korea. The second day of the event will complement the program by addressing IPv6 deployment from the perspective of operators and ISPs as well as deployment in the public sector including education, administration, and defence.

As keynote speakers, there will be Vivane Reding, commissioner for information society and media at the European Commission, and Vinton Cerf, vice president and Chief Internet Evangelist for Google (and one of the "fathers" of "the internet" as far as TCP/P is concerned).

For further information including registration, conference venue and accomodation, please visit www.ipv6council.de.

Hope to see you all at Potsdam in May!

Friday, April 11, 2008

Alive and Kicking... Google 2084

Yep...after several months of abstinence, I'm back, finally.
During the last few months, a lot has happened, in science, in business, and also in private. Thus, there is a lot to talk about and a lot of forthcoming posts. Probably the most complex thing during the last months was to handle two large EU-FP7 proposals as being one of the proposal's core partners, with writing, discussing, conferencing, travelling, conferencing, discussing, and writing again, etc....
Also in my genuine research areas a lot has happened. You might be looking forward to several blog posts on information retrieval, semantic web, semantic search, multimedia retrieval, web of trust, e-Learning, and many more.

Today I just wanted to show you a funny 'screenshot' of what might Google look like in 2084.

Ok. The image is not really new. Actually it was published by Randy Siegel back in 2005. You might also find it in the New York Times. The question is not, if Google would be capable to offer these services, but rather when. Anyway, I guess theese services could be offered much earlier......and (at least) also with a much better user interface.

Tuesday, March 04, 2008

Aus Osotis wird yovisto - Web 2.0 Video Suchmaschine

Gemäß dem Motto "aus Raider wird Twix" stellt sich die Videosuchmaschine yovisto (vormals "Osotis") auf der diesjährigen CeBIT 2008 im neuen Gewand vor. Was ist neu, außer dass es einen neuen Namen und ein neues Layout gibt? Einiges!

Mittlerweile sind gut 3000 Videos akademischer Lehrveranstaltungen in deutscher und englischer Sprache in yovisto verzeichnet und durchsuchbar. Im Gegensatz zu anderen Videosuchmaschinen bietet yovisto die Möglichkeit, in den Videos selbst nach Inhalten zu suchen, d.h. wenn man z.B. nach dem Suchbegriff "Buchdruck" sucht, bekommt man nicht nur Videos als Ergebnis, die sich primär mit dem Thema Buchdruck befassen, sondern man wird punktgenau an die betreffende Szene im Video geführt, ohne lange selbst im Video danach suchen zu müssen.

Neu ist vor allem das übersichtlichere, durchdachte Layout. Insbesondere die Informationsseiten zu den Videos, die sich von den Benutzern selbst in Form eines Wikis erstellen lassen, bieten vielfach Möglichkeiten, Zusatzinformationen, Zusammenfassungen und sogar themenbezogene Werbung mit einzustellen. Am Auffälligsten aber ist die Darstellung der VIdeoinformation als 'Cover Flow', wie wir es ja schon von Apple's iPhone oder vom Leopard Betriebssystem her kennen (siehe unten)



yovisto ist auf der CeBIT 2008 in Halle 9, Stand D04 vom 4. - 9. März in Hannover und natürlich jederzeit im Netz unter www.yovisto.com zu sehen. Wer uns auf der CeBIT am Stand besucht, bekommt exklusiv einen Original yovisto-Button (limitierte Auflage) geschenkt!

P.S. Beitrag mit yovisto im Thüringen Journal (via mdr.de), vom 4.März 2008, 19 Uhr

Friday, December 07, 2007

German IPv6 Council Kickoff Meeting in Potsdam


Das deutsche IPv6 Council startete seine Aktivitäten zur Förderung des Einsatzes der nächsten Internet-Generation IPv6 mit der ersten Sitzung am 06. 12. 2007 in Potsdam. Prof. Dr. Ch. Meinel, Direktor des Hasso-Plattner-Instituts (HPI) Potsdam eröffnete die Sitzung in seiner Funktion als Chairman des deutschen Zweiges des internationalen IPv6 Forums. Die Teilnehmer des heutigen Treffens (siehe Teilnehmerliste) begrüßten die Initiative und Bereitschaft des HPI als unabhängige und unparteiliche Instanz die Federführung des Rats zu übernehmen.

Latif Ladid, Präsident des Internationalen IPv6 Forums würdigte die bisher erzielten Fortschritte und machte noch einmal in eindringlicher Weise die Bedeutung und Notwendigkeit eines solchen nationalen Gremiums zur Förderung des Einsatzes von IPv6 deutlich, Peter Demharter (IBM, Projekt DOI) stellte kurz die Inhalte und Zielsetzungen des DOI Projekts (Deutschland Online Infrastruktur) vor, das in seiner Bedeutung für den weitreichenden Einsatz der neuen Netzwerktechnologie als Leuchtturm auf vergleichbare Projektvorhaben ausstrahlen wird. Zielsetzung ist dabei die Sicherstellung der behördlichen Kommunikation über alle föderalen Grenzen hinweg, basierend auf einer Dual-Stack Lösung, mit der sowohl die alte IPv4-Konnektivität nach außen hin, als auch die neue IPv6-Konnektivität gewährleistet wird.

Gert Döring (SpaceNet AG) stellte die neue IPv6 Adressvergabe-Politik des RIPE NCC vor. Während um die letzten verbliebenen Adressbereiche des IPv4-Adressbereichs bereits ein harter Kampf entbrannt ist -- ab 2011, so die aktuelle Hochrechnung, wird es keine neuen IPv4-Adressen mehr geben -- wurden den RIRs (Regional Internet Registries) IPv6 /12-Adressblöcke zugewiesen, die jeweils ausreichen, um bis zu einer Million ISPs zu bedienen. Jedem einzelnen ISP (Internet Service Provider) bleiben dabei üblicherweise /32-Adressbereiche (für große ISPs wurden schon /19 Adressbereiche vergeben, so z.B. für die Deutsche Telekom), mit denen etwa 65.000 einzelne Kunden bedient werden können, während für End-User /48 - /64 Adressen vorgesehen sind.

Ein generelles und gravierendes Problem, das einem Durchbruch der neuen Internet-Technologie im Wege steht, liegt darin begründet, dass die IPv4/IPv6-Problematik noch nicht auf der Ebene der Entscheider angekommen ist und ein derartiges Problembewußtsein erst noch geschaffen werden muss. Darüberhinaus ist allen Teilnehmern klar, dass ein Durchbruch der neuen Technologie nur über den Massenmarkt der Endverbraucher möglich wird. Da die IPv6-Problematik bislang in den Medien -- wenn überhaupt -- nicht in der erforderlichen Dringlichkeit dargestellt wurde, muss sich das deutsche IPv6 Council einen Schwerpunkt auf die Themen Dissemination und Public Relations setzen.

In diesem Sinne wurde von den Teilnehmer beschlossen "IPv6 zu leben", und zwar jetzt. In diesem Sinne sollte folgender treffende Slogan verwendet werden, der seinerzeit IPv6 die parlamentarische Unterstützung sicherstellte: "IPv4 verbindet Computer -- IPv6 dagegen verbindet Menschen".

Um die projektierten Ziele zu erreichen wurden mehrere Initiativen vorgeschlagen und beschlossen, gemeinsam eine IPv6 Roadmap für Deutschland zu entwickeln, die von den bereits existierenden Materialien der internationalen IPv6 Organisationen profitieren kann, um die darin aufgezeigten Lösungsmöglichkeiten an die speziell deutschen Problemstellungen anzupassen.

Zu Demonstrationszwecken sollen am HPI mehrere Showcases mit Unterstützung der Council-Mitglieder implementiert werden, um der Öffentlichkeit die Vorteile und Möglichkeiten der neuen Internet-Technologie vor Augen zu führen. Eine erste öffentlichkeitswirksame Präsentation ist im Rahmen der Computermesse CeBIT 2008 geplant.

Im Fortgang ist ein deutscher IPv6 Summit für den Zeitraum 14.-16. Mai 2008 geplant, der ebenfalls am HPI Potsdam durchgeführt werden soll. Neben Expertenvorträgen zum internationalen und nationalen Erfahrungsaustausch ist dabei ein spezieller IPv6 Tutoriumstag vorgesehen, der Anwendern und Entscheidern einen fundierten Einblick in die neue Technologie ermöglicht.

Friday, October 05, 2007

Evolution of Semantic Systems (ESS 2007) in Jena, Oct 5th, 2007 - Day 01

Yesterday evening, the Symposion on Evolution of Semantic Systems (ESS 2007) started with a welcome dinner in Jena. This symposium attempts to foster a better understanding of semantic systems from an evolutionary perspective. As the conference's abstract says, any complex system in nature and culture is using information for the development of internal organisation and the control of functional mechanisms. Without the storage, transmission, and processing of information, the evolution of complexity would have been impossible. Furthermore, information always has a semantic dimension, which can be described in terms of meaning, sense, reference, and function and thuis, the study of semantics must play a decisive role in the analysis of complex systems. The Symposion on 'Evolution of Semantic Systems' gatheres a large variety of scientific disciplines ranging from philosophy over computer science, bio-informatics, didactics, and linguistics and we are looking forward to many rather interesting invited talks and discussions.

I'm not quite sure, whether I will have WLAN access in the university main building. Nevertheless, I'm recording and commenting the conference's programme. So, if live-blogging is not possible, I will post an entire report in the evening.

This time I have a very difficult job to summarize and comment the talks I have heard because they come from completely unfamiliar areas. Esp. in philosophy I can follow to a certain degree, but soon I'm lost completely (e.g. if it comes to discussion different views of people who the philosophers in the audience are of course familiar with...but I do lack the semantics....). Anyway, I'll try to do my best and hope not to make too many mistakes.......

Bernd-Olaf Küpers: The Evolution of Semantic Systems - Background
"nothing makes sense in biology except in the light of (evolution) information"
Life = matter + information

Opening with these two quotes the first talk in the symposion started to give some background information on the symposion's main subject.
So what is information?
First, let's consider the syntactic dimension of information, as e.g. in a dna sequence. There, the principle difference between physical and biological systems becomes obvious: boundaries or constraints. While in physical systems the organization of the system does not depent on its boundaries , i.e. it has contingent boundaries (e.g. gas molecules in a gas tank do not change their organization, if walls of gas tank are changed), in biological systems boundaries and constraints are non-nontingent. Therefore, small changes will have a significant effect on the entire system (e.g. exchanging a nucleotide in a DNA sequence).
On the other hand let's consider the semantic dimension of information, i.e. content & meaning. There, we might refer to similarities between human language and genetic code (as being a molecular language, where nucleotides are referring to single characters, nocleotide sequences to morphemes, genes to words, and so on ....). We discover that both human and molecular language have common features in syntax and structure, such as linear sequence, unidirectionality, aperiodicity, recursivity, and hierarchical structure. And on the level of semantics and functions, we identify common features, such as contingency and context dependency.
Language seems to be a basic principle of complex systems in nature. Starting with Shannon's (statistical) theory of information, we add properties such as novelty, pragmatics, complexity, to conclude that 'meaningful informationÖalways refers to 'information of maximum complexity'.

In the second part of the talk, structural sciences (how reality can be strucutred) as a new branch of science that has evolved over the last 50 years is addressed (e.g. with single topics ranging from system theory even to chaos theory, including philosophy, physics, or computer science).
Structural science overlaps (in its borders) with the traditional (natural) sciences and humanities. It's objects of cognition are structures and relationships, the methods of cognition causal analysis and analogical form, bound by structural laws as well as boundaries and constraints.
This is leading to the foundation of the Frege Center for Structural Sciences at the University of Jena with its first project: The Evolution of Semantic Systems

In the second presentation Dagfinn Follesdal from Stanford University was talking about 'The Emergence of Reference'. Forst he addressed several of the important questions, which have been previously posted on the symposion's homepage.

  1. What are paradigmatic examples of semantic systems? E.g., formal languages, natural languages, etc....A System can be defined as a structure, that is, aset of entities (nodes) with relations dewfined between them. Syntax and semantics are dependent on each other, phonology finds the distinctions, where they matter for semantics.

  2. Semantic vs nonsemantic systems -- is the a line of demarcation between semantic and non-semantic systems?
    A system becomes a semantic system, when it becomes endowed with meaning.
    A structure becomes a language, if it is used for communication (e.g. a slide rule is a semantic system (uses cmore complicated mathematical structure...instead of multiplying numbers one adds its exponents)..but not the physical system itself is semantic, but the system under a specific interpretation).
    A system becomes semantic by use in publicly obvservable ways (behaviouristic view) (therefore, as e.g., a neural network is no semantic structure...they do not become endowed with meaning through a public process).


We speak of Holism, if changes in one part of the structure tend to affect to entire structure. The semantic interpretation always is affected by new knowledge. Frege distinguished two branches of semantics: theory of sense and theory of reference. All three basic kinds of expressions, singular terms, general terms and sentences all function in the same way: sense determines reference. Objects have three features that are crucial for reference: (1) transcendence (2) change over time and (3) fallibility (we may have false beliefs about objects).

Next, Rüdiger Inhetveen and Bernhard Schieman form the University of Erlangen/Nürnberg were giving a presentation on Talking about Structures. So the basic question was 'what are structures and what are they used for...?' (-> Eulers Könisberger Brückenproblem....instead of walking, Eurler transfered the problem to a structure....here, a graph) Thus, structures (more or less complex) may be used for mapping between different systems of different complexity (at least as far as I have undrstood the talk).

Christoph von der Malsburg from the University of Frankfurt / University of Southern California was next with 'About the Organization of Semantic Systems'. First, he spoke about invariant object recognition, the cognitive model of the brain and more general, how does the brain work? For illustration, he presented the following example:
the Binding Problem: consider a blue square and a red triangle. Then in your brain are two sets of neurons, one for considering if a colour is blue or red and another one identifying quares and triangles. But, to get bot together and to recognize which shape has what colour, you are going to need more neurons (thats the binding problem)....which again leads to new binding problems. Are there universal principles of the evolution and organization of semantic systems? -> Yes!!
Then, he also referred to the basic principle questions that have been posed previous to the symposion: What are paradigmatic examples of semantic systems? Formal languages of course....only if you decypher symbols into instances...they don't have meanings by themselves. Programming languages are also not semantic by themselves, but creation and interpretation of natural language requires lots of semantics. Finally, Malsburg concluded that social systems are nothing but a higher level version of the brain....

Today's last presenter was Barry Smith from the University of Buffalo on 'The Evolution of Biomedical Ontologies'. He started his talk with a reference to 'the olden days', when people had difficulties in comparing units of measuring lengths. The situation completely changed, when in June 22, 1799 the Paris meter and kilogramm had been proposed, which startet an evolutionary process that lead to the development of well known SI base units. SI is an ontology and in the same time a controlled vocabulary. It is the achievement of the SI system that makes scientific results comparable. On the other hand, controlled vocabularies are indispensible for scientific advance. Anyway, defining ontologies in biology is much more complex than defining a system of measurement such as SI. There, the Gene Ontology starting in 1999 served as a starting point for the development of obo( open biomedical ontologies). Obo is a shared portal for (so far) 58 ontologies.

Friday, September 28, 2007

CSSW 2007 - Conf. on Social Semantic Web in Leipzig - Aftermath

Back again -- this time in Jena -- I have a few minutes left to draw some resume about the last to days at CSSW 2007.

But first, I have to continue, where I left the day before. The final event yesterday (before the conference diner) was a panel discussion on the topic 'Is there a Social Semantic Web?' with Kingsley Idehen, Marc Fleischmann, KJlaus-Peter Fähnrich, Matthias Bärwolf, and myself. First at all, we did not provide any valid answer to the overall question. In the end, we were discussing, why the Semantic Web does not get the right attention, about (working) business models in Social Web and Social Semantic Web, and about incentives for participation (as well as on open source and open data)....

Finally, I really enjoyed this conference (esp. if you consider my bad experiences with the two conferences last week). O.k., maybe this was because the conference's topic was closer to my core interests compared to e-Learning (DeLFI) or XML (XML-Tage Berlin). There were also much more interesting people to meet, esp. I'm looking forward to meet again a collegue at Potsdam (Universität Potsdam, not HPI). I think, the basic question (as mentioned before) 'Is there a Social Semantic Web?' can be answered with YES. Why?...simply because many Social Semantic Web applications have been resented in the last two days, ranging from Semantic Wikis (where semantic data is authored in a collaborative way), over social networking applications that make use of implicit (semantic) data (and vice versa). I also guess that the topic will become even more important in the very next years. One major point (or objection) was the question about a (working) business model and the incentives being necessary to convince people to participate. It's rather difficult to predict the people's attitude and behaviour. Of course, maybe it's just 'ease of use'. Wikipedia is successful, because everybody can participate with only small effort (he/she must only be able to speak the Wiki-Language). Given that most people have some extraverted tendencies, given the user community of an aplication is of sufficient size, social reputation is an important incentive fr participation. But first, you need to have a sufficient large user community to attract users. Thus, it's similar to the chicken-and-egg problem....
Nevertheless, I hope this tpoic will also be present at this year's IWLS in Korea....

Thursday, September 27, 2007

CSSW 2007 - Conf. on Social Semantic Web in Leipzig, Sep. 27th 2007 - Day 02

The second day of the Social Semantic Web Conference (CSSW 2007 here in Leipzig. Fortunately, Weimar has a rather good train connection to Leipzig (approx. 55 minutes...). Thus, I can sleep at home and don't have to stay in a hotel in Leipzig (and ofcourse the costs for travelling will be reduced...as we have a very low budget at the University for conference travells). Ok, first thing I had to learn was, the conference catering isn't really as bad as I had written yesterday. Actually, you get 5 tickets for refreshments every day (not for the whole conference). Thus, you won't die of thirst ;-)
(the Picture above is showing the University of Leipzig, Jahn Campus, where CSSW 2007 together with SABRE07 takes place)

But, back to the conference program. Today, I will chair the first session and therefore, I'm not able to blog live (at least during the morning). But I will write about the talks as soon as I find some spare time. The following talks are scheduled: Patrick Maué from the University of Münster with the topic 'Collaborative Metadata for Geographic Information'. Patrick is addressing so called participatory Geographic Information Systems (GIS) that are used for decision making, as e.g. in urban planning. Usually this data is published as a catalogue on the web. But, queries to this catalogue suffer of very bad recall and precision. This also comes from the dicersity of people involved in the process of generating metadata. People have different perspective, mental models, and terminology. Thus, the general problem being addressed is refinement of shared metadata. Of the three diffeent levels of semantics ( implicit semantics whic means data gathered from statistical analysis of the original data, soft semantics such as e.g. folksonomies, and formal semantics that allow proper reasoning), Patrick addresses the gap between implicit and soft semantics, and tries to bridge this gap by detecting similarities of metadata.

The following talk by Thomas Riechert has the title'Mapping Cognitive Models to Social Spaces - Lightweight Collaborative Development of Project Ontologies'. There, an application of the SoftWiki ontology for requirement analysis (SWORE) in the software development process is presented. Software development is carried out collaboratively by the different stakeholders of the process with the goal to model an application on an abstract level from different points of view. Stakeholders formulate their requirements in natural language and use the SoftWiki for coordination and aggregation of the requirement model, the project model, and the bug model. Then, stakeholders tag the requirements and from the tags the system extracts relationships between requirements that finally end up in the project model (=project ontology). The content of the three models serves as input to the Software Development process (a.k.a. CASE-Tool), which maps the models to UML and in this way creates the input data for the software developers.

Sören Auer (also co-chair of the conference) concludes the session with a talk on'DBpedia Relationship Finder'. Firt at al, DBpedia is an interesting project that makes use of the (inherent) structured data in wikipedia. This structured data can be found in so called 'info-boxes', i.e. tables usually put in a columns right of the wikipedia article containing data in a well structured (and hopefully commonly agreed) format. This data can be extracted from the wikipedia dump. The structured data is transformed into RDF-Triples that constitute a huge graph. The goal of DBpedia is (in the end) to enable the user to ask complex queries on the structured wikipedia data. The general problem anyway is to visualize this huge amount of data in an efficient way. The relationship finder tries to draw connections between two terms in DBpedia and therefore traverses the DBpedia graph trying to find paths (via different properties being specified by extracted content the original wikipedia info-boxes). These paths are presented in shortest path first order. Up to now, no ranking of paths with similar length is performed. Interesting thing to mention ist that the terms 'Leipzig' and 'Semantic Web' are connected via the property of Leipzig being the city, where Johann Sebastian Bach died ;-)
To get more information, you can exclude certain properties from the result paths (i.d. then only paths, which don't include that specific property are included).

The upcoming session is focusses on presentations around the SoftWiki project (as also was the talk of Thomas Riechert in the first session). The following talks are scheduled: Kim Lauenroth is presenting 'A Processmodel for Wiki-based Requirements Engineering Supported by Semantic Web Technologies', followed by Haiko Cyriaks with 'Supporting Requirements Elicitation by Semantic Preprocessing of Document Collections', followed by Steffen Lohmann with 'Ways of Participation and Development of Shared Understanding in Distributed Requirements Engineering', and Thomas Riechert with 'Towards Semantic Based Requirements Engineering'.

The Session after the lunch break starts with Stefan Kröger from the University of Potsdam with 'Analysing Wiki-based Networks with SONIVIS. The main goal of the Project SONIVIS is (or at least one of the goals...) the Unterstanding of Emergence of Knowledge in Social Knowledge Spaces. As a tool, SONIVIS integrates analysis, evaluation, visualization, and data handling of wiki-based networks.
Rainer Hammwöhner from the University of Regensburg is next talking about 'Semantic Wikipedia -- Checking the Premisses'. For this reason, they took samples from (different multilingual versions of) wikipedia and tested, as e.g., if the wikipedia category system really is a sound taxonomy (...I would say 'no'!). As we already have thought, the category system makes inadequate use of hierarchies, and that the quality of different language versions varies. There seems to be a high amount of disagreement in the category systems of the different language versions.
Joshua Bacher from the Max Planck Institute for Evolutionary Anthropology is giving a Demo Talk on 'BoWiki' - a collaborative editor for biomedical ontologies, gene functions and annotations (originally derived from Semantic MediaWiki, but for being able to use reasoning SMW was given up).
The next demo is given by Marc Fleischmann entitled sMeet-Let's Meet real, a web platform to talk and to sozialice (in an synchronous way) just as in real life. sMeet constitutes a 3D Avatar based virtual community (just as 2nd Live), but connects the virtual world with the phone system....(really an entertaining presentation, I even saw my very first live iPhone...But, I`m missing semantics....). The phone system enables real mobility (a kind of ambient 2nd Live....and the phone system is also something that people are used to pay for) and with sMeet several heterogeneous communities are (at least planned) to be connected.
Richard Cyganiak closes the session with a presentation on 'DBpedia - a Nucleus for a Web of Open Data' providing more background information on the DBpedia project. In DBpedia, every item has an own URI. Simply take the wikipedia URI of an article and substitute 'www.wikipedia.org/wiki' by 'www.dbpedia.org/resource' (here you might find the DBpedia resource named 'Leipzig'). Thus, DBpedia becomes a repository for Semantic Web identifiers.

The final event of today (before the conference diner) was a panel discussion on the topic 'Is there a Social Semantic Web?', in which I took part (therefore no live-blogging). I will refer to that in my next post....

Wednesday, September 26, 2007

CSSW 2007 - Conf. on Social Semantic Web in Leipzig - Sep 26th, 2007

For the next two days I'm going to participate at the CSSW 2007 (Conference on Social Semantic Web) in Leipzig.

The sessions will start at 10am. WLAN is working, live blogging in progress ;-)
Ok, I should not start complaining before it even starts. But, 195 Euros conference fee for 2 days (including allowance for being an 'active' participant) and you have to pay for refreshments in the conference breaks? Ok...we have also received some refreshment tickets...incredibly valuable...representing 5 Euros (of the 195 Euros conference fee)...but one glass of water (or one cup of coffee) is 1 Euro. This makes 5 glasses of water for two days. Anyway.......back to the conference ;-)

After a brief intro by the conference chair Sören Auer, the first talk is presented by Andreas Hees on 'Alternative Searching Services: Seven Theses on the Importance of Social Bookmarking' prommising an interesting combination of traditional search engine indexing and the use of taging. A major difference in both approches lays in the coverage of web pages. While search engines encompass almost up to 85% percent of the 'Surface Web', manual tagging only coveres about a fraction of that. This leads to

  • thesis 1: Limited but Growing Coverage (of Social Bookmarking Services).

  • thesis 2: A smaller index of Social Bookmarking Services does not mean less quality.

  • thesis 3: Less frequent update of Social Bookmarking indexes.

  • thesis 4: The larger the community the more likely users will find specific content

  • thesis 5: SBS are less prone to manipulation (remember the Google vs. BMW case...).

  • thesis 6: SBS are perceived to be more trustworthy than algorithmic search engines

  • thesis 7: Quality of assigned tags will improve


Ok...we all agree on the advantages that SBS do offer. But...how to combine the traditional search engine results with SBS results. This raises issues concerning indexing, actuality, inconsistency, ranking, etc...
The presentation only offers some tag suggestion and tag auto correction mechanisms....so 'how to get better quality tags'. This does not solve the general problem of combining both services.

Rico Landefeld is next, presenting 'Collaborative Web Publishing with a Semantic Wiki', presenting our SemanticWiki implementation Maariwa. One of the questions at the end of the talk concerned the very important topic of the extra 'effort' invested by the author that is necessary for the creation of semantic annotation. The benefit for all the other users is obvious, but for the author himself? Therefore, the extra 'effort' has to be minimized to the limit to make semantic annotation also attractive for the ordinary user.

Mohamed Bishr from the University of Münster concludes the session with 'Weaving Space and Time into the Web of Trust'. What is the influence of spatial dimension on trust? There is evidence of the effects of social network structures on trust as well as of geography on social networking. So the goal is a theory of the dynamics of trust in social networks with respect to the spatio-temporal regularities of the social networks.

The second session is more focused on Semantic Technology. In the first talk, Uldis Bojärs from DERI Galway ist talking about `A Prototype to Explore Content and Context on Social Community Sites`, the SIOC-Explorer ('you may call it RSS on steroids'....as Uldis says). SIOC (pronounces as shok) stands for Semantically Interlinked Online Communities is a W3C submission and is based on an ontology representing the process of social networking on the web (at the SIOC website there is even a wordpress plugin for your blog)....and with the SIOC Explorer SIOC data are crawled and aggregated for browsing and exploring. Furthermore it enables faceted browsing.

The next talk is on 'Adapting an ORDBMS for RDF Storage and Mapping' and is presented by Orri Erling. According to the title a native Relational Database System (Virtuoso) is adapted for RDF. Christian Weiske concludes the session with a talk on 'Implementing SPARQL support for RDBMS and possible enhancments'. To achieve this, a new SPARQL engine was developed and integrated into a database in a way that most of the work load will be accomplished by the database.

So far, so good...Lunch took place nearby in separate room of the Mensa (a review will soon available at küchenschreck and -- the world is small -- I've made some new enjoyable acquaintance, also coming from Potsdam. The afternoon session starts with an invited talk of Hans Hartmann with the topic 'SOA for IT? Hype, Trap, or Hoax?'. First of all, there is a SOA Hype, nobody can deny that. For an illustration, Hartmann compares tha SOA Hype with the excitement of an Austrian radio soccer reporter "....Tor! Tor! Tor! Tor! Tor! - I werd' narrisch! Krankl schießt ein, 3:2 für Österreich" (here you may find the original audio file as mp3).

Next, there is the afternoon's poster (+ demo) session with 5 short presentations. Santtu Toivonen from VTT talks on 'Mobile Social Media - General Characteristics and Interfaces with the Semantic Web'. He starts with a general review on Social Networking Companies and organizes them according to their underlying business models....but in the end, there are no mobile semantic web applications arround.alt least not yet. Next talk on 'Galaxy: IBM Ontological Network Miner' is given by John Judge working at project NEPOMUK (remember the semantic desktop...). I guess I have reviewed this paper...but it has left no remarkable impression so far. Next, Andreas Walter from FZI Karlsruhe on 'IMAGENOTION: Collaborative Semantic Annotation of Images and Work Integrated Creation of Ontologies' being motivated by Image based navigation in Multimedia-Archives. An Imagenotion represents a semantic notion graphically through an image, aggregating synonyms, Labels, links to web pages and other kinds of annotations. In the end, Imagenotions -- if I have got this right -- should ripe (= Ontology Maturing) to real ontologies). Finally....a live presentation (c.f. www.imagenotion.com). The last talk is presented by Philipp Heim from the University Duisburg-Essen on 'Semantic Integrator- Semi-Automatically Enhancing Social Semantic Web Environments.

As my battery is slowly fading away, this was the last paragraph for today (as being also the last talk). There is a 'Sächsischer Abend' announced in the conference schedule and as for my train is leaving at about 7pm, I still have some time left to take a drink. See you tomorrow ;-)

Tuesday, September 25, 2007

XML-Tage 2007 in Berlin - Tag 2 und Resumé

Dienstag, der Sommer scheint endgültig vorbei. Nach einem letzten Aufbäumen in den vergangenen Tagen -- gestern hatte man in Berlin noch einmal den Eindruck, als wäre es gerade Hochsommer. Jede Grünfläche, jede freie Bank in der Sonne und am Wasser wurde 'besetzt', überall saßen die Menschen noch bis spät in den Abend bei lauen Temperaturen. Und heute morgen....alles grau in grau, es wird nicht richtig hell und es nieselregnet so vor sich hin...
(Bild: Apotheose des Antonius...aus einem Treppenaufgang im Seitenflügel der HU Berlin...ich liebe diese klassizistischen Universitäts-'Tempel')

Aber...perfektes Wetter, um den ganzen Tag in mehr oder minder interessanten Vorträgen zu verbringen ohne den permanenten Drang, das ach so perfekte Wetter zu nutzen. Allerdings waren die im Rahmen des wissenschaftlichen Programms angekündigkten Vorträge heute nur von sehr geringem Interesse. Ich werde sie hier zwar nennen, aber nicht näher darauf eingehen:
  • Semantic Registries - Cataloguing eGovernment Resources, M. W. Küster (FH Worms)

  • Applying GRIPP to XML Documents containing XInclude, S. Trißl, F. Zipser, U. Leser

  • Applying the Operational Hierarchy Concept to XSLT, D. Foetsch, E. Pulvermüller

  • Analysis on Inference Mechanisms for Schema-Driven Forms Generation, J. Spillner, A. Schill

  • XML-Strukturen für Worterklärung von Verben, G. Büchel

Statt an den wissenschaftlichen Vorträgen weiter teilzunehmen, habe ich mich in eines der parallel stattfindenden Wirtschaftsforen mit hinein gesetzt und habe mir einen Vortrag von W. Bernhardt vom Bundesministerium der Justiz zum Thema 'Arbeiten mit strukturierten Daten im grenzüberschreitenden Rechtsverkehr' angehört. Der Zweck ist klar: Die Schaffung der Möglichkeit einer grenzüberschreitende (juristischen) Recherche. Dazu wäre es natürlich wünschenswert, wenn sich entsprechende Daten entweder (regelbasiert) ineinander automatisch übersetzen lassen bzw. wenn (zumindest etwa europaweit) standardisierte Daten und Datenformate genutzt werden könnten. Probleme, die dabei auftreten wären Inkonsistenzen, mit denen gerechnet werden muss, wenn Daten redundant (doppelt) gehalten werden und die nur schwierig zu realisierende permanente Aktualisierung dieser Daten. Das Ganze wurde mit Beispielen aus dem Bereich Strafrecht, Wirtschaft und Bankenwesen verdeutlicht. Zusätzlich erschweren auch noch unterschiedliche nationale rechtliche Einschränkungen die grenzüberschreitende Nutzung dieser Daten. Erfrischend, einmal einen solchen Vortrag aus direkter 'Anwender'-Sicht zu hören.

Leider kann ich morgen nicht mehr an den angekündigten Foren zum Thema 'E-Health' und 'E-Government' teilnehmen (Immerhin habe ich Matthias und Justus Bescheid gegeben, dass die Teilnahme an den Wirtschaftsforen kostenlos ist...).

Tja....zusammenfassend, wie soll ich diese Konferenz (nach der Enttäuschung mit der DeLFI 2007 in der vergangenen Woche) werten? Gestern gab es immerhin zwei Vorträge, die ich 'hinreichend' interessant fand. Neues bzw. neue Anregungen hat mir das Ganze leider wieder nicht gebracht. Dazu schien das wissenschaftliche Programm mit gerade einmal 10(+2) Vorträgen (!), die sich über das gesamte Themenspektrum 'XML' bewegten, leider ziemlich 'dünn' geraten. Vom rein wissenschaftlichen Standpunkt aus, rentiert sich eine Teilnahme (zumindest für uns) nicht. Die Wirtschaftsforen dagegen bringen schon etwas interessantere Themen (zudem ist die Teilnahme kostenlos). Zu den Wirtschaftsforen zählen auch die im Foyer der HU Berlin postierten Aussteller- und Informationsstände. Dort habe ich gestern auch Kollegen von Hylos getroffen, die mit einem eigenen Stand präsent waren. Bei einer kurzen Unterhaltung ergab sich allerdings, dass gerade einmal 2 (in Worten 'zwei') interessierte Konferenzteilnehmer an den Stand kamen (so dass sich die Präsenz dort nicht wirklich rentierte...).

Fazit: Sollte ich im kommenden Jahr wieder Zeit haben, um hier vorbeizuschauen, werde ich mich wahrscheinlich lediglich zu den Wirtschaftsforen anmelden und nicht am wissenschaftlichen Programm teilnehmen.
Morgen geht es weiter in Leipzig mit der Conference on Social Semantic Web über die hier natürlich wieder berichtet werden wird...

Monday, September 24, 2007

XML-Tage 2007 in Berlin, 24.09.2007 - Tag 1

Die kommenden zwei Tage werde ich mich auf den XML-Tagen 2007 in Berlin aufhalten und täglich live aus dem wissenschaftlichen Programm hier aus dem historischen Hauptgebäude Unter den Linden berichten. Wobei....der Tag fängt ja schon gut an, ich komme -- natürlich -- etwas später, verpasse die Begrüßung, den ersten eingeladenen Redner und die erste Vormittagssession und komme gerade rechtzeitig zur Kaffeepause um 11 Uhr. Hier hoffe ich Rico zu treffen, der unser Paper 'Maariwa - Ontologie-basiertes Web-Publishing mit einem Semantic Wiki' am Nachmittag vorstellen wird....

Na perfekt, das wissenschaftliche Programm macht erst einmal von 11 Uhr bis 13.30 Uhr Pause....und Rico ist noch gar nicht da, sondern übt seinen Vortrag -- hat aber versprochen, bis 15 Uhr hier zu sein. Naja...MIttagessen gibt es dann im 'Cum Laude' und dann werde ich mal sehen, ob sich hier irgendetwas interessantes bis zum frühen Nachmittag abspielen wird....

O.k., ich hab doch noch eine 'sinnvolle' Beschäftigung gefunden. Im Senatssaal findet eine Podiumsdiskussion zum Thema 'Wachstum durch Cluster' statt. Hans Peter Hiepe vom Bundesministerium für Bildung und Forschung, Dr. Joachim Quantz und Reiner Thiem von xmlcity:berlin, Dr. Carsten Rudolph von Microsoft und Prof. Jochen Schiller, Vizepräsident der FU Berlin. Nach einer kurzen Einleitung und Vorstellung der Teilnehmer durch Herrn Thiem äußern sich zunächst alle Teilnehmer der Podiumsdiskussion zum Thema 'Innovationsförderung'. Obwohl -- so Prof. Schiller -- an den deutschen Hochschulen verstärkt gegründet wird (insbesondere auch an der FU Berlin), geht -- so Dr. Rudolph -- die Anzahl der Firmengründungen an Universitäten insgesamt zurück. Herr Hiepe kritisiert die an den Hochschulen übliche Praxis, Gründungsinitiativen -- wie das durch ihn initiierte EXIST Programm (von dem ja auch OSOTIS profitiert hat) -- zu sehr wie ein akademischer Aufgabenkatalog im Rahmen einer Art Weiterqualifizierung abgehandelt werden. Kritik an den Existenzgründungsinitiativen und Programmen wird am jeweils zu starren zeitlichen Rahmen geäußert, den diese unterliegen. Das EXIST-SEED Programm fördert -- ohne anschließende Evaluation -- jedes Projekt, das sich dafür qualifiziert für genau ein Jahr. Wie sich das zu gründende Unternehmen danach finanziert, darum muss sich das Unternehmen selbst kümmern. Leider gibt es viele Projekte, die weitere 6 Monate Entwicklungsarbeit benötigen, die leider nicht weitergefördert werden können. Hier wäre auch meiner Meinung nach Bedarf, nach einer entsprechenden Evaluation die Möglichkeit einer Weiterförderung einzuräumen.

So...leider herrschte im lokalen WLAN seit dem M ittagessen Funkstille....soviel zum Thema Live-Blogging. Nun ja....hier also mein Bericht von unserem Nachmitteg:

Lutz Suhrbier (FU Berlin) 'Vertrauenswürdige Aussagen im Semantic Web'
Vertrauen ist -- laut Niklas Luhman - nichts anderes als eine Art Mechanismus zur Reduktion sozialer Komplexität. Man leistet eine Art 'riskante Vorleistung' als potentiell unsichere Investition in die Zukunft. Nachdem dieser Begriff geklärt wäre, benötigt man für das Thema noch das 'Semantic Web' (als Anwendungsraum) und die 'Inferenz' oder das 'Reasoning', dass uns in die Lage versetzt, aus vorhandener Information neue Information zu schließen. Das eigentliche Problem reduziert sich dann darauf, wie eine Absicherung der im Semantic Web befindlichen Aussagen getroffen werden kann -- und welche Auswirkungen hat dies auf die Verlässlichkeit von daraus hergeleiteten Aussagen?
Lösungsansätze sollten dabei Autorenbezogen sein. Dazu muss als erstes zweifelsfrei zugeordnet werden können, Aussage X ist von Autor Y getroffen worden. Das ist nur möglich, wenn jede Aussage (und jeder Autor) einen persistenten Bezeichner (also z.B. einen URN) besitzt. Daraufhin hat dann auch jeder Benutzer Z die Möglichkeit, eine von Y getroffene Aussage zu bestätigen oder nicht. Zudem kann eine Aussage auch von einer über jeglichen Zweifel erhabenen Instanz (also einer Art 'Trust-Center') beglaubigt werden. Darauf aufbauend wurde ein Konzept einer 'Trusted Semantic Web Infrastructure' inklusive 'Semantic Web Zertifizierungsstelle' und 'RDF-Zertifikaten' vorgestellt....allerdings eben nur als Konzept und nicht im Sinne einer konkreten Implementation. Also....irgendwie war mir das alles bereits bekannt bzw. war die Übertragung auf das Semantic Web 'straightforward'.

Um 15 Uhr sollte ja eigentlich Ricos Vortrag starten -- nur fand sich kein Session-Chair. Also wurde gut 10 Minuten gewartet und daraufhin eröffneten wir als Teilnehmer die Sitzung kurzerhand selbst. Rico stellt dabei das Ergebnis seiner Diplomarbeit -- eine SemanticWiki-Implementierung mit Namen 'Maariwa' vor. Nach einer Analyse bereits vorhandener Ansätze wurde nach den Schlüsselkriterien gesucht, die einen guten Kompromiss zwischen semantischer Ausdrucksstärke und Benutzbarkeit (auch und vor allem durch Nicht-Experten) ausmachen. Das Ergebnis ist eine graphische Benutzeroberfläche (keine eigene Wiki-Sprache), entsprechende Import-/Export-möglichkeiten, und eine möglichst einfache - wobei aber durchaus semantisch ausdrucksstarke Abfragesprache, um 'Wissen' aus dem SemanticWiki herauszuziehen.

Im Anschluss trägt Stefan Thalmann von der Universität Innsbruck zum Thema 'Kollaboratives Tagging zur inhaltlichen Beschreibung von Lern- und Wissensressourcen' vor. Zunächst wird auf die unterschidlichen Varianten der Metadatengenerierung eingegangen. Während subjektive Metadaten wie Schlagworte oder Bewertungen nur schwierig oder gar nicht auf automatische Weise zu gewinnen sind, lassen sich objektive Metadaten (Charakteristika) relativ einfach automatisch gewinnen. Danach wird auf das 'Tagging' eingegangen. Allerdings unterscheidet Thalbach lediglich zwischen allgemein anerkannten 'objektiven' Tags und persönlichen 'subjektiven' Tags und verzichtet auf eine weitere Untergliederung in objektive und funktionale Tags, wobei hier objektive Tags inhärente Eigenschaften und Charakteristika der Ressource kennzeichnen, während funktionale Tags stets in einem speziellen Kontext stehen. Eine nachfolgende Studie, in der Studierende mit Taggingaufgaben betraut wurden, brachte meines Erachtens aber auch keine bahnbrechenden neuen Erkentnisse.

So...kurze Nachbesprechung/Maneuverkritik mit Rico, E-Mails beantworten (da das Netz wieder funktioniert, Bloggen, und warten auf das 'Networking am Buffet'...
Apropos Buffet: dieses war zwar recht lecker (immerhin gabe es guten Wein und ein leckeres kaltes Buffet), aber mit dem Networking haperte es dann doch, da vielleicht höchstens 20-30 Leute vom heutigen Tag übrig geblieben waren. Insgesamt hatte ich auch den Eindruck, dass die Teilnahme gegenüber letztem Jahr diesmal sehr gering ausfiel.

Friday, September 21, 2007

Everybody DOS now....

Words fail me....
Just take a look at this vintage MS-DOS advertisement video clip...

'everybody DOS now...'

Now with 384K extra available RAM....Really I remember those days, but it seems to be ages away. Thank god I have my Mac....

[via MissM@rple's Blog]

DeLFI 2007 in Siegen - Nachlese

Wieder zu Hause stellt sich -- wie nach jedem Konferenzbesuch -- die Frage, was hat es gebracht? Zuallererst einmal eine 'fette' Erkältung, mit der ich heute morgen aufgewacht bin. Die eiszeitliche Klimatisierung der Hörsäle an der Uni Siegen, der man auch nicht durch einen beliebigen Sitzplatzwechsel aus dem Weg gehen konnte, fordert seinen Tribut. Vielleicht waren deswegen die Vorträge auch immer so spärlich besucht -- wobei wir ja schon beim Thema wären. Angeblich - so zumindest die Aussage des Co-Chairs bei der Ansprache während des Konferenz-Dinners - wären ja knapp 100 Teilnehmer alleine zur DeLFI angemeldet gewesen. Aber abgesehen von den Vortragenden (und das waren 23 Vortragende und eine Handvoll Posterpräsentationen) habe ich nicht wirklich viele Teilnehmer zählen können. Von den o.g. 23 Vorträgen waren für mich (einschließlich unseres eigenen) maximal 3.4 (wenn ich seeehr großzügig bin) interessant. Von den Themen (siehe hierzu auch die anderen Postings zur DeLFI 1 / 2 / 3) bot diese Veranstaltung wenig Neues. Ergebnisse von Evaluationen und Studien (jeweils wenig repräsentativ und manchmal mit recht fragwürdigen Schlüssen und Ergebnissen -- zumindest erschienen mir diese nicht ernsthaft hinterfragt worden zu sein), Detailverbesserungen in Nischen und ein oder zwei technisch ganz interessante Entwicklungen. Ich frage mich aber, wie sich ein nicht unwesentlicher Teil der Arbeiten für die Veranstaltung qualifizieren konnte. Liegt es vielleicht daran, dass aus dem e-Learning-Hype der vergangenen Jahre endlich 'die Luft heraus ist' ? Oder etwa daran, dass Didaktiker und Informatiker doch nicht die selbe Sprache sprechen...? Auch drängt sich manchmal der Gedanke auf, dass die 'Alteingesessenen' nur ihre Pfründe sichern wollen. Auch scheint es, dass der Blick über die Grenzen des Fachgebiets hinweg oftmals zu kurz kommt (-> zum Thema Logfileanalysen kann man ruhig mal den Leuten vom Information Retrieval und Data Mining über die Schulter blicken....bei der Navigationsanalyse sollte man sich mal die Methodik und die Ergebnisse der Usability-Gemeinde zu Gemüte führen...bei der Videosegmentierung sollte man sich nicht nur auf einen Spezialfall, der den Autor an der 'kurzen Leine hält' festlegen...Aber ich möchte dabei auf keinen Fall den Autoren zu nahe treten, sondern vielmehr an die Gutachter appellieren!).

Nunja, inhaltlich haben wir leider nichts dazugelernt. Zum Glück haben wir aber dennoch einige neue, interessante Kontakte knüpfen können und wir freuen uns schon auf eine mögliche Zusammenarbeit. Aber ob wir im kommenden Jahr wieder einreichen werden scheint mir eher unwahrscheinlich. Sollten wir uns neben unseren Kernthemen wieder dem Schwerpunkt e-Learning widmen, dann wahrscheinlich auf einer der internationalen Konferenzen mit eher technischer als didaktischer Ausrichtung.

Die Schönheit Siegens -- folgt man den Worten des Co-Chairs während des Konferenz-Dinners -- erschließt sich 'ja eher auf den zweiten Blick'. Dem kann ich spontan zustimmen. Zumindest gilt dies unwidersprochen für den Umkreis rund um den Bahnhof der Stadt. Erst am letzten Tag lernten wir die Oberstadt mit der historischen Altstadt, d.h. dem Schloss, Rathaus und Nikolaikirche kennen (-> hier gibts eine Menge Fotos von Siegen), so dass wir uns letztendlich etwas versöhnlicher auf die Heimreise begeben konnten.

Nächste Woche werde ich live von den XML-Tagen 2007 in Berlin und von der CSSW 2007 (Conference on Social Semantic Web) aus Leipzig berichten....