Technical and Semantic Analysis Meeting in Grenoble

Technical and Semantic Meeting
21-22 January 2010

Xerox Research Centre Europe,
6, chemin de Maupertuis, 38240 Meylan, France

Technical Meeting

21 January 14h-18h

Crawling/indexing (1 1/2 hours)

  • architecture: interactions between crawler, repository, text extraction, morphological and semantic analysis, indexer
  • XML format for results of morphological and semantic analysis
  • increase size of collection
  • better detection of educational research documents (includes ISN classifier integration)
  • more metadata (what metadata is important, e.g. date published, title, author)

Break (30 min)

Querying (other than multilingual functionality) (1 hour)

  • use of key sentences for ranking and displaying results
  • interface usability and additional features
  • transfer of web interface development to Xerox?

Evaluation of semantic extraction, multilingual functionality (1 hour)

  • evaluation criteria, methodology
  • support for evaluation in query interface

22 January 13h-15h

Two-hour technical session in parallel with semantic session

WP9 multilingual functionality (1 1/2 hours, divided roughly as shown)

  • discuss resource types and how to use them in search engine (45 minutes)
    • IRDP & DIPF present their work on term networks
    • general-purpose resources
      • bilingual dictionaries
      • (monolingual thesauri and subject headings without concordances: are they useful?)
    • morphological analyzers
  • share info on available resources for 15 languages (15 min)
  • query translation user interface (1/2 hour)
    • look at CACAO interface and RRZN thesaurus demo together

Wrap-up (1/2 hour)

Semantic Analysis Meeting, 22 January 2010

Time           Topic                                               Speaker
9h00 - 9h20    General introduction                                Ágnes Sándor
9h20 - 9h50    Metadata, Thesauri, Multilingual concordances       Angela Vorndran
9h50 - 10h05   Dictionaries and morphological analysis             Ágnes Sándor
10h05 - 10h30  Break
10h30 - 10h50  Testing                                             Ágnes Sándor
10h50 - 11h20  Sustainability                                      Virginia Moukouli
11h20 - 12h00  Automatic semantic analysis                         Frederique, Ágnes Sándor
12h00 - 13h00  Break
13h00 - 13h20  Bibliometrics and semantics                         Frederik
13h20 - 13h40  Peer-reviewing                                      Christian Rudelt, Ingrid Gogolin
13h40 - 15h00  Common analysis of an article                       All participants of the meeting
15h00 - 15h30  Break
15h30 - 16h30  Discussion about an integrated view of the project  All participants of the meeting
16h30 - 17h00  Wrap-up                                             All participants of the meeting