NewsExplorer
MedISys
NewsBrief
EMM-Labs


 

Ralf  Steinberger

European Commission
Joint Research Centre - Ispra site
Institute for the Protection and Security of the Citizen (IPSC)
Global Security and Crisis Management Unit (GlobeSec)

T.P. 267
21027 Ispra (VA), Italy

Ralf.Steinberger _@_ jrc.ec.europa.eu  (spam protection)

http://langtech.jrc.ec.europa.eu
http://emm.newsbrief.eu/overview.html
http://emm.newsexplorer.eu/
http://medusa.jrc.it/

Tel: +39 - 0332 78 6271 + 5648
Fax: +39 - 0332 78 5154

 

How NOT to spell my name:
Ralph Steinberger, Ralph Steinburger, Ralph Stienberger, Ralph Stienburger,

Ralf Steinburger, Ralf Stienburger, Ralf Steimberger, Ralf Stijnberger, Ralf Steimbergher,

Ralf Steinbergher, Ralf Stainbergher, Ralf Steinberg, Ralf Stienberg, Ralf Stienburg, Correct: Ralf Steinberger.

 

Professional Profile    Publications  
Professional Experience    Reports  
University Education   Hobbies and Interests
Languages Disclaimer

 

Professional Profile

I am a computational linguist with specialisation in multilingual and cross-lingual applications. My personal aim has always been to apply scientific knowledge to produce applications for real-life environments. Rather than aiming for monolingual performance optimisation, our focus always is on higher language coverage (typically 10 to 20 languages).


As a linguist and scientist, I have worked in the framework of different grammar theories, but I believe that systems using statistical and heuristic information in addition to linguistic knowledge can achieve better results. I started my LT career working with rule-based approaches (machine translation). The JRC requirement of covering many languages while working in a small team led me to statistical and Machine Learning approaches.


From an application point of view, I worked on machine translation, computer-assisted language learning, dictionary conversion, multilingual document generation, information extraction, keyword assignment and classification. Furthermore, I have initiated and supervised work on multilingual sentiment analysis (opinion mining), information extraction (including named entity recognition, event and relation extraction, geo-tagging, quotation extraction), document clustering and classification, document navigation, visualisation of extracted data, language recognition, summarisation, and various social networks, based on different types of input extracted from text.


Professional Experience


1998 - today

Language Technology Project Manager at the European Commission's Joint Research Centre (JRC) in Ispra (Italy).


Main focus: News analysis; multilingual document retrieval; information extraction and information visualisation, using mainly multilingual thesauri and nomenclatures, as well as statistical and Machine Learning techniques; multilingual linguistic resources (JRC-Acquis; JRC-Names; JRC Eurovoc Indexer JEX; DGT-Translation Memory, and more).


Our tool set includes multilingual tools for language recognition; automatic keyword identification; cross-lingual assignment of thesaurus indexing terms; the identification and disambiguation of geographical references in text (geo-tagging); creation of animated geographical maps on the basis of place names mentioned in text; document similarity calculation; clustering; classification; summarisation; terminology extraction; named entity recognition; event recognition, visualisation of the contents of large document collections and document navigation, sentiment analysis, software for document retrieval using a web crawling software agent. (See the JRC's Language Technology page)

1994 - 1997

Senior Research Scientist at Sharp Laboratories of Europe (SLE) in Oxford, UK: responsible for Language Technology products (multilingual document generator and multilingual phrase book); data mining from electronic dictionaries, machine translation, summarisation.

IV - VII 1994

Research Fellow at Kyushu Institute of Technology (KIT) in Iizuka, Japan.

1991 - 1994

Research Associate at the Language Technology Department of the University of Manchester - Institute of Science and Technology (UMIST) in the UK: machine translation and computer-assisted language learning (CALL).


Deputy head and technical manager of the UMIST teams of EC-funded MLAP machine translation projects TRADE and CAT2-EDS.

1991

Visiting Scientist at Institute for Applied Information Science (IAI) in Saarbrücken (D): machine translation.

1986 - 1990

Production management, marketing, public relations, sales promotion in the industrial rubber foam company PANA Schaumstoff GmbH in Geretsried (D) (part time / full time).

1984 - 1985

Teacher assistant at Lycée Louis-Le-Grand in Paris (F).

1981 - 1982

All-round executive training in the textiles firm PANA Werk KG in Wolfratshausen (D).


University Education


1992 - 1994

Ph.D. in Computational Linguistics at University of Manchester - Institute of Science and Technology (UMIST) in Manchester (UK): A Study of Word Order Variation in German with special Reference to Modifier Placement.

1983 - 1991

Magister Artium (M.A.) ‘with distinction’ in Theoretical Linguistics with French linguistics and Spanish linguistics at Ludwig Maximilians Universität (LMU) München (D), in parallel to working. Studies in Berlin and Munich.

1980

Abitur with specialisation in French and Mathematics at Gymnasium Pullach (D).


Language Skills


German

Native language

English

7 years at high-school level, lived in the UK for 6 years, major working language since 1991.

French

7 years at high-school level, lived in France for ~ 24 months; university studies of French linguistics, language, literature and mediaeval studies.

Italian

Courses at university and at the JRC; living in Italy since 1998.

Spanish

4 months of intensive, full-time language courses in Spain. University studies of Spanish linguistics, language, literature and mediaeval studies.

Japanese

Basic notions of the grammar and of the writing systems.


Hobbies and Interests


I am interested in projects to support developing countries. I am an active member of the charitable EC – JRC organisation Association Europe – Third World (Europa - Terzo Mondo, ETM).

 


My favourite sports are table tennis, volleyball and tennis. I also like to play softball and Frisbee.

 

I love to travel, especially long-distance, or to experience a country through longer work-related stays.

 

I am interested in photography, and especially in capturing people. A few photos from trips to Senegal, Cameroon, India, Mali, Ethiopia, Zambia, Qatar, Ukraine, Armenia, Japan, Istanbul, Saint Petersburg and more are online. I am a member of the Foto-Cine-Club at the Joint Research Centre in Ispra.

 

I like playing chess. I am interested in cinema, going to the theatre and visiting art galleries, although our children recently did not leave a lot of time for these activities.

 

I actively enjoy meeting new people at social gatherings.

Publications      (Please contact the author for papers not available here) (Look on Google Scholar)

  • Steinberger Ralf, Guillaume Jacquet & Leonida Della Rocca (forthcoming). Creation and Use of Multilingual Named Entity Variant Dictionaries. In: Marc Van Campenhoudt & Rita Temmerman (eds): Translation at the Frontiers of the Lexicon: The New Fields of Terminology. Éditions Modulaires Européennes, Fernelmont, Belgium.
  • Steinberger Ralf, Mohamed Ebrahim, Alexandros Poulis, Manuel Carrasco-Benitez, Patrick Schlüter, Marek Przybyszewski & Signe Gilbro (2014). An overview of the European Union's highly multilingual parallel corpora. Language Resources and Evaluation Journal (LRE). DOI: 10.1007/s10579-014-9277-0. (Read Manuscript)
  • Steinberger Josef, Ralf Steinberger, Hristo Tanev, Vanni Zavarella & Marco Turchi (2014). Aspects of Multilingual News Summarisation. In: Alessandro Fiori (ed): Innovative Document Summarization Techniques: Revolutionizing Knowledge Understanding, pp. 277-294. IGI Global Information Science Reference Series, Hershey, PA, USA.
  • Balahur Alexandra, Eric van der Goot, Ralf Steinberger & Andrés Montoyo (eds.) (2014). Proceedings of the 5th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA'2014), held at ACL'2014. Baltimore (MD), USA, 27 June. ISBN: 978-1-941643-11-2.
  • Pajzs Júlia, Ralf Steinberger, Maud Ehrmann, Mohamed Ebrahim, Leonida Della Rocca, Stefano Bucci, Eszter Simon, Tamás Váradi (2014). Media Monitoring and Information Extraction for the Highly Inflected Agglutinative Language Hungarian. Proceedings of the 9th edition of the Language Resources and Evaluation Conference (LREC), Reykjavik, Iceland, 26-31 May 2014, pp. 2049-2056.
  • Hajlaoui Najeh, David Kolovratnik, Jaakko Väyrynen, Daniel Várga, Ralf Steinberger (2014). DCEP - Digital Corpus of the European Parliament. Proceedings of the 9th edition of the Language Resources and Evaluation Conference (LREC), Reykjavik, Iceland, 26-31 May 2014, pp. 3164-3171.
  • Küçük Dilek, Guillaume Jacquet, Ralf Steinberger (2014). Named entity recognition on Turkish Tweets. Proceedings of the 9th edition of the Language Resources and Evaluation Conference (LREC), Reykjavik, Iceland, 26-31 May 2014, pp. 450-454.
  • Jacquet Guillaume, Maud Ehrmann, Ralf Steinberger (2014). Clustering of multi-word named entity variants: Multilingual evaluation. Proceedings of the 9th edition of the Language Resources and Evaluation Conference (LREC), Reykjavik, Iceland, 26-31 May 2014, pp. 2548-2553. (Read online)
  • Balahur Alexandra, Marco Turchi, Ralf Steinberger, José M. Perea-Ortega, Guillaume Jacquet, Dilek Küçük, Vanni Zavarella, Adil El Ghali (2014). Resource creation and evaluation for multilingual sentiment analysis in social media texts. Proceedings of the 9th edition of the Language Resources and Evaluation Conference (LREC), Reykjavik, Iceland, 26-31 May 2014, pp. 4265-4269.
  • Küçük Dilek & Ralf Steinberger (2014). Experiments to Improve Named Entity Recognition on Turkish Tweets. Proceedings of the EACL'2014 workshop Language Analysis in Social Media (LASM), pp. 71-78. Gothenburg, Sweden, 27 April 2014.
  • Zavarella Vanni, Hristo Tanev, Ralf Steinberger & Eric van der Goot (2014). An ontology-based approach to social media mining for crisis management. Proceedings of the Workshop on Social Media and Linked Data for Emergency Response (SMILE'2014). Anissaras, Crete, Greece, 25-29 May 2014.
  • Steinberger Ralf, Maud Ehrmann, Júlia Pajzs, Mohamed Ebrahim, Josef Steinberger & Marco Turchi (2013). Multilingual Media Monitoring and Text Analysis – Challenges for highly inflected languages. In: Ivan Habernal & Václav Matoušek (eds). Text, Speech and Dialogue. 16th International Conference, TSD 2013, Pilsen, Czech Republic, September 2013, Proceedings. Springer Lecture Notes in Artificial Intelligence LNAI 8082, pp. 22-33. (Manuscript of 'Multilingual Media Monitoring and Text Analysis – Challenges for highly inflected languages')
  • Steinberger Ralf (2013). Multilingual and cross-lingual news analysis in the Europe Media Monitor (EMM). In: Mihai Lupu, Evangelos Kanoulas & Fernando Loizides (eds.): Multidisciplinary Information Retrieval. 6th Information Retrieval Facility Conference (IRFC'2013), Limassol, Cyprus. Springer Lecture Notes in Computer Science, Vol. 8201, pp. 1-4.
  • Kabadjov Mijail, Josef Steinberger & Ralf Steinberger (2013). Multilingual Statistical News Summarization. In: Thierry Poibeau, Horacio Saggion, Jakub Piskorski & Roman Yangarber (eds), Multi-source, Multilingual Information Extraction and Summarization, pp. 229-252. Isbn: 978-3-642-28569-1, Doi: 10.1007/978-3-642-28569-1_11, Springer Series: Theory and Applications of Natural Language Processing; Berlin & Heidelberg, Germany.
  • Ehrmann Maud, Leonida della Rocca, Steinberger Ralf & Hristo Tanev (2013). Acronym recognition and processing in 22 languages. Proceedings of the 9th Conference Recent Advances in Natural Language Processing (RANLP), pp. 237-244. Hissar, Bulgaria, 7-13 September 2013.
  • Steinberger Ralf (2012). A survey of methods to ease the development of highly multilingual Text Mining applications. Language Resources and Evaluation Journal, Springer, Volume 46, Issue 2, pp. 155-176 (DOI 10.1007/s10579-011-9165-9). (pre-final version of 'A survey of methods to ease the development of highly multilingual text mining applications')
  • Steinberger Ralf (2012). Het nut van de enorme meertalige tekstcollecties van de EU om taaltechnologische oplossingen voor alle EU-talen te bouwen (English: Exploiting the EU’s enormous multilingual text collections to build Language Technology solutions for all EU languages). In: HLT Magazine DIXIT, Tijdschrift over Taal- en Spraaktechnologie. Issue: Big Data en TST.
  • Steinberger Ralf, Mohamed Ebrahim & Marco Turchi (2012). JRC EuroVoc Indexer JEX - A freely available multi-label categorisation tool. Proceedings of the 8th international conference on Language Resources and Evaluation (LREC'2012), pp. 798-805, Istanbul, 21-27 May 2012.
  • Steinberger Ralf, Andreas Eisele, Szymon Klocek, Spyridon Pilos & Patrick Schlüter (2012). DGT-TM: A freely Available Translation Memory in 22 Languages. Proceedings of the 8th international conference on Language Resources and Evaluation (LREC'2012), pp. 454-459, Istanbul, 21-27 May 2012.
  • Ebrahim Mohamed, Maud Ehrmann, Marco Turchi & Ralf Steinberger (2012). Multi-label EuroVoc classification for Eastern and Southern EU Languages. In: Cristina Vertan & Walther v. Hahn: Multilingual processing in Eastern and Southern EU languages - Low-resourced technologies and translation, pp. 370-394. ISBN: 978-1-4438-3878-8, Cambridge Scholars Publishing, Cambridge, UK.
  • Steinberger Ralf (2012). Cross-lingual similarity calculation for plagiarism detection and more - Tools and Resources. In: Pamela Forner, Jussi Karlgren & Christa Womser-Hacker (eds): CLEF 2012 Evaluation Labs and Workshop. Abstracts - Working Notes Papers, p.81. FBK Press, Trento, Italy.
  • Turchi Marco, Martin Atkinson, Alastair Wilcox, Brett Crawley, Stefano Bucci, Ralf Steinberger & Erik van der Goot (2012). ONTS: "OPTIMA" News Translation System. Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL), pp. 25–30, Avignon, France, April 23 - 27 2012.
  • Steinberger Josef, Polina Lenkova, Mohamed Ebrahim, Maud Ehrmann, Silvia Vázquez, Ali Hürriyetoğlu, Mijail Kabadjov, Ralf Steinberger, Hristo Tanev & Vanni Zavarella (2012). Creating Sentiment Dictionaries via Triangulation. Journal Decision Support Systems 53 (4); 2012, pp. 689-694, DOI: http://dx.doi.org/10.1016/j.dss.2012.05.029, ISSN: 0167-9236, Elsevier, Amsterdam.
  • Balahur Alexandra, Mijail Kabadjov, Josef Steinberger, Ralf Steinberger & Andrés Montoyo (2012). Challenges and solutions in the opinion summarization of user-generated content. Journal of Intelligent Information Systems (JIIS) 39 (2); 2012, pp. 375-398, Springer. DOI: 10.1007/s10844-011-0194-z.
  • Steinberger Ralf, Sylvia Ombuya, Mijail Kabadjov, Bruno Pouliquen, Leonida Della Rocca, Jenya Belyaeva, Monica De Paola & Erik van der Goot (2011). Expanding a multilingual media monitoring and information extraction tool to a new language: Swahili. Language Resources and Evaluation Journal, Volume 45, Issue 3, pp. 311-330 (DOI 10.1007/s10579-011-9165-9). (pre-final version)
  • Steinberger Ralf, Bruno Pouliquen, Mijail Kabadjov & Erik van der Goot (2011). JRC-Names: A freely available, highly multilingual named entity resource. Proceedings of the 8th International Conference Recent Advances in Natural Language Processing (RANLP'2011), pp. 104-110. Hissar, Bulgaria, 12-14 September 2011.
  • Steinberger Ralf (2011). Combining various text analysis tools for multilingual media monitoring. In: Hamburg Working Paper in Multilingualism 96-2011. In: Hanna Hedeland, Thomas Schmidt, Kai Wörner (eds.). Multilinguali Resources and Multilingual Applications. Proceedings of the Conference of the German Society for Computational Linguistics and Language Technology (GSCL'2011), pp. 25-30. Hamburg, Germany, 28-30 September 2011
  • Steinberger Josef, Polina Lenkova, Mijail Kabadjov, Ralf Steinberger & Erik van der Goot (2011). Multilingual Entity-Centered Sentiment Analysis Evaluated by Parallel Corpora. Proceedings of the 8th International Conference Recent Advances in Natural Language Processing (RANLP'2011), pp. 770-775. Hissar, Bulgaria, 12-14 September 2011.
  • Steinberger Josef, Jenya Belyaeva, Jonathan Crawley, Leonida Della Rocca, Mohamed Ebrahim, Maud Ehrmann, Mijail Kabadjov, Ralf Steinberger & Erik van der Goot (2011). Highly Multilingual Coreference Resolution Exploiting a Mature Entity Repository. Proceedings of the 8th International Conference Recent Advances in Natural Language Processing (RANLP'2011), pp. 254-260. Hissar, Bulgaria, 12-14 September 2011.
  • Steinberger Josef, Hristo Tanev, Mijail Kabadjov & Ralf Steinberger (2011). Aspect-Driven News Summarization. In: International Journal of Computational Linguistics and Applications 2 (1-2), pp 301-317, Bahri Publications, ISSN: 0976-0962. (Pre-final version)
  • Ehrmann Maud, Marco Turchi & Ralf Steinberger (2011). Building a Multilingual Named Entity-Annotated Corpus. Proceedings of the 8th International Conference Recent Advances in Natural Language Processing (RANLP'2011), pp. 118-124. Hissar, Bulgaria, 12-14 September 2011.
  • Steinberger Josef, Polina Lenkova, Mohamed Ebrahim, Maud Ehrmann, Silvia Vázquez, Ali Hürriyetoğlu, Mijail Kabadjov, Ralf Steinberger, Hristo Tanev & Vanni Zavarella (2011). Creating Sentiment Dictionaries via Triangulation. Proceedings of the 2nd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis, WASSA, held at the ACL-HLT Conference, pp. 28-36. Portland, Oregon, USA, 24 June 2011.
  • Steinberger Josef, Mijail Kabadjov, Ralf Steinberger, Hristo Tanev, Marco Turchi & Vanni Zavarella (2011). Towards Language-Independent News Summarization. Proceedings of the Text Analysis Conference (TAC'2011). National Institute of Standards and Technology (NIST), Gaithersburg, Maryland, USA, 14-15 November 2011.
  • Steinberger Ralf (2010). Challenges and Methods for Multilingual Text Mining. In: Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC'2010). Valletta, Malta, 19-21 May 2010.
  • Tanev Hristo, Bruno Pouliquen, Vanni Zavarella & Ralf Steinberger (2010). Automatic Expansion of a Social Network Using Sentiment Analysis. In: Nasrullah Memon, Jennifer Jie Xu, David Hicks & Hsinchun Chen (eds). Annals of Information Systems, Volume 12. Special Issue on Data Mining for Social Network Data, pp. 9-29. Springer Science and Business Media (DOI 10.1007/978-1-4419-6287-4_2).
  • Linge Jens, Ralf Steinberger, Flavio Fuart, Stefano Bucci, Jenya Belyaeva, Monica Gemo, Delilah Al-Khudhairy, Roman Yangarber & Erik van der Goot (2010). MedISys: Medical Information System. In: Eleana Asimakopoulou & Nik Bessis (eds). Advanced ICTs for Disaster Management and Threat Detection: Collaborative and Distributed Frameworks, pp. 131-142. IGI Global. (Purchase online)
  • Balahur Alexandra, Ralf Steinberger, Mijail Kabadjov, Vanni Zavarella, Erik van der Goot, Matina Halkia, Bruno Pouliquen & Jenya Belyaeva (2010). Sentiment Analysis in the News. In: Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC'2010), pp. 2216-2220. Valletta, Malta, 19-21 May 2010.
  • Turchi Marco, Josef Steinberger, Mijail Kabadjov & Ralf Steinberger (2010). Using parallel corpora for multilingual (multi-document) summarisation evaluation. In: Maristella Agosti, Nicola Ferro, Carol Peters, Maarten de Rijke & Alan Smeaton. Multilingual and Multimodal Information Access Evaluation. Springer Lecture Notes for Computer Science, LNCS 6360/2010, pp. 52-63 (Presented at CLEF'2010).
  • Steinberger Josef, Marco Turchi, Mijail Kabadjov, Nello Cristianini & Ralf Steinberger (2010). Wrapping up a Summary: from Representation to Generation. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL'2010), pp. 382-386. Uppsala, Sweden, 11-16 July.
  • Kabadjov Mijail, Martin Atkinson, Josef Steinberger, Ralf Steinberger & Erik van der Goot (2010). NewsGist: A Multilingual Statistical News Summarizer. In: Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD'2010). Barcelona, Spain, 20-24 September 2010. In: José Luis Balcázar, Francesco Bonchi, Aristides Gionis and Michèle Sebag (eds): Lecture Notes in Computer Science, Vol. 6323, pp. 591-594. Springer.
  • Zaghouani Wajdi, Bruno Pouliquen, Mohamed Ebrahim & Ralf Steinberger (2010). Adapting a resource-light highly multilingual Named Entity Recognition system to Arabic. In: Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC'2010), pp. 563-567. Valletta, Malta, 19-21 May 2010.
  • Kabadjov Mijail, Josef Steinberger, Ralf Steinberger, Massimo Poesio & Bruno Pouliquen (2010). Enhancing N-Gram-based Summary Evaluation Using Information Content and a Taxonomy. In: Proceedings of the 32nd European Conference on Information Retrieval Research (ECIR'2010). Milton Keynes, UK, 28-31 March 2010. In: C. Gurrin et al. (eds): Lecture Notes in Computer Science, Vol. 5993, pp. 662-666. Springer. (Purchase Online)
  • Zavarella Vanni, Hristo Tanev, Jens Linge, Jakub Piskorski, Martin Atkinson & Ralf Steinberger (2010). Exploiting Multilingual Grammars and Machine Learning Techniques to Build an Event Extraction System for Portuguese. In: Proceedings of the International Conference on Computational Processing of Portuguese Language (PROPOR'2010), Porto Alegre, Brazil, 27-30 April 2010. Springer Lecture Notes for Artificial Intelligence, Vol. 6001, pp. 21-24. Springer.
  • Steinberger Ralf & Bruno Pouliquen (2009). Cross-lingual Named Entity Recognition. In: Satoshi Sekine & Elisabete Ranchhod (eds.): Named Entities - Recognition, Classification and Use, Benjamins Current Topics, Volume 19, pp. 137-164. John Benjamins Publishing Company. ISBN 978-90-272-8922 3.
  • Steinberger Ralf, Bruno Pouliquen & Erik van der Goot (2009). An Introduction to the Europe Media Monitor Family of Applications. In: Fredric Gey, Noriko Kando & Jussi Karlgren (eds.): Information Access in a Multilingual World - Proceedings of the SIGIR 2009 Workshop (SIGIR-CLIR'2009), pp. 1-8. Boston, USA. 23 July 2009.
  • Pouliquen Bruno & Ralf Steinberger (2009). Automatic Construction of Multilingual Name Dictionaries. In: Cyril Goutte, Nicola Cancedda, Marc Dymetman & George Foster (eds.): Learning Machine Translation. MIT Press - Advances in Neural Information Processing Systems Series (NIPS).
  • Koehn Philipp, Alexandra Birch & Ralf Steinberger (2009). 462 Machine Translation Systems for Europe. In: Laurie Gerber, Pierre Isabelle, Roland Kuhn, Nick Bemish, Mike Dillinger & Marie-Josée Goulet (eds.): Proceedings of the Twelfth Machine Translation Summit (MT-Summit XII), pages 65-72. Ottawa, Canada, 26-30 August 2009.
  • Tanev Hristo, Vanni Zavarella, Jens Linge, Mijail Kabadjov, Jakub Piskorski, Martin Atkinson & Ralf Steinberger (2009). Exploiting Machine Learning Techniques to Build an Event Extraction System for Portuguese and Spanish. In: linguaMÁTICA Journal:2, pp. 55-66. Available at: http://linguamatica.com/index.php/linguamatica/article/view/37.
  • Balahur-Dobrescu Alexandra & Ralf Steinberger (2009). Rethinking sentiment analysis in the news: from theory to practice and back. 'Workshop on Opinion Mining and Sentiment Analysis' (WOMSA), held at the 2009 CAEPIA-TTIA 13th Conference of the Spanish Association for Artificial Intelligence, pp. 1-12. Sevilla, Spain, 13.11.2009.
  • Steinberger Ralf (2009). Preface. In: Tadić Marco, Bojana Dalbelo Bašić, Marie-Francine Moens (eds.): Technologies for the Processing and Retrieval of Semi-Structured Documents - Experience from the CADIAL Project, pp. vii-ix. Croatian Language Technologies Society, Zagreb, Croatia. (Table-of-Contents; Cover)
  • Balahur-Dobrescu Alexandra, Mijail Kabadjov, Josef Steinberger, Ralf Steinberger & Andrés Montoyo (2009). Summarizing Opinions in Blog Threads. Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation (PACLIC), pp. 606-613, Hong Kong, 3-5 December 2009.
  • Steinberger Josef, Mijail Kabadjov, Bruno Pouliquen, Ralf Steinberger & Massimo Poesio (2009). WB-JRC-UT's Participation in TAC 2009: Update Summarization and AESOP Tasks. In: Proceedings of the Text Analysis Conference 2009 (TAC'2009). National Institute of Standards and Technology, Gaithersburg, Maryland USA, 16-17 November 2009.
  • Kabadjov Mijail, Josef Steinberger, Bruno Pouliquen, Ralf Steinberger & Massimo Poesio (2009). Multilingual Statistical News Summarisation: Preliminary Experiments with English. Proceedings of 'IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology', pp. 519-522; Workshop 'Intelligent Analysis and Processing of Web News Content' (IAPWNC). Milano, Italy, 15.09.2009. (PDF)
  • Balahur Alexandra, Ralf Steinberger, Erik van der Goot, Bruno Pouliquen & Mijail Kabadjov (2009). Opinion Mining on Newspaper Quotations. Proceedings of 'IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology', pp. 523-526; Workshop 'Intelligent Analysis and Processing of Web News Content' (IAPWNC). Milano, Italy, 15.09.2009.
  • Steinberger Ralf (2009). Linking News Content Across Languages. In: Kristiina Jokinen & Eckhard Bick (eds.) NEALT Proceedings Series Vol.4 - Proceedings of the 17th Nordic Conference of Computational Linguistics (NODALIDA'2009), p. 4-5, Odense, Denmark, 14-16 May 2009.
  • Linge Jens, Ralf Steinberger, Thomas Weber, Roman Yangarber, Erik van der Goot, Delilah Al Khudhairy & Nikolaos Stilianakis (2009). Internet Surveillance Systems for Early Alerting of Health Threats. EuroSurveillance Vol. 14, Issue 13. Stockholm, Sweden, 2 April 2009. (PDF)
  • Yangarber Roman, Peter von Etter & Ralf Steinberger (2009). Automatic Epidemiological Surveillance from On-line News in MedISys and PULS. Proceedings of the International Meeting on Emerging Diseases and Surveillance (IMED'2009), Vienna, 13-16 February 2009.
  • Norguet Jean-Pierre, Esteban Zimányi & Ralf Steinberger (2009). Semantic analysis of web site audience by integrating web usage mining and web content mining. In I-Hsien Ting (editor): Web Mining Applications in E-commerce and E-services, Springer Verlag book series Studies in Computational Intelligence, October 2008. (Purchase online)
  • Steinberger Ralf, Pouliquen Bruno & Camelia Ignat (2008). Using language-independent rules to achieve high multilinguality in Text Mining. In: Fogelman-Soulié Françoise, Domenico Perrotta, Jakub Piskorski & Ralf Steinberger (eds.): Mining Massive Data Sets for Security. pp. 217-240. IOS Press, Amsterdam, The Netherlands.
  • Steinberger Ralf, Flavio Fuart, Erik van der Goot, Clive Best, Peter von Etter & Roman Yangarber (2008). Text Mining from the Web for Medical Intelligence. In: Fogelman-Soulié Françoise, Domenico Perrotta, Jakub Piskorski & Ralf Steinberger (eds.): Mining Massive Data Sets for Security. pp. 295-310. IOS Press, Amsterdam, The Netherlands.
  • Fogelman-Soulié Françoise , Perrotta Domenico, Jakub Piskorski & Ralf Steinberger (eds.) (2008): Mining Massive Data Sets for Security. IOS Press, Amsterdam, The Netherlands.
  • Best Clive, Jakub Piskorski, Bruno Pouliquen, Ralf Steinberger & Hristo Tanev (2008). Automatic Event Extraction for the Security Domain. In: Intelligence and Security Informatics - Techniques and Applications, Volume 135/2008, pp. 17-43, Studies in Computational Intelligence Series, Springer, Heidelberg/New York. (Purchase online)
  • Pouliquen Bruno & Ralf Steinberger (2008). Story tracking: linking similar news over time and across languages. In Proceedings of the 2nd workshop Multi-source Multilingual Information Extraction and Summarization (MMIES'2008) held at CoLing'2008. Manchester, UK, 23 August 2008.
  • Atkinson Martin, Jakub Piskorski, Bruno Pouliquen, Ralf Steinberger, Hristo Tanev & Vani Zavarella (2008). Online-monitoring of security-related events. In Proceedings of the 22nd International Conference on Computational Linguistics (CoLing'2008). Manchester, UK, 18-22 August 2008. (PDF)
  • Steinberger Ralf, Flavio Fuart, Bruno Pouliquen & Erik van der Goot (2008). MedISys: A Multilingual Media Monitoring Tool for Medical Intelligence and Early Warning. In: Proceedings of the International Disaster and Risk Conference (IDRC'2008), pp. 612-614, Davos, Switzerland.
  • Yangarber Roman, Peter von Etter & Ralf Steinberger (2008). Content Collection and Analysis in the Domain of Epidemiology. In Proceedings of the 1st international MIE'2008 workshop on describing medical web resources (DRMed), held at the 21st International Congress of the European Federation for Medical Informatics. Göteborg, Sweden, 27 May 2008. (PDF)
  • Steinberger Ralf & Bruno Pouliquen (2007). Cross-lingual Named Entity Recognition. In: Satoshi Sekine & Elisabete Ranchhod (eds.) Journal Linguisticae Investigationes, Special Issue on Named Entity Recognition and Categorisation, LI 30:1, pp. 135-162. John Benjamins Publishing Company. ISSN 0378-4169.
  • Pouliquen Bruno, Ralf Steinberger, Clive Best (2007). Automatic detection of quotations in multilingual news. Proceedings of the International Conference Recent Advances in Natural Language Processing (RANLP'2007), pp. 487-492. Borovets, Bulgaria, 27-29 September 2007.
  • Pouliquen Bruno, Ralf Steinberger, Jenya Belyaeva (2007). Multilingual multi-document continuously updated social networks. Proceedings of the Workshop Multi-source Multilingual Information Extraction and Summarization (MMIES'2007) held at RANLP'2007, pp. 25-32. Borovets, Bulgaria, 26 September 2007. (PDF)
  • Yangarber Roman, Clive Best, Peter von Etter, Flavio Fuart, David Horby & Ralf Steinberger (2007). Combining Information about Epidemic Threats from Multiple Sources. Proceedings of the Workshop Multi-source Multilingual Information Extraction and Summarization (MMIES'2007) held at RANLP'2007, pp. 41-48. Borovets, Bulgaria, 26 September 2007.
  • Pouliquen Bruno & Ralf Steinberger (2007). Acquisition and Use of Multilingual Name Dictionaries. Proceedings of the Workshop Acquisition and Management of Multilingual Lexicons (AMML'2007) held at RANLP'2007. Borovets, Bulgaria, 26 September 2007.
  • Piskorski Jakub, Hristo Tanev, Bruno Pouliquen & Ralf Steinberger (eds.) (2007). Proceedings of the Workshop on Balto-Slavonic Natural Language Processing 2007 (BSNLP'2007) - Special Theme: Information Extraction and Enabling Technologies. Held at the 45th Annual Meeting of the Association for Computational Linguistics (ACL'2007). Prague, Czech Republic, 29 June 2007. (PDF of the Preface) (Full BSNLP Proceedings)
  • Steinberger Ralf,  Bruno Pouliquen, Anna Widiger, Camelia Ignat, Tomaž Erjavec, Dan Tufiş, Dániel Varga (2006). The JRC-Acquis: A multilingual aligned parallel corpus with 20+ languages. Proceedings of the 5thInternational Conference on Language Resources and Evaluation (LREC'2006), pp. 2142-2147. Genoa, Italy, 24-26 May 2006.
  • Pouliquen Bruno, Marco Kimler, Ralf Steinberger,  Camelia Ignat, Tamara Oellinger, Ken Blackler, Flavio Fuart, Wajdi Zaghouani, Anna Widiger, Ann-Charlotte Forslund, Clive Best (2006). Geocoding multilingual texts: Recognition, Disambiguation and Visualisation. Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC'2006), pp. 53-58. Genoa, Italy, 24-26 May 2006.
  • Norguet Jean-Pierre, Esteban Zimányi & Ralf Steinberger (2006). Semantic analysis of web site audience. 21st Annual ACM Symposium on Applied Computing (ACM SAC'2006), Dijon, France, 23-27.04.2006. Pages 525-529.
  • Žižka Jan, Jiří Hroza, Bruno Pouliquen, Camelia Ignat & Ralf Steinberger (2006). The selection of electronic text documents supported by only positive examples. Proceedings of the 8th International Conference on the Statistical Analysis of Textual Data (JADT'2006). Besançon, 19-21 April 2006.
  • Pouliquen Bruno, Ralf Steinberger, Camelia Ignat & Tamara Oellinger (2006). Building and displaying name relations using automatic unsupervised analysis of newspaper articles. Proceedings of the 8th International Conference on the Statistical Analysis of Textual Data (JADT'2006). Besançon, 19-21 April 2006.
  • Best Clive, Bruno Pouliquen, Ralf Steinberger, Eric van der Goot, Ken Blackler, Flavio Fuart, Tamara Oellinger & Camelia Ignat (2006). Towards automatic event tracking. In: Sharad Mehrota, Daniel Zeng, Hsinchun Chen, Bhavani Thuraisingham & Fei-Yue Wang (Eds.): Intelligence and Security Informatics - Proceedings of IEEE International Conference on Intelligence and Security Informatics (ISI'2006), San Diego, California, USA, 23-24.05.2006. Lecture Notes in Computer Science, LNCS 3975, pp. 26-34. Springer-Verlag, Berlin Heidelberg, New York. ISBN: 978-3-540-34478-0.
  • Norguet Jean-Pierre, Esteban Zimányi & Ralf Steinberger (2006). Improving web sites with web usage mining, web content mining, and semantic analysis. In: Jirí Wiedermann, Gerard Tel, Jaroslav Pokorný, Mária Bieliková, Július Štuller (Eds.): SOFSEM 2006: Theory and Practice of Computer Science. 32nd Conference on Current Trends in Theory and Practice of Computer Science, Merin, Czech Republic, 21.-27.01.2006. Lecture Notes in Computer Science, LNCS 3831, pages 430-439. ISBN: 978-3-540-31198-0. Springer-Verlag, Berlin, Heidelberg, New York.
  • Steinberger Ralf, Bruno Pouliquen, Camelia Ignat (2005). Navigating multilingual news collections using automatically extracted information. Journal of Computing and Information Technology - CIT 13, 2005, 4, 257-264. Available online at: http://cit.zesoi.fer.hr/downloadPaper.php?paper=767. ISSN: 1330-1136.
  • Pouliquen Bruno, Ralf Steinberger, Camelia Ignat, Irina Temnikova, Anna Widiger, Wajdi Zaghouani & Jan Žižka (2005). Multilingual person name recognition and transliteration. Journal CORELA - Cognition, Représentation, Langage. Numéros spéciaux, Le traitement lexicographique des noms propres. Available online at: http://edel.univ-poitiers.fr/corela/document.php?id=490. ISSN 1638-5748.
  • Erjavec Tomaž, Camelia Ignat, Bruno Pouliquen & Ralf Steinberger (2005). Massive multilingual corpus compilation: Acquis Communautaire and totale. Journal Archives of Control Sciences, Volume 15(LI), 2005, No. 4, pages 529-540.
  • Steinberger Ralf, Bruno Pouliquen, Camelia Ignat (2005). Navigating multilingual news collections using automatically extracted information. In: Vesna Lužar-Stiffler & Vesna Hljuz Dobrić (Eds.): Proceedings of the 27th International Conference 'Information Technology Interfaces' (ITI'2005), pp. 27-34. Cavtat / Dubrovnik, Croatia, June 20-23, 2005.
  • Montejo-Ráez Arturo, L. Alfonso Ureña-López & Ralf Steinberger (2005). Text categorisation using bibliographic records: beyond document content. Procesamiento del Lenguaje Natural, núm. 35 (2005), pp. 119-126. Proceedings of the 21st Conference of the Spanish Society for Natural Language Processing (SEPLN'2005). Granada, Spain, 14-16 September 2005.
  • Ignat Camelia, Bruno Pouliquen, Ralf Steinberger & Tomaž Erjavec (2005). A tool set for the quick and efficient exploration of large document collections. Proceedings of the Symposium on Safeguards and Nuclear Material Management. 27th Annual Meeting of the European SAfeguards Research and Development Association (ESARDA-2005). London, UK, 10-12 June 2005.
  • Tomaž Erjavec, Camelia Ignat, Bruno Pouliquen & Ralf Steinberger (2005). Massive multilingual corpus compilation; Acquis Communautaire and totale. In: 2nd Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics (L&T'05). Poznań, Poland, 21-23 April 2005.
  • Pouliquen Bruno, Ralf Steinberger, Camelia Ignat, Irina Temnikova, Wajdi Zaghouani & Jan Žižka (2005). Detection of person names and their translations in multilingual news. Colloque Traitement lexicographique des noms propres, Tours, 24 March 2005.
  • Best Clive, Erik van der Goot, Ken Blackler, Teófilo Garcia, David Horby, Ralf Steinberger and Bruno Pouliquen (2005). Mapping World Events. In: Peter van Oosterom, Siyka Zlatanova & Elfriede M. Fendel (eds.) Geo-information for Disaster Management. pp. 683-696. Springer. ISBN: 3-540-24988-5.
  • Pouliquen Bruno, Ralf Steinberger & Camelia Ignat (2004). Automatic Linking of Similar Texts Across Languages. In:  N. Nicolov, K. Bontcheva, G. Angelova & R. Mitkov (eds.): Current Issues in Linguistic Theory 260 - Recent Advances in Natural Language Processing III. Selected Papers from RANLP'2003. John Benjamins Publishers, Amsterdam.
  • Steinberger Ralf, Pouliquen Bruno & Camelia Ignat (2004). Providing cross-lingual information access with knowledge-poor methods. In: Informatica. An international Journal of Computing and Informatics. Volume 28. Special Issue.
  • Montejo-Ráez Arturo & Ralf Steinberger (2004). Why keywording matters. In. High Energy Physics Libraries Webzine, Issue 10, December 2004. Available at http://library.cern.ch/HEPLW/10/papers/2/. (PDF)
  • Ralf Steinberger, Pouliquen Bruno & Camelia Ignat (2004). Exploiting Multilingual Nomenclatures and Language-Independent Text Features as an Interlingua for Cross-lingual Text Analysis Applications. In: Proceedings of the 4th Slovenian Language Technology Conference. Information Society 2004 (IS'2004). Ljubljana, Slovenia, 13-14 October 2004. (PDF)
  • Montejo-Ráez Arturo, Luís Alfonso Ureña-López, Ralf Steinberger (2004). Adaptive selection of base classifiers in one-against-all learning for large multi-labeled collections. In: J.L. Vicedo, P. Martínez-Barco, R. Muñoz et al. (eds). Advances in Natural Language Processing: 4th International Conference, España for Natural Language Processing (EsTAL'2004), Proceedings, Alicante, Spain, 20-22 October 2004. Springer Lecture Notes in Computer Science, LNCS 3230, pages 1-12. Springer-Verlag, Berlin Heidelberg. ISBN: 3-540-23498-5. (PDF)
  • Pouliquen Bruno, Ralf Steinberger, Camelia Ignat, Emilia Käsper & Irina Temnikova (2004). Multilingual and Cross-lingual News Topic Tracking. In: Proceedings of the 20th International Conference on Computational Linguistics (CoLing'2004). Geneva, Switzerland, 23-27 August 2004. (PDF)
  • Pouliquen Bruno, Ralf Steinberger, Camelia Ignat & Tom de Groeve (2004). Geographical Information Recognition and Visualisation in Texts Written in Various Languages. In: Proceedings of the 19th Annual ACM Symposium on Applied Computing (SAC'2004), Special Track on Information Access and Retrieval (SAC-IAR), vol. 2, pp. 1051-1058. Nicosia, Cyprus, 14 - 17 March 2004.
  • Pouliquen Bruno, Ralf Steinberger & Camelia Ignat (2003). Automatic Identification of Document Translations in Large Multilingual Document Collections. In: Proceedings of the International Conference Recent Advances in Natural Language Processing (RANLP'2003), pp. 401-408. Borovets, Bulgaria, 10 - 12 September 2003. (PDF)
  • Ignat Camelia, Bruno Pouliquen, António Ribeiro & Ralf Steinberger (2003). Extending an Information Extraction Tool Set to Central and Eastern European Languages. In: Proceedings of the International Workshop Information Extraction for Slavonic and other Central and Eastern European Languages (IESL'2003), held at RANLP'2003, pp. 33-39. Borovets, Bulgaria, 8 - 9 September 2003. (PDF)
  • Pouliquen Bruno, Steinberger Ralf, Camelia Ignat (2003). Automatic Annotation of Multilingual Text Collections with a Conceptual Thesaurus. In: Proceedings of the Workshop Ontologies and Information Extraction at the Summer School The Semantic Web and Language Technology - Its Potential and Practicalities (EUROLAN'2003). Bucharest, Romania, 28 July - 8 August 2003 (PDF).
  • Steinberger Ralf, Bruno Pouliquen, Stefan Scheer & António Ribeiro (2003). Continuous Multi-Source Information Gathering and Classification. In: Proceedings of the International Conference on Computational Intelligence for Modelling, Control and Automation (CIMCA'2003). Vienna (A), 12-14 February 2003 (PDF).
  • Steinberger Ralf, Bruno Pouliquen & Johan Hagman (2002). Cross-lingual Document Similarity Calculation Using the Multilingual Thesaurus Eurovoc. In: A. Gelbukh (ed.) Computational Linguistics and Intelligent Text Processing, Third International Conference, CICLing'2002. Springer Lecture Notes in Computer Science, LNCS 2276, pp. 415-424. Mexico-City, Mexico, 17-23 February 2002. Springer-Verlag, Berlin Heidelberg. ISBN: 3-540-43219-1. (PDF).
  • Steinberger Ralf  (2001). Cross-lingual Keyword Assignment. Proceedings of the XVII Congress of the Spanish Society for Natural Language Processing (SEPLN'2001). Procesamiento del Lenguaje Natural, Revista No 27, pp. 273-280. Jaén, Spain, September 2001. ISSN 1135-5948. (PDF).
  • Steinberger Ralf, Stefan Scheer & Johan Hagman (2001). Language Engineering. ISIS Annual Report 2000, pages 47-48. Office for Official Publications of the European Communities, Luxembourg, 2001. ISBN 92-894-0602-X.
  • Steinberger Ralf, Johan Hagman & Stefan Scheer (2000). Using Thesauri for Information Extraction and for the Visualisation of Multilingual Document Collections. Proceedings of the Workshop on Ontologies and Lexical Knowledge Bases (OntoLex’2000), pp. 130-141. Sozopol, Bulgaria, September 2000. (PDF)
  • Hagman Johan, Domenico Perrotta, Ralf Steinberger & Aristide Varfis (2000). Document Classification and Visualisation to Support the Investigation of Suspected Fraud. Working Notes of the Workshop on Machine Learning and Textual Information Access (MLTIA) at the Fourth European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD’2000), 12 pages. Lyon, September 2000. (PDF, Cover)
  • Garg Anjula, Thomas Barbas & Ralf Steinberger (2000). Information Management and Computer Communications for Anti-Fraud. ISIS Annual Report 1999, pages 79-80. Office for Official Publications of the European Communities, Luxembourg, 2000. ISBN 92-828-9029-5.
  • Barbas Thomas & Ralf Steinberger (1999). New Information Infrastructures. ISIS Annual Report 1998, pages 25-26. Office for Official Publications of the European Communities, Luxembourg, 1999. ISBN 92-828-6645-9.
  • Hagman Johan, Ralf Steinberger, Domenico Perrotta & Aristide Varfis (1999). Approaches to document classification and visualisation. Working Notes of the Workshop on Text Mining at the Sixth International Joint Conference on Artificial Intelligence (IJCAI'99), pages 36-37. Stockholm, August 1999. JRC reference number: ORA 60278. (PDF)
  • Sanfilippo Antonio & Ralf Steinberger (1997): Automatic Selection and Ranking of Translation Candidates. Proceedings of the 7th Conference on Theoretical and Methodological Issues in Machine Translation: "MT Yesterday, Today, and Tomorrow" (TMI'97), Santa Fe, New Mexico, USA. (PDF)
  • Steinberger Ralf (1994): Treating `Free Word Order' in Machine Translation. In: Proceedings of the 15th International Conference for Computational Linguistics (COLING 1994), Vol. I, pp. 69-75, Kyoto, Japan. (PDF)
  • Steinberger Ralf (1994): Lexikoneinträge für deutsche Adverbien (Dictionary Entries for German Adverbs). In: Harald Trost (Hg.): Informatik Xpress 6: Tagungsband KONVENS '94 Verarbeitung natürlicher Sprache (2. Konferenz zur Verarbeitung natürlicher Sprache), pages 320-329, Vienna. (PDF)
  • Steinberger Ralf & Paul Bennett (1994): Automatic Recognition of Theme, Focus and Contrastive Stress. In: Peter Bosch & Rob van der Sandt (eds.): Focus and Natural Language Processing, Proceedings of a conference in celebration of the 10th anniversary of the Journal of Semantics, Working Paper 6 of the IBM Institute for Logic and Linguistics, Vol. I, pages 205-214, Meinhard-Schwebda (Germany). (PDF)
  • Steinberger Ralf () (1994): ヨーロッパの現在のMT活動 (Current MT Activities in Europe). In: AAMT Journal - The Asia-Pacific Association for Machine Translation, No. 7, June 1994, pages 10-14, Tokyo (An English version appeared in the English edition of the AAMT Journal). (PDF)
  • Steinberger Ralf (1994): A study of German word order in German, with special reference to modifier placement. Ph.D. Thesis, Umist, Manchester, UK. (PDF)
  • Steinberger Ralf (1993): Grenzen und Möglichkeiten der Maschinellen Übersetzung (Machine Translation: Prospects and Limitations). In: Informatik Forum - Fachzeitschrift für Informatik, Band 7, Doppelheft 1/2, 6/93, Vienna, Austria.
  • Steinberger Ralf (1992): Beschreibung der Adverbstellung im deutschen und englischen Satz im Hinblick auf Maschinelle Übersetzung (Adverb placement in German and English with special reference to Machine Translation). EUROTRA-D Working Paper 23, Saarbrücken (IAI), 2/92 (47 pages) (PDF)
  • Steinberger Ralf (1992): Der Skopus von Gradpartikeln: Seine Übersetzung und seine Implementierung im Maschinellen Übersetzungssystem CAT2 (Scope of degree modifiers: Translation and implementation in the CAT2 MT formalism). EUROTRA-D Working Paper 24, Saarbrücken (IAI), 4/92 (35 pages). (PDF)

Reports (Restricted Distribution)     Please contact the author for a copy

  • Best Clive, Ralf Steinberger & Stamatia Halkia (2007). Web Mining and Intelligence (EMM) - Support to External Security Unit. Activity Report 2005/2006. European Communities 2007. 17 pages. ISBN 92-79-03400-6.
  • Ribeiro António & Ralf Steinberger (2004). IDoRA for OLAF - Final project report. JRC Technical Note, 23 pages. March 2004.
  • Steinberger Ralf & Bruno Pouliquen (2003). Cross-lingual Indexing. Final Report for the IPSC Exploratory Research Project. JRC Internal Note, 30 pages. October 2003. (PDF)
  • Pedersen Jane & Ralf Steinberger (2002).Evaluation of Multilingual Name Recognition Software - Thing Finder (TM) 2.2. JRC Technical Note No. I.02.120, 29 pages. December 2002 (PDF).
  • Scheer Stefan, Ralf Steinberger & Giovanni Valerio (2000): A Methodology to Retrieve, to Manage, to Classify and to Query Open Source Information - Results of the OSILIA Project. JRC Technical Note No. I.01.016. 35 pages. (PDF)
  • Steinberger Ralf (2000): Evaluation of DMP's Linguistic Software - Comments on the linguistic software distributed by Document Management Partners (DMP) in Antwerp (B). Report for OLAF. 16 pages.
  • Steinberger Ralf, Johan Hagman & Thomas Barbas (2000): Modus Operandi Final Project Report – Summary and Conclusions. JRC Technical Note No. I00.88. 17 pages.
  • Steinberger Ralf (2000): Software Solutions to Overcome the Language Barrier. JRC Technical Note No. I.00.91. 10 pages.
  • Steinberger Ralf & Johan Hagman (2000): Commercial Keyword Identification and Clustering Software. JRC Technical Note No. I.00.90. 19 pages.
  • Steinberger Ralf (2000): The Free Text Field of the IRENE Database. JRC Technical Note No. I.00.89. 28 pages.
  • Steinberger Ralf (2000): Fraud-Related Multi-Word Expressions - English, French and German. Modus Operandi deliverable 7. 50 pages.
  • Hagman Johan & Ralf Steinberger (1999): Clustering of 1500 IRENE Record Text Files. Modus Operandi deliverable 15. 50 pages.
  • Steinberger Ralf (1999). Language Engineering Technologies and their use for TF-UCLAF. JRC Technical Note No. I.99.83. 28 pages.
  • Steinberger Ralf (1997): Multilingual Phrase Book Design Study - Issues regarding the extension of the Japanese-English Phrase Book to German, French, Spanish and Italian. Sharp internal document. 15 pages.
  • Johnson Ian, Osamu Nishida, Junzo Ogawa, Ralf Steinberger (1997): Multilingual Phrase Book Data Format (v. 3) - Representation of the Multilingual Phrase Book data, Sharp internal document. 23 pages.
  • Steinberger Ralf (1997): Multilingual Phrase Book Instructions (v. 2) - Task description for translators, Sharp internal document.
  • Steinberger Ralf (1997): Sharp Abridgement Machine (SAM), Sharp internal document. 9 pages.
  • Steinberger Ralf (1997): Multilingual Document Generator - Instructions, Sharp internal document. 16 pages.
  • Steinberger Ralf (1996): Conversion of machine-readable dictionaries to electronic dictionaries. Sharp internal document. 20 pages.
  • Steinberger Ralf (1995): Evaluation of the Sharp Intelligent Dictionary (SID). Sharp internal document. 11 pages.
  • Steinberger Ralf, Chris Chambers, Ingrid Weber & Blaise Nkwenti-Azeh (1994): English Coverage Definition. Internal report Nr. 6 for the MLAP project TRADE on the linguistic phenomena occurring in a legal social security text, 34 pages, Barcelona
  • Steinberger Ralf & Chris Chambers (1994): English Test Suite. Internal report Nr. 7 for the MLAP project TRADE including a suite of sentences for the testing of the TRAnslation DEmonstrator, 15 pages, Barcelona
  • Mazzini Gianpaolo, Maite Melero & Ralf Steinberger (1994): Corpus Study and Coverage Definition. Internal report for the MLAP project TRADE, Barcelona
  • Steinberger Ralf (1994): The Legal Sublanguage in the English Version of the `United Nations Convention on Contracts for the International Sale of Goods'. Report on work carried out at the Kyushu Institute of Technology for the project 法律エキスパートシステム (Legal Expert System), 50 pages, Iizuka, Japan
  • Steinberger Ralf (1993): Cost, Calculation & Financing: Description of the possible Cost Factors. In: S. Krauwer (ed.), A. Bech, B. Maegaard, M. Mendes, R. Steinberger & N. Underwood: How to produce an application - the long way from a brilliant idea to a commercial product, CCL Report 93/1, pages 19-37, Manchester (also appeared as EUROTRA internal paper, Luxembourg).
  • Steinberger Ralf (1993): Corpus Annotation and Use of Corpora. Internal Report for the CALL project of the Teaching and Learning Technology Programme, 4/93, 7 pages, Manchester
  • Steinberger Ralf & Cécile Potier (1992): How to deal with `empty' subjects in sentential verb complements, UMIST - CCL Report 92/14, Manchester (62 pages, also appeared as the final report of the EUROTRA Contrastive Research Cluster Sentential Complementation, Luxembourg).
  • Steinberger Ralf (1992): Empty subjects in sentential verb complements (French-English). Linguistics. EUROTRA intermediate report, 5/92 (30 pages), Luxembourg
  • Steinberger Ralf (1992): Empty subjects in sentential verb complements (French-English). Implementation. EUROTRA intermediate report, 9/92 (11 pages), Luxembourg
  • Steinberger Ralf (1992): Report of the Implementation of the French-English Transfer Module. EUROTRA final report, 12/92 (7 pages), Luxembourg

Keywords (English, German, French):
computational linguistics, text mining, information extraction, multilingual, cross-lingual, linguist, linguistics, corpus linguistics, natural language processing, medical intelligence, Computerlinguistik, Mehrsprachigkeit, natürliche Sprachverarbeitung, sprachübergreifend, Linguistik, Informationsextraktion, linguistique informatique, multilingue, traitement du langage naturel, linguistique, linguiste, extraction de l'information, linguiste.



Site Meter

Please send comments on this page to Ralf Steinberger (Email address format: Firstname.Lastname@jrc.ec.europa.eu)

Last update:  17 September 2014