GRK 1994: Adaptive Informationsaufbereitung aus heterogenen Quellen (Adaptive Preparation of Information from Heterogeneous Sources, AIPHES)
Systems Engineering
Final Report Abstract
The overall goal of AIPHES was the automatic preparation of information from heterogeneous sources, which fits the overarching objective of the participating institutions: to foster future and digital technologies that facilitate novel ways of communicating, collaborating, and optimizing processes. In particular, AIPHES targeted multi-document summarization (MDS) from heterogeneous sources, i.e., the development of automated methods for condensing multiple documents of heterogeneous nature and origin into a coherent, informative, and homogeneous summary. MDS from heterogeneous sources requires expertise from multiple disciplines. We therefore brought together researchers from computational linguistics, natural language processing, machine learning, information management, and algorithmics, who not only worked on foundational and challenging problems in their respective domains but also joined forces across disciplines to achieve this common goal. To facilitate research across areas, we established cross-disciplinary working groups (AGs) and special interest groups (SIGs) to create shared resources and to enable PhD students to acquire relevant knowledge and skills. In particular, because no German corpus for the MDS task existed, all research areas contributed to corpus collection and annotation. The qualification concept of the Research Training Group (RTG) included collaboration across disciplines and locations, intensive international networking, scientific supervision of each doctoral project by at least two PhD advisors and one international co-advisor, and the responsible participation of excellent post-doctoral researchers in doctoral supervision and training jointly with experienced advisors. The graduate program formed a central location for the qualification of young researchers in this highly sought-after academic field in the Rhine-Main-Neckar region.
Publications
- Automatic disambiguation of English puns. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, pages 719-729, Beijing, China, July 2015
Tristan Miller and Iryna Gurevych
(See online at https://doi.org/10.3115/v1/P15-1070) - Delexicalized Supervised German Lexical Substitution. In Proceedings of GermEval 2015: LexSub, pages 11-16, Duisburg-Essen, Germany, 2015
Gerold Hintz and Chris Biemann
- GermaNER: Free Open German Named Entity Recognition Tool. In Proceedings of the International Conference of the German Society for Computational Linguistics and Language Technology (GSCL), pages 31-38, Essen, Germany, 2015
Darina Benikova, Seid Muhie Yimam, Prabhakaran Santhanam, and Chris Biemann
- GermEval 2015: LexSub - A Shared Task for German-language Lexical Substitution. In Proceedings of GermEval 2015: LexSub, pages 1-9, Essen, Germany, 2015
Tristan Miller, Darina Benikova, and Sallam Abualhaija
- HITS at TAC KBP 2015: Entity discovery and linking, and event nugget detection. In Proceedings of the 8th Text Analysis Conference (TAC), Gaithersburg, MD, USA, 2015
Benjamin Heinzerling, Alex Judea, and Michael Strube
- SeqCluSum: Combining Sequential Clustering and Contextual Importance Measuring to Summarize Developing Events over Time. In Proceedings of the 24th Text Retrieval Conference (TREC), Gaithersburg, MD, USA, November 2015
Markus Zopf
- Visual Error Analysis for Entity Linking. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (ACL/IJCNLP): System Demonstrations, pages 37-42, Beijing, China, July 2015
Benjamin Heinzerling and Michael Strube
(See online at https://doi.org/10.3115/v1/P15-4007) - A General Optimization Framework for Multi-Document Summarization Using Genetic Algorithms and Swarm Intelligence. In Proceedings of the 26th International Conference on Computational Linguistics (COLING), pages 247-257, Osaka, Japan, December 2016
Maxime Peyrard and Judith Eckle-Kohler
- AIPHES-HD system at TAC KBP 2016: Neural Event Trigger Detection and Event Type and Realis Disambiguation with Word Embeddings. In Proceedings of the TAC Knowledge Base Population (KBP), pages 1-2, Gaithersburg, MD, USA, November 2016
Todor Mihaylov and Anette Frank
- Beyond Centrality and Structural Features: Learning Information Importance for Text Summarization. In Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning (CoNLL), pages 84-94, Berlin, Germany, August 2016
Markus Zopf, Eneldo Loza Mencía, and Johannes Fürnkranz
(See online at https://doi.org/10.18653/v1/K16-1009) - Bridging the gap between extractive and abstractive summaries: Creation and evaluation of coherent extracts from heterogeneous sources. In Proceedings of the 26th International Conference on Computational Linguistics (COLING), pages 1039-1050, Osaka, Japan, December 2016
Darina Benikova, Margot Mieskes, Christian M. Meyer, and Iryna Gurevych
- C4Corpus: Multilingual Web-size corpus with free license. In Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC), pages 914-922, Portoroz, Slovenia, May 2016
Ivan Habernal, Omnia Zayed, and Iryna Gurevych
- Call for Discussion: Building a New Standard Dataset for Relation Extraction Tasks. In Proceedings of the 5th Workshop on Automated Knowledge Base Construction (AKBC), pages 92-96, San Diego, CA, USA, June 2016
Teresa Martin, Fiete Botschen, Ajay Nagesh, and Andrew McCallum
(See online at https://doi.org/10.18653/v1/W16-1317) - CNN- and LSTM-based Claim Classification in Online User Comments. In Proceedings of the 26th International Conference on Computational Linguistics (COLING), pages 2740-2751, Osaka, Japan, December 2016
Chinnappa Guggilla, Tristan Miller, and Iryna Gurevych
- Data-driven Paraphrasing and Stylistic Harmonization. In Proceedings of the Student Research Workshop at the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (SRW@HLT-NAACL), pages 37-44, San Diego, CA, USA, 2016
Gerold Hintz
(See online at https://doi.org/10.18653/v1/N16-2006) - Discourse Relation Sense Classification Using Cross-argument Semantic Similarity Based on Word Embeddings. In Proceedings of the 20th Conference on Computational Natural Language Learning (CoNLL): Shared Task, pages 100-107, Berlin, Germany, August 2016
Todor Mihaylov and Anette Frank
(See online at https://doi.org/10.18653/v1/K16-2014) - EmpiriST: AIPHES Robust Tokenization and POS-Tagging for Different Genres. In Proceedings of the 10th Web as Corpus Workshop (WAC-X), pages 106-114, Berlin, Germany, August 2016
Steffen Remus, Gerold Hintz, Darina Benikova, Thomas Arnold, Judith Eckle-Kohler, Christian M. Meyer, Margot Mieskes, and Chris Biemann
(See online at https://doi.org/10.18653/v1/W16-2613) - Hunting for Troll Comments in News Community Forums. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pages 399-405, Berlin, Germany, August 2016
Todor Mihaylov and Preslav Nakov
(See online at https://doi.org/10.18653/v1/P16-2065) - Language Transfer Learning for Supervised Lexical Substitution. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL), pages 118-129, Berlin, Germany, August 2016
Gerold Hintz and Chris Biemann
(See online at https://doi.org/10.18653/v1/P16-1012) - MDSWriter: Annotation tool for creating high-quality multi-document summarization corpora. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL): System Demonstrations, pages 97-102, Berlin, Germany, August 2016
Christian M. Meyer, Darina Benikova, Margot Mieskes, and Iryna Gurevych
(See online at https://doi.org/10.18653/v1/P16-4017) - Modal Sense Classification At Large: Paraphrase-Driven Sense Projection, Semantically Enriched Classification Models and Cross-Genre Evaluations. Linguistic Issues in Language Technology, Special issue on "Modality in Natural Language Understanding", 14 (3), August 2016
Ana Marasovic, Mengfei Zhou, Alexis Palmer, and Anette Frank
- Multilingual Modal Sense Classification using a Convolutional Neural Network. In Proceedings of the 1st Workshop on Representation Learning for NLP, pages 111-120, Berlin, Germany, 2016
Ana Marasovic and Anette Frank
(See online at https://doi.org/10.18653/v1/W16-1613) - Network Motifs May Improve Quality Assessment of Text Documents. In Proceedings of TextGraphs-10: the Workshop on Graph-based Methods for Natural Language Processing, pages 20-28, San Diego, CA, USA, June 2016
Thomas Arnold and Karsten Weihe
(See online at https://doi.org/10.18653/v1/W16-1404) - Optimizing an Approximation of ROUGE - a Problem-Reduction Approach to Extractive Multi-Document Summarization. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL), volume 1: Long Papers, pages 1825-1836, Berlin, Germany, August 2016
Maxime Peyrard and Judith Eckle-Kohler
- Porting an Open Information Extraction System from English to German. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 892-898, Austin, TX, USA, November 2016
Tobias Falke, Gabriel Stanovsky, Iryna Gurevych, and Ido Dagan
(See online at https://doi.org/10.18653/v1/D16-1086) - SemanticZ at SemEval-2016 Task 3: Ranking Relevant Answers in Community Question Answering Using Semantic Similarity Based on Finetuned Word Embeddings. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval), pages 804-811, San Diego, CA, USA, 2016
Todor Mihaylov and Preslav Nakov
(See online at https://doi.org/10.18653/v1/S16-1136) - Sequential Clustering and Contextual Importance Measures for Incremental Update Summarization. In Proceedings of the 26th International Conference on Computational Linguistics (COLING), pages 1071-1082, Osaka, Japan, December 2016
Markus Zopf, Eneldo Loza Mencía, and Johannes Fürnkranz
- Super Team at SemEval-2016 Task 3: Building a Feature-Rich System for Community Question Answering. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016), pages 836-843, San Diego, CA, USA, June 2016
Tsvetomila Mihaylova, Pepa Gencheva, Martin Boyanov, Ivana Yovcheva, Todor Mihaylov, Momchil Hardalov, Yasen Kiprov, Daniel Balchev, Ivan Koychev, Preslav Nakov, Ivelina Nikolova, and Galia Angelova
(See online at https://doi.org/10.18653/v1/S16-1129) - The Next Step for Multi-Document Summarization: A Heterogeneous Multi-Genre Corpus Built with a Novel Construction Approach. In Proceedings of the 26th International Conference on Computational Linguistics (COLING), pages 1535-1545, Osaka, Japan, December 2016
Markus Zopf, Maxime Peyrard, and Judith Eckle-Kohler
- Towards the Automatic Detection and Identification of English Puns. European Journal of Humour Research, 4(1):59-75, January 2016
Tristan Miller and Mladen Turkovic
(See online at https://doi.org/10.7592/EJHR2016.4.1.miller) - A Framework for Automated Fact-Checking for Real-Time Validation of Emerging Claims on the Web. In Proceedings of the NIPS Workshop on Prioritising Online Content, page online, Long Beach, CA, USA, November 2017
Andreas Hanselowski and Iryna Gurevych
- A Mention-Ranking Model for Abstract Anaphora Resolution. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 221-232, Copenhagen, Denmark, September 2017
Ana Marasovic, Leo Born, Juri Opitz, and Anette Frank
(See online at https://doi.org/10.18653/v1/D17-1021) - A Principled Framework for Evaluating Summarizers: Comparing Models of Summary Quality against Human Judgments. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL), volume 2: Short Papers, pages 26-31, Vancouver, BC, Canada, August 2017
Maxime Peyrard and Judith Eckle-Kohler
(See online at https://doi.org/10.18653/v1/P17-2005) - An Information Nutritional Label for Online Documents. In SIGIR Forum, volume 51, pages 46-66, December 2017
Norbert Fuhr, Anastasia Giachanou, Gregory Grefenstette, Iryna Gurevych, Andreas Hanselowski, Kalervo Jarvelin, Rosie Jones, Yiqun Liu, Josiane Mothe, Isabella Peters, and Benno Stein
(See online at https://doi.org/10.1145/3190580.3190588) - Bringing Structure into Summaries: Crowdsourcing a Benchmark Corpus of Concept Maps. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 2951-2961, Copenhagen, Denmark, September 2017
Tobias Falke and Iryna Gurevych
(See online at https://doi.org/10.18653/v1/D17-1320) - Concept-Map-Based Multi-Document Summarization using Concept Coreference Resolution and Global Importance Optimization. In Proceedings of the 8th International Joint Conference on Natural Language Processing (IJCNLP), pages 801-811, Taipei, Taiwan, November 2017
Tobias Falke, Christian M. Meyer, and Iryna Gurevych
- Experimental study of multimodal representations for Frame Identification - How to find the right multimodal representations for this task? In Language-Learning-Logic Workshop (3L 2017), London, UK, September 2017
Teresa Botschen, Hatem Mousselly-Sergieh, and Iryna Gurevych
- GraphDocExplore: A Framework for the Experimental Comparison of Graph-based Document Exploration Techniques. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP): System Demonstrations, pages 19-24, Copenhagen, Denmark, September 2017
Tobias Falke and Iryna Gurevych
(See online at https://doi.org/10.18653/v1/D17-2004) - Is Interaction More Important Than Individual Performance? A Study of Motifs in Wikia. In Proceedings of the 26th International Conference Companion on World Wide Web, pages 1609-1617, Perth, Australia, April 2017
Thomas Arnold, Johannes Daxenberger, Karsten Weihe, and Iryna Gurevych
(See online at https://doi.org/10.1145/3041021.3053362) - Joint Optimization of User-desired Content in Multi-document Summaries by Learning from User Feedback. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL), volume 1: Long Papers, pages 1353-1363, Vancouver, BC, Canada, August 2017
Avinesh P. V. S. and Christian M. Meyer
(See online at https://doi.org/10.18653/v1/P17-1124) - Large-Scale Goodness Polarity Lexicons for Community Question Answering. In SIGIR ’17 Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 1185-1188, Shinjuku, Tokyo, Japan, August 2017
Todor Mihaylov, Daniel Balchev, Yasen Kiprov, Ivan Koychev, and Preslav Nakov
(See online at https://doi.org/10.1145/3077136.3080757) - Learning to Score System Summaries for Better Content Selection Evaluation. In Proceedings of the EMNLP workshop "New Frontiers in Summarization", pages 74-84, Copenhagen, Denmark, September 2017
Maxime Peyrard, Teresa Botschen, and Iryna Gurevych
(See online at https://doi.org/10.18653/v1/W17-4510) - LSDSem 2017: Exploring Data Generation Methods for the Story Cloze Test. In Proceedings of the 2nd Workshop on Linking Models of Lexical, Sentential and Discourse-level Semantics (LSDSem), pages 56-61, Valencia, Spain, April 2017
Michael Bugert, Yevgeniy Puzikov, Andreas Rücklé, Judith Eckle-Kohler, Teresa Martin, Eugenio Martínez-Cámara, Daniil Sorokin, Maxime Peyrard, and Iryna Gurevych
(See online at https://doi.org/10.18653/v1/W17-0908) - Neural Skill Transfer from Supervised Language Tasks to Reading Comprehension. In Proceedings of the NIPS Workshop on Learning with Limited Labeled Data: Weak Supervision and Beyond, Long Beach, CA, USA, 2017
Todor Mihaylov, Zornitsa Kozareva, and Anette Frank
- Out-of-domain FrameNet Semantic Role Labeling. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL), pages 471-482, Valencia, Spain, April 2017
Silvana Hartmann, Ilia Kuznetsov, Teresa Martin, and Iryna Gurevych
- Prediction of Frame-to-Frame Relations in the FrameNet Hierarchy with Frame Embeddings. In Proceedings of the 2nd ACL Workshop on Representation Learning for NLP (RepL4NLP), pages 146-156, Vancouver, BC, Canada, August 2017
Teresa Botschen, Hatem Mousselly-Sergieh, and Iryna Gurevych
(See online at https://doi.org/10.18653/v1/W17-2618) - Revisiting Selectional Preferences for Coreference Resolution. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 1343-1350, Copenhagen, Denmark, 2017
Benjamin Heinzerling, Nafise Sadat Moosavi, and Michael Strube
(See online at https://doi.org/10.18653/v1/D17-1138) - Story Cloze Ending Selection Baselines and Data Examination. In Proceedings of the 2nd Workshop on Linking Models of Lexical, Sentential and Discourse-level Semantics (LSDSem), pages 87-92, Valencia, Spain, April 2017
Todor Mihaylov and Anette Frank
(See online at https://doi.org/10.18653/v1/W17-0913) - Supervised Learning of Automatic Pyramid for Optimization-Based Multi-Document Summarization. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL), volume 1: Long Papers, pages 1084-1094, Vancouver, BC, Canada, August 2017
Maxime Peyrard and Judith Eckle-Kohler
(See online at https://doi.org/10.18653/v1/P17-1100) - Trust, but Verify! Better Entity Linking through Automatic Verification. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL), pages 828-838, Valencia, Spain, April 2017
Benjamin Heinzerling, Michael Strube, and Chin-Yew Lin
- Utilizing Automatic Predicate-Argument Analysis for Concept Map Mining. In Proceedings of the 12th International Conference on Computational Semantics (IWCS), volume 2: Short papers, Montpellier, France, September 2017
Tobias Falke and Iryna Gurevych
- A Multimodal Translation-Based Approach for Knowledge Graph Representation Learning. In Proceedings of the 7th Joint Conference on Lexical and Computational Semantics (*SEM), pages 225-234, New Orleans, LA, USA, April 2018
Hatem Mousselly Sergieh, Teresa Botschen, Iryna Gurevych, and Stefan Roth
(See online at https://doi.org/10.18653/v1/S18-2027) - A Retrospective Analysis of the Fake News Challenge Stance-Detection Task. In Proceedings of the 27th International Conference on Computational Linguistics (COLING), pages 1859-1874, Santa Fe, NM, USA, August 2018
Andreas Hanselowski, Avinesh P. V. S., Benjamin Schiller, Felix Caspelherr, Debanjan Chaudhuri, Christian M. Meyer, and Iryna Gurevych
- Advanced Motif Analysis on Text Induced Graphs. PhD thesis, Technische Universität, Darmstadt, May 2018
Thomas Otmar Arnold
- ArgumenText: Searching for Arguments in Heterogeneous Sources. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL): System Demonstrations, pages 21-25, New Orleans, Louisiana, June 2018
Christian Stab, Johannes Daxenberger, Chris Stahlhut, Tristan Miller, Benjamin Schiller, Christopher Tauchmann, Steffen Eger, and Iryna Gurevych
(See online at https://doi.org/10.18653/v1/N18-5005) - auto-hMDS: Automatic Construction of a Large Heterogeneous Multilingual Multi-Document Summarization Corpus. In Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC), pages 3228-3233, Miyazaki, Japan, May 2018
Markus Zopf
- Beyond Generic Summarization: A Multi-faceted Hierarchical Summarization Corpus of Large Heterogeneous Data. In Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC), pages 3184-3191, Miyazaki, Japan, May 2018
Christopher Tauchmann, Thomas Arnold, Andreas Hanselowski, Christian M. Meyer, and Margot Mieskes
- BPEmb: Tokenization-free Pre-trained Subword Embeddings in 275 Languages. In Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC), pages 2989-2993, Miyazaki, Japan, 2018
Benjamin Heinzerling and Michael Strube
- Can a Suit of Armor Conduct Electricity? A New Dataset for Open Book Question Answering. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 2381-2391, Brussels, Belgium, August 2018
Todor Mihaylov, Peter Clark, Tushar Khot, and Ashish Sabharwal
(See online at https://doi.org/10.18653/v1/D18-1260) - Concatenated Power Mean Word Embeddings as Universal Cross-Lingual Sentence Representations. March 2018
Andreas Rücklé, Steffen Eger, Maxime Peyrard, and Iryna Gurevych
(See online at https://doi.org/10.48550/arXiv.1803.01400) - Estimating Summary Quality with Pairwise Preferences. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), pages 1687-1696, New Orleans, LA, USA, June 2018
Markus Zopf
(See online at https://doi.org/10.18653/v1/N18-1152) - Frame- and Entity-Based Knowledge for Common-Sense Argumentative Reasoning. In Proceedings of the 5th Workshop on Argument Mining held in conjunction with EMNLP 2018, volume Short Papers, pages 90-96, Brussels, Belgium, August 2018
Teresa Botschen, Daniil Sorokin, and Iryna Gurevych
(See online at https://doi.org/10.18653/v1/W18-5211) - Knowledgeable Reader: Enhancing Cloze-Style Reading Comprehension with External Commonsense Knowledge. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL), pages 821-832, Melbourne, Australia, 2018
Todor Mihaylov and Anette Frank
(See online at https://doi.org/10.18653/v1/P18-1076) - Live Blog Corpus for Summarization. In Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC), pages 3197-3203, Miyazaki, Japan, May 2018
Avinesh P. V. S., Maxime Peyrard, and Christian M. Meyer
- Multimodal Frame Identification with Multilingual Evaluation. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), pages 1481-1491, New Orleans, LA, USA, June 2018
Teresa Botschen, Iryna Gurevych, Jan-Christoph Klie, Hatem Mousselly Sergieh, and Stefan Roth
(See online at https://doi.org/10.18653/v1/N18-1134) - Multimodal Grounding for Language Processing. In Proceedings of the 27th International Conference on Computational Linguistics (COLING), pages 2325-2339, Santa Fe, NM, USA, August 2018
Teresa Botschen, Lisa Beinborn, and Iryna Gurevych
- Objective Function Learning to Match Human Judgements for Optimization-Based Summarization. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), pages 654-660, New Orleans, LA, USA, June 2018
Maxime Peyrard and Iryna Gurevych
(See online at https://doi.org/10.18653/v1/N18-2103) - Sherlock: A System for Interactive Summarization of Large Text Collections. In Proceedings of the VLDB Endowment, volume 11, pages 1902-1905, Rio de Janeiro, Brazil, July 2018
Avinesh P. V. S., Benjamin Hättasch, Orkan Ozyurt, Carsten Binnig, and Christian M. Meyer
(See online at https://doi.org/10.14778/3229863.3236220) - SRL4ORL: Improving Opinion Role Labeling using Multitask Learning with Semantic Role Labeling. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), pages 583-594, New Orleans, LA, USA, 2018
Ana Marasovic and Anette Frank
(See online at https://doi.org/10.18653/v1/N18-1054) - Towards Interactive Summarization of Large Document Collections. In Proceedings of the First Biennial Conference on Design of Experimental Search and Information Retrieval Systems (DESIRES), volume 2167 of CEUR Workshop Proceedings, page 103, 2018
Benjamin Hättasch
- UKP-Athene: Multi-Sentence Textual Entailment for Claim Verification. In Proceedings of the First Workshop on Fact Extraction and VERification (FEVER), pages 103-108, Brussels, Belgium, November 2018
Andreas Hanselowski, Hao Zhang, Zile Li, Daniil Sorokin, Benjamin Schiller, Claudia Schulz, and Iryna Gurevych
(See online at https://doi.org/10.18653/v1/W18-5516) - What’s Important in a Text? An Extensive Evaluation of Linguistic Annotations for Summarization. In Fifth International Conference on Social Networks Analysis, Management and Security (SNAMS), pages 272-277, Valencia, Spain, October 2018
Markus Zopf, Teresa Botschen, Tobias Falke, Benjamin Heinzerling, Ana Marasovic, Todor Mihaylov, Avinesh P. V. S., Eneldo Loza Mencía, Johannes Fürnkranz, and Anette Frank
(See online at https://doi.org/10.1109/SNAMS.2018.8554853) - Which Scores to Predict in Sentence Regression for Text Summarization? In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), pages 1782-1791, New Orleans, LA, USA, June 2018
Markus Zopf, Eneldo Loza Mencía, and Johannes Fürnkranz
(See online at https://doi.org/10.18653/v1/N18-1161) - A Richly Annotated Corpus for Different Tasks in Automated Fact-Checking. In Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL), pages 493-503, Hong Kong, China, November 2019
Andreas Hanselowski, Christian Stab, Claudia Schulz, Zile Li, and Iryna Gurevych
(See online at https://doi.org/10.18653/v1/K19-1046) - Data-efficient Neural Text Compression with Interactive Learning. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics, pages 2543-2554, Minneapolis, USA, February 2019
Avinesh P. V. S. and Christian M. Meyer
(See online at https://doi.org/10.18653/v1/N19-1262) - DBPal: Weak Supervision for Learning a Natural Language Interface to Databases. In 1st International Workshop on Conversational Access to Data (CAST) in conj. with the 45th International Conference on Very Large Data Bases (VLDB), Los Angeles, California, USA, August 2019
Nathaniel Weir, Andrew Crotty, Alex Galakatos, Amir Ilkhechi, Shekar Ramaswamy, Rohin Bhushan, Ugur Cetintemel, Prasetya Utama, Nadja Geisler, Benjamin Hättasch, Steffen Eger and Carsten Binnig
- Enhancing AMR-to-Text Generation with Dual Graph Representations. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 3183-3194, Hong Kong, China, August 2019
Leonardo F. R. Ribeiro, Claire Gardent, and Iryna Gurevych
(See online at https://doi.org/10.18653/v1/D19-1314) - Evaluating Machine Translation without Human References Using Cross-lingual Encoders. In The first annual EurNLP summit, Facebook London, London, United Kingdom, October 2019
Wei Zhao, Yang Gao, and Steffen Eger
- Fast Concept Mention Grouping for Concept Map-based Multi-Document Summarization. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pages 695-700, Minneapolis, Minnesota, June 2019
Tobias Falke and Iryna Gurevych
(See online at https://doi.org/10.18653/v1/N19-1074) - Fine-Grained Entity Typing in Hyperbolic Space. In Proceedings of the 4th Workshop on Representation Learning for NLP (RepL4NLP-2019), pages 169-180, Florence, Italy, August 2019
Federico López, Benjamin Heinzerling, and Michael Strube
(See online at https://doi.org/10.18653/v1/W19-4319) - Handling Noisy Labels for Robustly Learning from Self-Training Data for Low-Resource Sequence Labeling. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop, pages 29-34, Minneapolis, Minnesota, June 2019
Debjit Paul, Mittul Singh, Michael A. Hedderich, and Dietrich Klakow
(See online at https://doi.org/10.18653/v1/N19-3005) - Improving Generalization by Incorporating Coverage in Natural Language Inference. September 2019
Nafise Sadat Moosavi, Prasetya Utama, Andreas Rücklé, and Iryna Gurevych
(See online at https://doi.org/10.48550/arXiv.1909.08940) - Information Preparation with the Human in the Loop. PhD thesis, TU Darmstadt, Darmstadt, 2019
Avinesh P. V. S.
(See online at https://doi.org/10.25534/tuprints-00011839) - Interactive Summarization of Large Document Collections. In Proceedings of the Workshop on Human-In-the-Loop Data Analytics, Amsterdam, Netherlands, 2019
Benjamin Hättasch, Christian M. Meyer, and Carsten Binnig
(See online at https://doi.org/10.1145/3328519.3329129) - J3R: Joint Multi-task Learning of Ratings and Review Summaries for Explainable Recommendation. In Proceedings of The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD), volume 11908 of Lecture Notes in Computer Science, pages 339-355, Würzburg, Germany, August 2019
Avinesh P. V. S., Yongli Ren, Christian M. Meyer, Jeffrey Chan, Zhifeng Bao, and Mark Sanderson
(See online at https://doi.org/10.1007/978-3-030-46133-1_21) - Joint Wasserstein Autoencoders for Aligning Multimodal Embeddings. In 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW), Seoul, Korea, October 2019
Shweta Mahajan, Teresa Botschen, Iryna Gurevych, and Stefan Roth
(See online at https://doi.org/10.1109/ICCVW.2019.00557) - Learning Analogy-Preserving Sentence Embeddings for Answer Selection. In Proceedings of the 23rd Conference on Computational Natural Language Learning, pages 910-919, Hong Kong, China, November 2019
Aissatou Diallo, Markus Zopf, and Johannes Fürnkranz
(See online at https://doi.org/10.18653/v1/K19-1085) - Learning Dual Graph Representations for AMR-to-Text Generation. In The First International Workshop on Deep Learning on Graphs: Methods and Applications (DLG’19), in Conjunction with the 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Anchorage, Alaska, USA, August 2019
Leonardo F. R. Ribeiro, Claire Gardent, and Iryna Gurevych
- MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 563-578, Hong Kong, China, August 2019
Wei Zhao, Maxime Peyrard, Fei Liu, Yang Gao, Christian M. Meyer, and Steffen Eger
(See online at https://doi.org/10.18653/v1/D19-1053) - Ranking and Selecting Multi-Hop Knowledge Paths to Better Predict Human Needs. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pages 3671-3681, Minneapolis, Minnesota, June 2019
Debjit Paul and Anette Frank
(See online at https://doi.org/10.18653/v1/N19-1368) - Ranking Generated Summaries by Correctness: An Interesting but Challenging Application for Natural Language Inference. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), pages 2214-2220, Florence, Italy, May 2019
Tobias Falke, Leonardo F. R. Ribeiro, Prasetya Utama, Ido Dagan, and Iryna Gurevych
(See online at https://doi.org/10.18653/v1/P19-1213) - Towards Scalable and Reliable Capsule Networks for Challenging NLP Applications. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), pages 1549-1559, Florence, Italy, May 2019
Wei Zhao, Haiyun Peng, Steffen Eger, Eric Cambria, and Min Yang
(See online at https://doi.org/10.18653/v1/P19-1150) - A Fully Hyperbolic Neural Model for Hierarchical Multi-Class Classification. In Findings of the Association for Computational Linguistics: EMNLP 2020, pages 460-475, Online, November 2020
Federico López and Michael Strube
(See online at https://doi.org/10.18653/v1/2020.findings-emnlp.42) - A Machine-Learning-Based Pipeline Approach to Automated Fact-Checking. PhD thesis, Technische Universität Darmstadt, November 2020
Andreas Hanselowski
(See online at https://doi.org/10.25534/tuprints-00014136) - Common Sense or World Knowledge? Investigating Adapter-Based Knowledge Injection into Pretrained Transformers. In Proceedings of Deep Learning Inside Out (DeeLIO): The First Workshop on Knowledge Extraction and Integration for Deep Learning Architectures, pages 43-49, online, November 2020
Anne Lauscher, Olga Majewska, Leonardo F. R. Ribeiro, Iryna Gurevych, Nikolai Rozanov, and Goran Glavas
(See online at https://doi.org/10.18653/v1/2020.deelio-1.5) - DBPal: A Fully Pluggable NL2SQL Training Pipeline. In SIGMOD '20: Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data, pages 2347-2361, virtual Conference, June 2020
Nathaniel Weir, Prasetya Utama, Alex Galakatos, Andrew Crotty, Amir Ilkhechi, Shekar Ramaswamy, Rohin Bhushan, Nadja Geisler, Benjamin Hättasch, Steffen Eger, Ugur Cetintemel, and Carsten Binnig
(See online at https://doi.org/10.1145/3318464.3380589) - Evaluation of Coreference Resolution Systems Under Adversarial Attacks. In Proceedings of the First Workshop on Computational Approaches to Discourse, pages 154-159, Online, November 2020
Haixia Chai, Wei Zhao, Steffen Eger, and Michael Strube
(See online at https://doi.org/10.18653/v1/2020.codi-1.16) - Improving Robustness by Augmenting Training Sentences with Predicate-Argument Structures. 2020
Nafise Sadat Moosavi, Marcel de Boer, Prasetya Utama, and Iryna Gurevych
(See online at https://doi.org/10.48550/arXiv.2010.12510) - It’s AI Match: A Two-Step Approach for Schema Matching Using Embeddings. In 2nd International Workshop on Applied AI for Database Systems and Applications, Online, August 2020. Held with VLDB 2020
Benjamin Hättasch, Michael Truong-Ngoc, Andreas Schmidt, and Carsten Binnig
- Latent Normalizing Flows for Many-to-Many Cross Domain Mappings. In The 8th International Conference on Learning Representations (ICLR), Addis Ababa, Ethiopia, April 2020
Shweta Mahajan, Iryna Gurevych, and Stefan Roth
- Mind the Trade-off: Debiasing NLU Models without Degrading the In-distribution Performance. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), pages 8717-8729, Online, July 2020
Prasetya Utama, Nafise Sadat Moosavi, and Iryna Gurevych
(See online at https://doi.org/10.18653/v1/2020.acl-main.770) - Modeling Global and Local Node Contexts for Text Generation from Knowledge Graphs. Transactions of the Association for Computational Linguistics (TACL), 8:589-604, July 2020
Leonardo F. R. Ribeiro, Yue Zhang, Claire Gardent, and Iryna Gurevych
(See online at https://doi.org/10.1162/tacl_a_00332) - On the Limitations of Cross-lingual Encoders as Exposed by Reference-Free Machine Translation Evaluation. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), pages 1656-1671, Online, July 2020
Wei Zhao, Goran Glavas, Maxime Peyrard, Yang Gao, Robert West, and Steffen Eger
(See online at https://doi.org/10.18653/v1/2020.acl-main.151) - Permutation Learning via Lehmer Codes. In Proceedings of the 24th European Conference on Artificial Intelligence (ECAI), volume 325 of Frontiers in Artificial Intelligence and Applications, pages 1095-1102, Online, August 2020
Aissatou Diallo, Markus Zopf, and Johannes Fürnkranz
(See online at https://doi.org/10.3233/FAIA200206) - Social Commonsense Reasoning with Multi-Head Knowledge Attention. In Findings of the Association for Computational Linguistics: EMNLP 2020, pages 2969-2980, Online, November 2020
Debjit Paul and Anette Frank
(See online at https://doi.org/10.18653/v1/2020.findings-emnlp.267) - Summarization Beyond News: The Automatically Acquired Fandom Corpora. In Proceedings of the 12th Language Resources and Evaluation Conference, pages 6700-6708, Marseille, France, May 2020
Benjamin Hättasch, Nadja Geisler, Christian M. Meyer, and Carsten Binnig
- SUPERT: Towards New Frontiers in Unsupervised Evaluation Metrics for Multi-Document Summarization. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), pages 1347-1354, Online, July 2020
Yang Gao, Wei Zhao, and Steffen Eger
(See online at https://doi.org/10.18653/v1/2020.acl-main.124) - Towards Debiasing NLU Models from Unknown Biases. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 7597-7610, Online, October 2020
Prasetya Utama, Nafise Sadat Moosavi, and Iryna Gurevych
(See online at https://doi.org/10.18653/v1/2020.emnlp-main.613) - Towards Robust and Transparent Natural Language Interfaces for Databases. In Workshop on Human-In-the-Loop Data Analytics (HILDA), Portland, USA, June 2020
Christoph Brandt, Benjamin Hättasch, Nadja Geisler, and Carsten Binnig
- ASET: Ad-hoc Structured Exploration of Text Collections. In The 3rd International Workshop on Applied AI for Database Systems and Applications (AIDB), co-located with the 47th International Conference on Very Large Data Bases, online, Copenhagen, Denmark, August 2021
Benjamin Hättasch, Jan-Micha Bodensohn, and Carsten Binnig
- Avoiding Inference Heuristics in Few-shot Prompt-based Finetuning. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 9063-9074, virtual Conference and Punta Cana, Dominican Republic, November 2021
Prasetya Utama, Nafise Sadat Moosavi, Victor Sanh, and Iryna Gurevych
(See online at https://doi.org/10.18653/v1/2021.emnlp-main.713) - Better than Average: Paired Evaluation of NLP systems. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, pages 2301-2315, Online, August 2021
Maxime Peyrard, Wei Zhao, Steffen Eger, and Robert West
(See online at https://doi.org/10.18653/v1/2021.acl-long.179) - CO-NNECT: A Framework for Revealing Commonsense Knowledge Paths as Explicitations of Implicit Knowledge in Texts. In Proceedings of the 14th International Conference on Computational Semantics (IWCS), pages 21-32, Online, June 2021
Maria Becker, Katharina Korfhage, Debjit Paul, and Anette Frank
- COINS: Dynamically Generating COntextualized Inference Rules for Narrative Story Completion. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, pages 5086-5099, Online, August 2021
Debjit Paul and Anette Frank
(See online at https://doi.org/10.18653/v1/2021.acl-long.395) - Generating Hypothetical Events for Abductive Inference. In Proceedings of *SEM 2021: The 10th Joint Conference on Lexical and Computational Semantics, pages 67-77, Online, August 2021
Debjit Paul and Anette Frank
(See online at https://doi.org/10.18653/v1/2021.starsem-1.6) - Inducing Language-Agnostic Multilingual Representations. In Proceedings of *SEM 2021: The 10th Joint Conference on Lexical and Computational Semantics, pages 229-240, Online, August 2021
Wei Zhao, Steffen Eger, Johannes Bjerva, and Isabelle Augenstein
(See online at https://doi.org/10.18653/v1/2021.starsem-1.22) - Modeling Graph Structure via Relative Position for Text Generation from Knowledge Graphs. In Proceedings of the Fifteenth Workshop on Graph-Based Methods for Natural Language Processing (TextGraphs), pages 10-21, Mexico City, Mexico, June 2021
Martin Schmitt, Leonardo F. R. Ribeiro, Philipp Dufter, Iryna Gurevych, and Hinrich Schütze
(See online at https://doi.org/10.18653/v1/2021.textgraphs-1.2) - Netted?! How to Improve the Usefulness of Spider & Co. In Proceedings of the Second International Conference on Design of Experimental Search & Information REtrieval Systems, volume 2950 of CEUR Workshop Proceedings, pages 38-43, Padova, Italy, September 2021
Benjamin Hättasch, Nadja Geisler, and Carsten Binnig
- Symmetric Spaces for Graph Embeddings: A Finsler-Riemannian Approach. In Proceedings of the 38th International Conference on Machine Learning, volume 139, pages 7090-7101, Online, July 2021
Federico López, Beatrice Pozzetti, Steve Trettel, Michael Strube, and Anna Wienhard
- WannaDB: Ad-hoc Structured Exploration of Text Collections Using Queries. In Proceedings of the 2nd International Biennial Conference on Design of Experimental Search & Information Retrieval Systems (DESIRES), pages 179-180, Padua, Italy, September 2021
Benjamin Hättasch