Search | arXiv e-print repository

Showing 1–50 of 54 results for author: Ngomo, A N

  1. arXiv:2407.06855  [pdf, other]

    cs.LG cs.CR

    Performance Evaluation of Knowledge Graph Embedding Approaches under Non-adversarial Attacks

    Authors: Sourabh Kapoor, Arnab Sharma, Michael Röder, Caglar Demir, Axel-Cyrille Ngonga Ngomo

    Abstract: Knowledge Graph Embedding (KGE) transforms a discrete Knowledge Graph (KG) into a continuous vector space facilitating its use in various AI-driven applications like Semantic Search, Question Answering, or Recommenders. While KGE approaches are effective in these applications, most existing approaches assume that all information in the given KG is correct. This enables attackers to influence the o…

    Submitted 9 July, 2024; originally announced July 2024.

  2. arXiv:2407.06041  [pdf, other]

    cs.CL

    MST5 -- Multilingual Question Answering over Knowledge Graphs

    Authors: Nikit Srivastava, Mengshi Ma, Daniel Vollmers, Hamada Zahera, Diego Moussallem, Axel-Cyrille Ngonga Ngomo

    Abstract: Knowledge Graph Question Answering (KGQA) simplifies querying vast amounts of knowledge stored in a graph-based model using natural language. However, the research has largely concentrated on English, putting non-English speakers at a disadvantage. Meanwhile, existing multilingual KGQA systems face challenges in achieving performance comparable to English systems, highlighting the difficulty of ge…

    Submitted 8 July, 2024; originally announced July 2024.

  3. arXiv:2406.19092  [pdf, ps, other]

    cs.LG

    Adaptive Stochastic Weight Averaging

    Authors: Caglar Demir, Arnab Sharma, Axel-Cyrille Ngonga Ngomo

    Abstract: Ensemble models often improve generalization performances in challenging tasks. Yet, traditional techniques based on prediction averaging incur three well-known disadvantages: the computational overhead of training multiple models, increased latency, and memory requirements at test time. To address these issues, the Stochastic Weight Averaging (SWA) technique maintains a running average of model p…

    Submitted 27 June, 2024; originally announced June 2024.

  4. arXiv:2406.10144  [pdf, other]

    cs.AI

    Improving rule mining via embedding-based link prediction

    Authors: N'Dah Jean Kouagou, Arif Yilmaz, Michel Dumontier, Axel-Cyrille Ngonga Ngomo

    Abstract: Rule mining on knowledge graphs allows for explainable link prediction. Contrarily, embedding-based methods for link prediction are well known for their generalization capabilities, but their predictions are not interpretable. Several approaches combining the two families have been proposed in recent years. The majority of the resulting hybrid approaches are usually trained within a unified learni…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 13 pages, 2 figures, 11 tables

  5. arXiv:2312.01973  [pdf, ps, other]

    cs.CC cs.LO

    Computing Repairs Under Functional and Inclusion Dependencies via Argumentation

    Authors: Yasir Mahmood, Jonni Virtema, Timon Barlag, Axel-Cyrille Ngonga Ngomo

    Abstract: We discover a connection between finding subset-maximal repairs for sets of functional and inclusion dependencies, and computing extensions within argumentation frameworks (AFs). We study the complexity of the existence of a repair and deciding whether a given tuple belongs to some (or every) repair, by simulating the instances of these problems via AFs. We prove that subset-maximal repairs under…

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Pre-print

  6. Universal Knowledge Graph Embeddings

    Authors: N'Dah Jean Kouagou, Caglar Demir, Hamada M. Zahera, Adrian Wilke, Stefan Heindorf, Jiayi Li, Axel-Cyrille Ngonga Ngomo

    Abstract: A variety of knowledge graph embedding approaches have been developed. Most of them obtain embeddings by learning the structure of the knowledge graph within a link prediction setting. As a result, the embeddings reflect only the structure of a single knowledge graph, and embeddings for different knowledge graphs are not aligned, e.g., they cannot be used to find similar entities across knowledge…

    Submitted 5 July, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: 5 pages, 3 tables

    Journal ref: Companion Proceedings of the ACM Web Conference 2024 (WWW '24 Companion), May 13--17, 2024, Singapore, Singapore

  7. arXiv:2306.09802  [pdf, other]

    cs.CL

    RED$^{\rm FM}$: a Filtered and Multilingual Relation Extraction Dataset

    Authors: Pere-Lluís Huguet Cabot, Simone Tedeschi, Axel-Cyrille Ngonga Ngomo, Roberto Navigli

    Abstract: Relation Extraction (RE) is a task that identifies relationships between entities in a text, enabling the acquisition of relational facts and bridging the gap between natural language and structured knowledge. However, current RE models often rely on small datasets with low coverage of relation types, particularly when working with languages other than English. In this paper, we address the above…

    Submitted 19 June, 2023; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: ACL 2023. Please cite authors correctly using both lastnames ("Huguet Cabot", "Ngonga Ngomo")

  8. arXiv:2304.14742  [pdf, ps, other]

    cs.AI

    LitCQD: Multi-Hop Reasoning in Incomplete Knowledge Graphs with Numeric Literals

    Authors: Caglar Demir, Michel Wiebesiek, Renzhong Lu, Axel-Cyrille Ngonga Ngomo, Stefan Heindorf

    Abstract: Most real-world knowledge graphs, including Wikidata, DBpedia, and Yago are incomplete. Answering queries on such incomplete graphs is an important, but challenging problem. Recently, a number of approaches, including complex query decomposition (CQD), have been proposed to answer complex, multi-hop queries with conjunctions and disjunctions on such graphs. However, all state-of-the-art approaches…

    Submitted 28 April, 2023; originally announced April 2023.

  9. arXiv:2303.01844  [pdf, ps, other]

    cs.LO cs.AI cs.LG

    Learning Permutation-Invariant Embeddings for Description Logic Concepts

    Authors: Caglar Demir, Axel-Cyrille Ngonga Ngomo

    Abstract: Concept learning deals with learning description logic concepts from a background knowledge and input examples. The goal is to learn a concept that covers all positive examples, while not covering any negative examples. This non-trivial task is often formulated as a search problem within an infinite quasi-ordered concept space. Although state-of-the-art models have been successfully applied to tac…

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: Accepted at IDA 2023

  10. arXiv:2301.05109  [pdf, other]

    cs.AI cs.LG

    Explaining $\mathcal{ELH}$ Concept Descriptions through Counterfactual Reasoning

    Authors: Leonie Nora Sieger, Stefan Heindorf, Yasir Mahmood, Lukas Blübaum, Axel-Cyrille Ngonga Ngomo

    Abstract: Knowledge bases are widely used for information management, enabling high-impact applications such as web search, question answering, and natural language processing. They also serve as the backbone for automatic decision systems, e.g., for medical diagnostics and credit scoring. As stakeholders affected by these decisions would like to understand their situation and verify how fair the decisions…

    Submitted 4 October, 2023; v1 submitted 12 January, 2023; originally announced January 2023.

  11. arXiv:2207.08544  [pdf, ps, other]

    cs.LG cs.AI

    Hardware-agnostic Computation for Large-scale Knowledge Graph Embeddings

    Authors: Caglar Demir, Axel-Cyrille Ngonga Ngomo

    Abstract: Knowledge graph embedding research has mainly focused on learning continuous representations of knowledge graphs towards the link prediction problem. Recently developed frameworks can be effectively applied in research related applications. Yet, these frameworks do not fulfill many requirements of real-world applications. As the size of the knowledge graph grows, moving computation from a commodit…

    Submitted 18 July, 2022; originally announced July 2022.

    Comments: accepted in Software Impacts journal

  12. arXiv:2205.06560  [pdf, ps, other]

    cs.LG cs.AI

    Kronecker Decomposition for Knowledge Graph Embeddings

    Authors: Caglar Demir, Julian Lienen, Axel-Cyrille Ngonga Ngomo

    Abstract: Knowledge graph embedding research has mainly focused on learning continuous representations of entities and relations tailored towards the link prediction problem. Recent results indicate an ever increasing predictive ability of current approaches on benchmark datasets. However, this effectiveness often comes with the cost of over-parameterization and increased computational complexity. The for…

    Submitted 13 May, 2022; originally announced May 2022.

    Comments: Accepted at HT 2022

  13. arXiv:2112.07606  [pdf, ps, other]

    cs.CL cs.AI

    Semantic Answer Type and Relation Prediction Task (SMART 2021)

    Authors: Nandana Mihindukulasooriya, Mohnish Dubey, Alfio Gliozzo, Jens Lehmann, Axel-Cyrille Ngonga Ngomo, Ricardo Usbeck, Gaetano Rossiello, Uttam Kumar

    Abstract: Each year the International Semantic Web Conference organizes a set of Semantic Web Challenges to establish competitions that will advance state-of-the-art solutions in some problem domains. The Semantic Answer Type and Relation Prediction Task (SMART) is one of the ISWC 2021 Semantic Web challenges. This is the second year of the challenge after a successful SMART 2020 at ISWC 2020. This yea…

    Submitted 10 January, 2022; v1 submitted 7 December, 2021; originally announced December 2021.

    ACM Class: F.4.1; I.2.4; I.2.7

  14. Neural Class Expression Synthesis

    Authors: N'Dah Jean Kouagou, Stefan Heindorf, Caglar Demir, Axel-Cyrille Ngonga Ngomo

    Abstract: Many applications require explainable node classification in knowledge graphs. Towards this end, a popular "white-box" approach is class expression learning: Given sets of positive and negative nodes, class expressions in description logics are learned that separate positive from negative nodes. Most existing approaches are search-based approaches generating many candidate class expressions and…

    Submitted 13 June, 2024; v1 submitted 16 November, 2021; originally announced November 2021.

    Comments: 18 pages, 2 figures, ESWC 2023--Research Track

  15. arXiv:2111.04879  [pdf, other]

    cs.AI cs.LG cs.NE

    EvoLearner: Learning Description Logics with Evolutionary Algorithms

    Authors: Stefan Heindorf, Lukas Blübaum, Nick Düsterhus, Till Werner, Varun Nandkumar Golani, Caglar Demir, Axel-Cyrille Ngonga Ngomo

    Abstract: Classifying nodes in knowledge graphs is an important task, e.g., for predicting missing types of entities, predicting which molecules cause cancer, or predicting which drugs are promising treatment candidates. While black-box models often achieve high predictive performance, they are only post-hoc and locally explainable and do not allow the learned model to be easily enriched with domain knowled…

    Submitted 8 March, 2022; v1 submitted 8 November, 2021; originally announced November 2021.

    Comments: Accepted at WWW 2022

  16. arXiv:2107.04911  [pdf, other]

    cs.LG

    Learning Concept Lengths Accelerates Concept Learning in ALC

    Authors: N'Dah Jean Kouagou, Stefan Heindorf, Caglar Demir, Axel-Cyrille Ngonga Ngomo

    Abstract: Concept learning approaches based on refinement operators explore partially ordered solution spaces to compute concepts, which are used as binary classification models for individuals. However, the number of concepts explored by these approaches can grow to the millions for complex learning problems. This often leads to impractical runtimes. We propose to alleviate this problem by predicting the l…

    Submitted 16 May, 2022; v1 submitted 10 July, 2021; originally announced July 2021.

    Comments: 15 pages, 2 figures, 7 tables

  17. arXiv:2106.15373  [pdf, ps, other]

    cs.AI cs.LG

    DRILL-- Deep Reinforcement Learning for Refinement Operators in $\mathcal{ALC}$

    Authors: Caglar Demir, Axel-Cyrille Ngonga Ngomo

    Abstract: Approaches based on refinement operators have been successfully applied to class expression learning on RDF knowledge graphs. These approaches often need to explore a large number of concepts to find adequate hypotheses. This need arguably stems from current approaches relying on myopic heuristic functions to guide their search through an infinite concept space. In turn, deep reinforcement learnin…

    Submitted 29 June, 2021; originally announced June 2021.

  18. arXiv:2106.15230  [pdf, other]

    cs.LG cs.IR

    Convolutional Hypercomplex Embeddings for Link Prediction

    Authors: Caglar Demir, Diego Moussallem, Stefan Heindorf, Axel-Cyrille Ngonga Ngomo

    Abstract: Knowledge graph embedding research has mainly focused on the two smallest normed division algebras, $\mathbb{R}$ and $\mathbb{C}$. Recent results suggest that trilinear products of quaternion-valued embeddings can be a more effective means to tackle link prediction. In addition, models based on convolutions on real-valued embeddings often yield state-of-the-art results for link prediction. In this…

    Submitted 18 November, 2021; v1 submitted 29 June, 2021; originally announced June 2021.

    Journal ref: The 13th Asian Conference on Machine Learning, ACML 2021

  19. arXiv:2105.12524  [pdf, ps, other]

    cs.LG cs.SI

    Out-of-Vocabulary Entities in Link Prediction

    Authors: Caglar Demir, Axel-Cyrille Ngonga Ngomo

    Abstract: Knowledge graph embedding techniques are key to making knowledge graphs amenable to the plethora of machine learning approaches based on vector representations. Link prediction is often used as a proxy to evaluate the quality of these embeddings. Given that the creation of benchmarks for link prediction is a time-consuming endeavor, most work on the subject matter uses only a few benchmarks. As be…

    Submitted 26 May, 2021; originally announced May 2021.

  20. arXiv:2105.03161  [pdf]

    cs.DB

    Open Data Portal Germany (OPAL) Projektergebnisse

    Authors: Adrian Wilke, Axel-Cyrille Ngonga Ngomo

    Abstract: In the Open Data Portal Germany (OPAL) project, a pipeline of the following data refinement steps has been developed: requirements analysis, data acquisition, analysis, conversion, integration and selection. 800,000 datasets in DCAT format have been produced.

    Submitted 7 May, 2021; originally announced May 2021.

    Comments: in German

  21. arXiv:2105.00741  [pdf, other]

    cs.SE

    MLCheck- Property-Driven Testing of Machine Learning Models

    Authors: Arnab Sharma, Caglar Demir, Axel-Cyrille Ngonga Ngomo, Heike Wehrheim

    Abstract: In recent years, we observe an increasing amount of software with machine learning components being deployed. This poses the question of quality assurance for such components: how can we validate whether specified requirements are fulfilled by a machine learned software? Current testing and verification approaches either focus on a single requirement (e.g., fairness) or specialize on a single type…

    Submitted 3 May, 2021; originally announced May 2021.

  22. arXiv:2104.00984  [pdf, other]

    cs.DB cs.LG cs.PF

    An Empirical Evaluation of Cost-based Federated SPARQL Query Processing Engines

    Authors: Umair Qudus, Muhammad Saleem, Axel-Cyrille Ngonga Ngomo, Young-koo Lee

    Abstract: Finding a good query plan is key to the optimization of query runtime. This holds in particular for cost-based federation engines, which make use of cardinality estimations to achieve this goal. A number of studies compare SPARQL federation engines across different performance metrics, including query runtime, result set completeness and correctness, number of sources selected and number of reques…

    Submitted 2 April, 2021; originally announced April 2021.

    Comments: 24 pages, Semantic Web, 2020, #article

    Journal ref: Semantic Web 2020

  23. Knowledge Graph Question Answering using Graph-Pattern Isomorphism

    Authors: Daniel Vollmers, Rricha Jalota, Diego Moussallem, Hardik Topiwala, Axel-Cyrille Ngonga Ngomo, Ricardo Usbeck

    Abstract: Knowledge Graph Question Answering (KGQA) systems are based on machine learning algorithms, requiring thousands of question-answer pairs as training examples or natural language processing pipelines that need module fine-tuning. In this paper, we present a novel QA approach, dubbed TeBaQA. Our approach learns to answer questions based on graph isomorphisms from basic graph patterns of SPARQL queri…

    Submitted 2 February, 2022; v1 submitted 11 March, 2021; originally announced March 2021.

    Comments: Version published in the proceedings of the 17th International Conference on Semantic Systems

    Journal ref: Further with Knowledge Graphs - Proceedings of the 17th International Conference on Semantic Systems 53 (2021) 103-117

  24. arXiv:2102.13027  [pdf, ps, other]

    cs.DB

    A Survey of RDF Stores & SPARQL Engines for Querying Knowledge Graphs

    Authors: Waqas Ali, Muhammad Saleem, Bin Yao, Aidan Hogan, Axel-Cyrille Ngonga Ngomo

    Abstract: RDF has seen increased adoption in recent years, prompting the standardization of the SPARQL query language for RDF, and the development of local and distributed engines for processing SPARQL queries. This survey paper provides a comprehensive review of techniques and systems for querying RDF knowledge graphs. While other reviews on this topic tend to focus on the distributed setting, the main foc…

    Submitted 13 October, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

    Comments: This version adds 15 more systems, more details on approaches for processing property paths, as well as some other minor changes

  25. arXiv:2101.09090  [pdf, other]

    cs.LG cs.CL

    A shallow neural model for relation prediction

    Authors: Caglar Demir, Diego Moussallem, Axel-Cyrille Ngonga Ngomo

    Abstract: Knowledge graph completion refers to predicting missing triples. Most approaches achieve this goal by predicting entities, given an entity and a relation. We predict missing triples via relation prediction. To this end, we frame the relation prediction problem as a multi-label classification problem and propose a shallow neural model (SHALLOM) that accurately infers missing relations from enti…

    Submitted 22 January, 2021; originally announced January 2021.

    Comments: 15th IEEE International Conference on Semantic Computing, ICSC-2021

  26. arXiv:2012.00555  [pdf, ps, other]

    cs.AI cs.CL cs.IR

    SeMantic AnsweR Type prediction task (SMART) at ISWC 2020 Semantic Web Challenge

    Authors: Nandana Mihindukulasooriya, Mohnish Dubey, Alfio Gliozzo, Jens Lehmann, Axel-Cyrille Ngonga Ngomo, Ricardo Usbeck

    Abstract: Each year the International Semantic Web Conference accepts a set of Semantic Web Challenges to establish competitions that will advance the state of the art solutions in any given problem domain. The SeMantic AnsweR Type prediction task (SMART) was part of ISWC 2020 challenges. Question type and answer type prediction can play a key role in knowledge base question answering systems providing insi…

    Submitted 1 December, 2020; originally announced December 2020.

  27. arXiv:2009.10331   

    cs.DB

    Storage, Indexing, Query Processing, and Benchmarking in Centralized and Distributed RDF Engines: A Survey

    Authors: Waqas Ali, Muhammad Saleem, Bin Yao, Aidan Hogan, Axel-Cyrille Ngonga Ngomo

    Abstract: The recent advancements of the Semantic Web and Linked Data have changed the working of the traditional web. There is significant adoption of the Resource Description Framework (RDF) format for saving of web-based data. This massive adoption has paved the way for the development of various centralized and distributed RDF processing engines. These engines employ various mechanisms to implement crit…

    Submitted 23 September, 2020; v1 submitted 22 September, 2020; originally announced September 2020.

    Comments: reference list has been updated

  28. arXiv:2009.07728  [pdf, other]

    cs.CL

    NABU $\mathrm{-}$ Multilingual Graph-based Neural RDF Verbalizer

    Authors: Diego Moussallem, Dwaraknath Gnaneshwar, Thiago Castro Ferreira, Axel-Cyrille Ngonga Ngomo

    Abstract: The RDF-to-text task has recently gained substantial attention due to continuous growth of Linked Data. In contrast to traditional pipeline models, recent studies have focused on neural models, which are now able to convert a set of RDF triples into text in an end-to-end style with promising results. However, English is the only language widely targeted. We address this research gap by presenting…

    Submitted 21 September, 2020; v1 submitted 16 September, 2020; originally announced September 2020.

    Comments: International Semantic Web Conference (ISWC) 2020

  29. arXiv:2009.06625  [pdf, other]

    cs.DB cs.AI

    Revealing Secrets in SPARQL Session Level

    Authors: Xinyue Zhang, Meng Wang, Muhammad Saleem, Axel-Cyrille Ngonga Ngomo, Guilin Qi, Haofen Wang

    Abstract: Based on Semantic Web technologies, knowledge graphs help users to discover information of interest by using live SPARQL services. Answer-seekers often examine intermediate results iteratively and modify SPARQL queries repeatedly in a search session. In this context, understanding user behaviors is critical for effective intention prediction and query optimization. However, these behaviors have no…

    Submitted 1 November, 2020; v1 submitted 13 September, 2020; originally announced September 2020.

    Comments: 18 pages. Accepted by ISWC 2020

  30. arXiv:2008.13544  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG stat.ML

    I-AID: Identifying Actionable Information from Disaster-related Tweets

    Authors: Hamada M. Zahera, Rricha Jalota, Mohamed A. Sherif, Axel N. Ngomo

    Abstract: Social media plays a significant role in disaster management by providing valuable data about affected people, donations and help requests. Recent studies highlight the need to filter information on social media into fine-grained content labels. However, identifying useful information from massive amounts of social media posts during a crisis is a challenging task. In this paper, we propose I-AID,…

    Submitted 18 May, 2021; v1 submitted 4 August, 2020; originally announced August 2020.

  31. arXiv:2008.03130  [pdf, ps, other]

    cs.LG cs.CL stat.ML

    Convolutional Complex Knowledge Graph Embeddings

    Authors: Caglar Demir, Axel-Cyrille Ngonga Ngomo

    Abstract: In this paper, we study the problem of learning continuous vector representations of knowledge graphs for predicting missing links. We present a new approach called ConEx, which infers missing links by leveraging the composition of a 2D convolution with a Hermitian inner product of complex-valued embedding vectors. We evaluate ConEx against state-of-the-art approaches on the WN18RR, FB15K-237, KIN…

    Submitted 9 June, 2021; v1 submitted 7 August, 2020; originally announced August 2020.

  32. arXiv:2004.13843  [pdf, other]

    cs.CL cs.DB cs.LG

    Template-based Question Answering using Recursive Neural Networks

    Authors: Ram G Athreya, Srividya Bansal, Axel-Cyrille Ngonga Ngomo, Ricardo Usbeck

    Abstract: We propose a neural network-based approach to automatically learn and classify natural language questions into their corresponding templates using recursive neural networks. An obvious advantage of using neural networks is the elimination of the need for laborious feature engineering that can be cumbersome and error-prone. The input question is encoded into a vector representation. The model is train…

    Submitted 8 June, 2020; v1 submitted 3 April, 2020; originally announced April 2020.

  33. arXiv:2003.02320  [pdf, other]

    cs.AI cs.DB cs.LG

    Knowledge Graphs

    Authors: Aidan Hogan, Eva Blomqvist, Michael Cochez, Claudia d'Amato, Gerard de Melo, Claudio Gutierrez, José Emilio Labra Gayo, Sabrina Kirrane, Sebastian Neumaier, Axel Polleres, Roberto Navigli, Axel-Cyrille Ngonga Ngomo, Sabbir M. Rashid, Anisa Rula, Lukas Schmelzeisen, Juan Sequeda, Steffen Staab, Antoine Zimmermann

    Abstract: In this paper we provide a comprehensive introduction to knowledge graphs, which have recently garnered significant attention from both industry and academia in scenarios that require exploiting diverse, dynamic, large-scale collections of data. After some opening remarks, we motivate and contrast various graph-based data models and query languages that are used for knowledge graphs. We discuss th…

    Submitted 11 September, 2021; v1 submitted 4 March, 2020; originally announced March 2020.

    Comments: Revision from v5: Correcting errata from previous version for entailment/models, and some other minor typos

    Journal ref: ACM Comput. Surv. 54(4): 71:1-71:37 (2021)

  34. arXiv:2002.06039  [pdf, other]

    cs.DB

    Benchmarking Knowledge Graphs on the Web

    Authors: Michael Röder, Mohamed Ahmed Sherif, Muhammad Saleem, Felix Conrads, Axel-Cyrille Ngonga Ngomo

    Abstract: With the growing interest in making use of Knowledge Graphs for developing explainable artificial intelligence, there is an increasing need for a comparable and repeatable comparison of the performance of Knowledge Graph-based systems. History in computer science has shown that a main driver to scientific advances, and in fact a core element of the scientific method as a whole, is the provision of benc…

    Submitted 14 February, 2020; originally announced February 2020.

  35. A Physical Embedding Model for Knowledge Graphs

    Authors: Caglar Demir, Axel-Cyrille Ngonga Ngomo

    Abstract: Knowledge graph embedding methods learn continuous vector representations for entities in knowledge graphs and have been used successfully in a large number of applications. We present a novel and scalable paradigm for the computation of knowledge graph embeddings, which we dub PYKE. Our approach combines a physical model based on Hooke's law and its inverse with ideas from simulated annealing to…

    Submitted 21 January, 2020; originally announced January 2020.

    Comments: 9th Joint International Conference, JIST 2019, Hangzhou, China

  36. arXiv:1912.08026  [pdf, other]

    cs.DB cs.PF

    ORCA: a Benchmark for Data Web Crawlers

    Authors: Michael Röder, Geraldo de Souza, Denis Kuchelev, Abdelmoneim Amer Desouki, Axel-Cyrille Ngonga Ngomo

    Abstract: The number of RDF knowledge graphs available on the Web grows constantly. Gathering these graphs at large scale for downstream applications hence requires the use of crawlers. Although Data Web crawlers exist, and general Web crawlers could be adapted to focus on the Data Web, there is currently no benchmark to fairly evaluate their performance. Our work closes this gap by presenting the Orca benc…

    Submitted 29 October, 2020; v1 submitted 17 December, 2019; originally announced December 2019.

    Comments: 8 pages, submitted to a conference

  37. arXiv:1911.01248  [pdf, other]

    cs.CL cs.DB

    A Holistic Natural Language Generation Framework for the Semantic Web

    Authors: Axel-Cyrille Ngonga Ngomo, Diego Moussallem, Lorenz Bühmann

    Abstract: With the ever-growing generation of data for the Semantic Web comes an increasing demand for this data to be made available to non-semantic Web experts. One way of achieving this goal is to translate the languages of the Semantic Web into natural language. We present LD2NL, a framework for verbalizing the three key languages of the Semantic Web, i.e., RDF, OWL, and SPARQL. Our framework is based o…

    Submitted 4 November, 2019; originally announced November 2019.

    Comments: International Conference Recent Advances in Natural Language Processing

  38. arXiv:1907.10676  [pdf, ps, other]

    cs.CL

    Semantic Web for Machine Translation: Challenges and Directions

    Authors: Diego Moussallem, Matthias Wauer, Axel-Cyrille Ngonga Ngomo

    Abstract: A large number of machine translation approaches have recently been developed to facilitate the fluid migration of content across languages. However, the literature suggests that many obstacles must still be dealt with to achieve better automatic translations. One of these obstacles is lexical and syntactic ambiguity. A promising way of overcoming this problem is using Semantic Web technologies. T…

    Submitted 23 July, 2019; originally announced July 2019.

    Comments: Accepted at the Journal track of International Semantic Web conference (ISWC) 2019. arXiv admin note: substantial text overlap with arXiv:1711.09476

  39. arXiv:1903.10326  [pdf, other]

    cs.DS

    topFiberM: Scalable and Efficient Boolean Matrix Factorization

    Authors: Abdelmoneim Amer Desouki, Michael Röder, Axel-Cyrille Ngonga Ngomo

    Abstract: Matrix Factorization has many applications such as clustering. When the matrix is Boolean it is favorable to have Boolean factors too. This saves the effort of quantizing the reconstructed data back, which usually is done using arbitrary thresholds. Here we introduce topFiberM, a Boolean matrix factorization algorithm. topFiberM chooses in a greedy way the fibers (rows or columns) to represent…

    Submitted 5 March, 2019; originally announced March 2019.

    Comments: 9 pages, 1 Figure, 3 tables

  40. arXiv:1902.08816  [pdf, other]

    cs.CL

    Augmenting Neural Machine Translation with Knowledge Graphs

    Authors: Diego Moussallem, Mihael Arčan, Axel-Cyrille Ngonga Ngomo, Paul Buitelaar

    Abstract: While neural networks have been used extensively to make substantial progress in the machine translation task, they are known for being heavily dependent on the availability of large amounts of training data. Recent efforts have tried to alleviate the data sparsity problem by augmenting the training data using different strategies, such as back-translation. Along with the data scarcity, the out-of…

    Submitted 23 February, 2019; originally announced February 2019.

  41. arXiv:1805.11467  [pdf, other]

    cs.CL

    Entity Linking in 40 Languages using MAG

    Authors: Diego Moussallem, Ricardo Usbeck, Michael Röder, Axel-Cyrille Ngonga Ngomo

    Abstract: A plethora of Entity Linking (EL) approaches has recently been developed. While many claim to be multilingual, the MAG (Multilingual AGDISTIS) approach has been shown recently to outperform the state of the art in multilingual EL on 7 languages. With this demo, we extend MAG to support EL in 40 different languages, including especially low-resource languages such as Ukrainian, Greek, Hungarian, C…

    Submitted 29 May, 2018; originally announced May 2018.

    Comments: Accepted at ESWC 2018

  42. arXiv:1802.08150  [pdf, other]

    cs.CL

    RDF2PT: Generating Brazilian Portuguese Texts from RDF Data

    Authors: Diego Moussallem, Thiago Castro Ferreira, Marcos Zampieri, Maria Claudia Cavalcanti, Geraldo Xexéo, Mariana Neves, Axel-Cyrille Ngonga Ngomo

    Abstract: The generation of natural language from Resource Description Framework (RDF) data has recently gained significant attention due to the continuous growth of Linked Data. A number of these approaches generate natural language in languages other than English; however, no work has been proposed to generate Brazilian Portuguese texts out of RDF. We address this research gap by presenting RDF2PT, an app…

    Submitted 22 February, 2018; originally announced February 2018.

    Comments: Accepted for publication in Language Resources and Evaluation Conference (LREC) 2018

  43. arXiv:1802.08148  [pdf, other]

    cs.CL

    LIDIOMS: A Multilingual Linked Idioms Data Set

    Authors: Diego Moussallem, Mohamed Ahmed Sherif, Diego Esteves, Marcos Zampieri, Axel-Cyrille Ngonga Ngomo

    Abstract: In this paper, we describe the LIDIOMS data set, a multilingual RDF representation of idioms currently containing five languages: English, German, Italian, Portuguese, and Russian. The data set is intended to support natural language processing applications by providing links between idioms across languages. The underlying data was crawled and integrated from various sources. To ensure the quality…

    Submitted 22 February, 2018; originally announced February 2018.

    Comments: Accepted for publication in Language Resources and Evaluation Conference (LREC) 2018

  44. arXiv:1802.03638  [pdf, other]

    cs.DB cs.AI

    Beyond Markov Logic: Efficient Mining of Prediction Rules in Large Graphs

    Authors: Tommaso Soru, André Valdestilhas, Edgard Marx, Axel-Cyrille Ngonga Ngomo

    Abstract: Graph representations of large knowledge bases may comprise billions of edges. Usually built upon human-generated ontologies, several knowledge bases do not feature declared ontological rules and are far from being complete. Current rule mining approaches rely on schemata or store the graph in-memory, which can be unfeasible for large graphs. In this paper, we introduce HornConcerto, an algorithm…

    Submitted 13 February, 2018; v1 submitted 10 February, 2018; originally announced February 2018.

    Comments: 13 pages, 4 figures

    ACM Class: G.3.8; E.1.3

  45. Machine Translation using Semantic Web Technologies: A Survey

    Authors: Diego Moussallem, Matthias Wauer, Axel-Cyrille Ngonga Ngomo

    Abstract: A large number of machine translation approaches have recently been developed to facilitate the fluid migration of content across languages. However, the literature suggests that many obstacles must still be dealt with to achieve better automatic translations. One of these obstacles is lexical and syntactic ambiguity. A promising way of overcoming this problem is using Semantic Web technologies. T…

    Submitted 17 July, 2018; v1 submitted 26 November, 2017; originally announced November 2017.

    Comments: 23 pages, 2 figures, 4 tables

  46. arXiv:1711.01283  [pdf, other]

    cs.DB cs.AI

    Mandolin: A Knowledge Discovery Framework for the Web of Data

    Authors: Tommaso Soru, Diego Esteves, Edgard Marx, Axel-Cyrille Ngonga Ngomo

    Abstract: Markov Logic Networks join probabilistic modeling with first-order logic and have been shown to integrate well with the Semantic Web foundations. While several approaches have been devised to tackle the subproblems of rule mining, grounding, and inference, no comprehensive workflow has been proposed so far. In this paper, we fill this gap by introducing a framework called Mandolin, which implement…

    Submitted 3 November, 2017; originally announced November 2017.

    Comments: 6 pages

    ACM Class: G.3.8; E.1.3

  47. arXiv:1710.08691  [pdf, other]

    cs.CL

    BENGAL: An Automatic Benchmark Generator for Entity Recognition and Linking

    Authors: Axel-Cyrille Ngonga Ngomo, Michael Röder, Diego Moussallem, Ricardo Usbeck, René Speck

    Abstract: The manual creation of gold standards for named entity recognition and entity linking is time- and resource-intensive. Moreover, recent works show that such gold standards contain a large proportion of mistakes in addition to being difficult to maintain. We hence present BENGAL, a novel automatic generation of such gold standards as a complement to manually created benchmarks. The main advantage o…

    Submitted 1 November, 2018; v1 submitted 24 October, 2017; originally announced October 2017.

    Comments: Accepted at INLG 2018

  48. arXiv:1710.08634  [pdf, other]

    cs.IR cs.CL

    Using Multi-Label Classification for Improved Question Answering

    Authors: Ricardo Usbeck, Michael Hoffmann, Michael Röder, Jens Lehmann, Axel-Cyrille Ngonga Ngomo

    Abstract: A plethora of diverse approaches for question answering over RDF data have been developed in recent years. While the accuracy of these systems has increased significantly over time, most systems still focus on particular types of questions or particular challenges in question answering. What is a curse for single systems is a blessing for the combination of these systems. We show in this paper how…

    Submitted 24 October, 2017; originally announced October 2017.

    Comments: 15 pages, 4 Tables, 3 Figures

  49. MAG: A Multilingual, Knowledge-base Agnostic and Deterministic Entity Linking Approach

    Authors: Diego Moussallem, Ricardo Usbeck, Michael Röder, Axel-Cyrille Ngonga Ngomo

    Abstract: Entity linking has recently been the subject of a significant body of research. Currently, the best performing approaches rely on trained mono-lingual models. Porting these approaches to other languages is consequently a difficult endeavor as it requires corresponding training data and retraining of the models. We address this drawback by presenting a novel multilingual, knowledge-based agnostic a…

    Submitted 17 October, 2017; v1 submitted 17 July, 2017; originally announced July 2017.

    Comments: Accepted in K-CAP 2017: Knowledge Capture Conference

    ACM Class: I.2.7

  50. ROCKER: A Refinement Operator for Key Discovery

    Authors: Tommaso Soru, Edgard Marx, Axel-Cyrille Ngonga Ngomo

    Abstract: The Linked Data principles provide a decentral approach for publishing structured data in the RDF format on the Web. In contrast to structured data published in relational databases where a key is often provided explicitly, finding a set of properties that allows identifying a resource uniquely is a non-trivial task. Still, finding keys is of central importance for manifold applications such as re…

    Submitted 11 May, 2017; originally announced May 2017.

    Comments: WWW 2015

    MSC Class: 68W99 ACM Class: H.4.M; I.2.8