Confidential — Stefan Michaelcheck Only

Solved Status: 850 Problems

GPT-5.5-pro analyzed every identified research problem in the KG field. Not a single one is fully solved.

761
Active (89.5%)
55
Superseded (6.5%)
34
Partial (4%)
0
Fully Solved
0.75
Avg Confidence
Zero problems are fully solved or abandoned. The entire KG research field is in active evolution — problems get partially addressed or reframed, but never fully closed. LLM-related problems (hallucination, knowledge limitations) are the fastest-growing, appearing only since 2021.
Top Active Problems (by paper count)
ProblemPapersYearsStatusConf
kg incompleteness
KG incompleteness remains a fundamental open-world problem: embedding-based knowledge graph completion, open-world KGC benchmarks, representation lear...
1,72619952026active0.93
kg question answering
KG question answering has seen substantial progress through semantic parsing, entity-linking pipelines, neural retrieval/reasoning models, KGQA system...
34820152026active0.92
automated knowledge graph construction
Automated knowledge graph construction has progressed through extraction/linking pipelines, domain-specific construction methods such as course knowle...
30320092026active0.90
knowledge graph embedding learning
Knowledge graph embedding learning has become a mature and widely used approach for KG completion and link prediction, with standard families such as ...
23320152026active0.88
llm hallucination
LLM hallucination in KG contexts is being actively mitigated through KG-enhanced LLMs, KG-enhanced RAG, graph reasoning, QA systems, and agentic retri...
19720232026active0.92
temporal kg reasoning
Temporal KG reasoning has seen strong progress through temporal KG embeddings, attention/GNN-based temporal encoders, subgraph extraction, adaptive te...
18420162026active0.90
high-dimensional kge cost
High-dimensional KGE cost has been attacked through tensor/matrix factorization, more efficient training frameworks, convolutional or dynamic embeddin...
18120142026active0.84
kg recommendation data sparsity
Knowledge graphs have clearly helped alleviate recommender-system data sparsity by injecting entity relations, attributes, paths, and neighborhoods in...
16020162026active0.86
llm knowledge limitations
LLM knowledge limitations remain an active problem: KG-enhanced LLMs, KG-enhanced RAG, graph retrieval, prompting, and KG reasoning have improved fact...
14120212026active0.88
kg link prediction
KG link prediction has made substantial progress through knowledge graph embeddings, tensor/matrix factorization, GNN-based models, temporal regulariz...
13020152026active0.88
data sparsity and cold start
KG-based recommendation has become the dominant mitigation strategy for sparsity and cold start, using entity relations, embeddings, graph attention/p...
13020162026active0.89
heterogeneous data integration
Heterogeneous data integration remains an open KG problem: ontology/semantic integration, KG construction pipelines, embeddings/representation learnin...
12720052026active0.87
domain-specific kg construction challenges
The literature has moved from conventional domain-specific construction pipelines, graph databases, and manually engineered schemas toward LLM-assiste...
11220162026active0.82
kg entity alignment
KG entity alignment has seen substantial methodological progress, moving from embedding and GCN/attention-based approaches toward hierarchical alignme...
9420182026active0.88
rag knowledge conflicts
RAG knowledge conflicts are a very recent and rapidly growing problem, especially where biomedical answers must reconcile retrieved evidence, knowledg...
9420242026active0.78
multi-hop kg reasoning
Multi-hop KG reasoning remains an active open problem: early RL/path-based methods and later GNN, subgraph, noise-aware, KGQA, and LLM-augmented appro...
8920172026active0.91
static kg temporal limitation
The community has mainly addressed static KG temporal limitations by moving toward temporal knowledge graphs and time-aware embedding or tensor models...
7620152026active0.84
llm hallucination in kgc
LLM hallucination in knowledge graph completion has become a sharply active problem as LLMs are increasingly used to predict missing KG facts, generat...
7520202026active0.88
joint entity-relation extraction
Joint entity-relation extraction has seen substantial progress from neural joint models, especially transformer-based span, table-filling, sequence-to...
7520082026active0.88
large-scale knowledge graph management
Large-scale knowledge graph management has seen incremental techniques such as scalable reasoning, distributed KG use, embedding variants, corpus augm...
6520152026active0.78
Partially Solved (34)
ProblemPapersKey SolutionYearConf
cf sparsity and cold start26knowledge-graph collaborative filtering with knowledge-enhanced GNNs and relation-aware attention propagation20210.78
multi-modal kg resource need24Multimodal KG construction and enrichment using DBpedia/Wikidata extensions, domain-specific MMKG construction, and multimodal knowledge fusion/representation learning20220.67
up-to-date biomedical knowledge access17automated biomedical knowledge graph construction using biomedical NER/relation extraction, disease-specific KG generators, and KG-enhanced retrieval interfaces20210.72
biodiversity data integration14domain-specific biodiversity knowledge graphs with FAIR/global identifier reconciliation20180.74
author name disambiguation13metadata-rich, graph-based author identity resolution using multimodal literal features, bibliometric/semantic KG context, and curated evaluation datasets20210.68
research hotspot identification13CiteSpace-based bibliometric knowledge graph analysis with co-citation, keyword co-occurrence, burst detection, and visualization20060.76
traditional recommender limitations12KG-enhanced recommendation using knowledge graph embeddings and GNN-based reasoning20190.72
heterogeneous vulnerability data management12ontology- and RDF/SPARQL-based domain knowledge graphs, often reusing Wikidata or domain ontologies20200.68
human disease mechanism representation10PheKnowLator-style biomedical knowledge graph construction with ontology/data-source integration and OWL reasoning closure20210.72
collaborative filtering limitations9knowledge graph-enhanced collaborative filtering / knowledge graph-based recommendation20200.74
relation pattern modeling8RotatE-style relational rotation embeddings with self-adversarial negative sampling20190.78
heterogeneous climate data integration7linked-data/RDF climate knowledge graph platforms with SPARQL endpoints, graph databases, and emerging virtual knowledge graph integration20220.72
scalable reasoning integration7restricted-fragment scalable reasoning, especially OWL 2 RL/Datalog-style distributed materialization, with probabilistic soft logic for soft or uncertain inference20160.66
seed-dependent entity alignment7semi-supervised seed expansion and one-to-one matching, later complemented by LLM/text-based entity alignment20210.72
distant supervision label noise7bag-level multi-instance learning with attention/instance selection, often combined with relation embeddings20160.78
medical named entity recognition7BERT-based neural sequence labeling, especially BERT-BiLSTM-CRF and KG-infused BERT variants20200.74
systematic review need6systematic literature reviews and paradigm-comparison/resource-brief frameworks20230.68
drug discovery data integration6biomedical knowledge graph construction with RDF/SPARQL access, exemplified by OREGANO drug knowledge graph resources20230.68
incomplete type constraints6local closed-world type approximation, later complemented by soft type-constrained latent and embedding models20150.68
knowledge noise in kg-enhanced text6K-BERT-style knowledge injection with soft-position embeddings and visible-matrix attention masking20190.68
missing semantic hierarchy modeling6HAKE-style polar-coordinate / radial-angular hierarchy-aware embeddings20190.74
semantic scholarly querying6Open Research Knowledge Graph (ORKG) infrastructure with semantic scholarly APIs and structured contribution modeling20180.72
translation kge misses relation patterns6relation-pattern-aware KGE models, especially RotatE/HRotatE-style rotation embeddings and rule-augmented embedding methods20190.72
using side information for recommendation5KG-enhanced recommendation using multi-task feature learning, especially MKR-style cross-compress feature sharing between recommendation and KG representation learning20190.70
power equipment information retrieval5domain-specific power knowledge graph construction with a semantic retrieval layer20190.62
knowledge graph querying5standardized graph query languages and optimized graph/RDF query engines, especially SPARQL/SPARQL 1.1, triplestores, graph-database APIs, and abstract query-interface layers20130.68
loss of kg structure in linearized encoders5structure-preserving graph encoders, especially bidirectional Graph2Seq/subgraph-GCN encoders with node-level copy decoders20190.64
scalable rdf knowledge graph creation from complex data5RML-family declarative mappings with scalable RDFizer engines such as SDM-RDFizer and optimized logical-operator execution20200.70
multi-domain dialogue state tracking5DST-as-question-answering with turn-level domain-slot questions, augmented by dynamic or conversational knowledge-graph state representations20190.62
user-item graph insufficiency5knowledge graph-based recommendation with KG-enhanced user/item representations, propagation/attention, and contrastive-learning variants20220.68
no datasets for video kg extraction5video KG extraction task formulations with heterogeneous video knowledge graph datasets20200.68
traditional database limitations5native graph databases and RDF/graph stores, especially Neo4j graph storage, combined with graph-specific indexing, pruning, and query optimization20150.64
ned-based retrieval limitation5CLOCQ-style top-k KB item retrieval with KB-aware pruning20210.68
persistent dataset metadata and citation5JSON-LD knowledge-graph dataset descriptions paired with persistent DOI metadata records20260.68
Superseded (55)
ProblemPapersWhat HappenedConf
scholarly kg knowledge discovery34The standalone problem of doing knowledge discovery directly over scholarly KGs has not converged on a standard solution: most work is proof-of-concept, with small clusters around semantic graph parti0.72
incomplete knowledge bases21The underlying issue of incomplete knowledge bases has not been solved, but the problem label has largely been absorbed into knowledge graph completion, link prediction, entity linking, cross-lingual 0.74
triple-independent kg embeddings16The limitation of treating KG triples as independent has been mitigated by coupled tensor-matrix factorization, latent imputation, neighborhood attention, multi-hop reasoning, and rule/logic-enhanced 0.74
external knowledge underuse14Work on external knowledge underuse mainly tried KG/GNN-style augmentation, including syntax-knowledge GCNs, KG representations and embeddings, heterogeneous document graphs, entity-comparison network0.72
knowledge base completion13Knowledge base completion was not truly solved; embedding, classification, co-occurrence, multilingual, hyperbolic, and GNN-based methods improved benchmark link prediction but did not eliminate open-0.78
natural language question understanding11As a stand-alone KGQA problem, natural language question understanding has not been resolved by a single broadly adopted solution. The listed work is mostly complete KGQA systems, domain-specific know0.72
target-specific kg exploitation11Target-specific KG exploitation saw a small burst of work around 2020–2022 using target-specific subgraph distillation, knowledge-aware attention, KG embeddings, and entity-description augmentation, b0.68
pipeline error propagation10Pipeline error propagation has not been cleanly solved as a standalone entity-linking problem; instead, it has been mitigated through joint/end-to-end EL and KGQA models, zero-shot evaluation splits, 0.72
covid-19 information overload10COVID-19 information overload was addressed mainly through COVID-specific knowledge graph applications, Neo4j-backed graph stores, biomedical entity enrichment, and discovery frameworks, which helped 0.72
covid-19 drug repurposing9COVID-19 drug repurposing with knowledge graphs was mainly an urgent early-pandemic application: groups built or merged biomedical KGs, selected semantic predications, and applied probabilistic reason0.82
document-based scholarly communication8The original framing—moving scholarly communication from static documents to machine-actionable KG-based artifacts—had a small burst of work around 2019, mainly around ORKG-style scholarly KG infrastr0.74
entity-pair relation prediction8Entity-pair relation prediction had a small burst of work around 2018-2019 using latent path modeling, structural supervised methods, variational architectures such as DiVa, attention embeddings such 0.74
scalable semantic web kr8The exact framing of "scalable semantic web KR" shows little sustained activity: after one 2008 paper, it reappeared sparsely from 2018 to 2022, mostly as representation, benchmark, survey, visualizat0.74
specific disease kg reasoning gap8Early work tried to close disease-specific KG reasoning gaps through NER/relation-extraction benchmarks, SDKG construction pipelines/resources, curated schema frameworks, and embedding or tensor-based0.66
language-specific relation extraction limitation8As a standalone KG extraction problem, language-specific relation extraction limitation has not been convincingly solved; the proposed fixes are one-off supervised resources, transfer experiments, fea0.72