Ground Truth, Monitoring, and Database-Oriented AI: A Review of Labelling, Hallucination, Feature Stores, Code Retrieval, Cybersecurity, and Analytical Databases

Paula  Martin; Javier  Ortega; Sofia  Campos

doi:10.63646/datamind.2025.030406

Open Access PDF

Received 2025-07-16

Accepted 2025-11-23

Published 2025-12-30

Paula Martin

Department of Computer Science, University of Oviedo, Oviedo 33003, Spain

Javier Ortega*

Department of Computer Architecture and Technology, University of Girona, Girona 17003, Spain
javier.ortega@udg.edu

Sofia Campos

Department of Information Systems and Technologies, University of Castilla-La Mancha, Ciudad Real 13071, Spain

DOI: https://doi.org/10.63646/datamind.2025.030406

Abstract

DATAMIND's 2025 volume centers on knowledge integrity in database-oriented artificial intelligence. This review synthesizes all DATAMIND articles published in 2025 and links them to eighty DOI-bearing references on hallucination measurement, retrieval, feature stores, code intelligence, cybersecurity analytics, data labelling, privacy, workload management, and analytical database benchmarking. A structured evidence-mapping design codes each article by evidence source, data asset, failure mode, governance intervention, and downstream user. The resulting analysis shows that the 2025 corpus is unified by a concern with whether AI outputs can be traced to reliable data, validated against external facts, reviewed by humans, and used in operational or policy settings. Hallucination metrics make factuality measurable but context-dependent; feature stores preserve training-serving consistency; workload analytics exposes the computational conditions of LLM serving; code search reveals benchmark realism problems; cybersecurity analytics converts alerts into evidence; trade database benchmarking shows that database selection is methodological; and labelling research returns the field to ground-truth production. The article contributes two grayscale figures and three tables, including a same-year corpus summary, an integrity rubric, and a research agenda. It concludes that DATAMIND is moving from database-centered AI toward evidence-centered computational discovery.

Keywords: Knowledge integrity; hallucination; feature stores; neural code search; cybersecurity analytics; data labelling; database benchmarking

This work is licensed under a Creative Commons Attribution 4.0 International License.

How to Cite

Martin, P. ., Ortega, J. ., & Campos, S. (2025). Ground Truth, Monitoring, and Database-Oriented AI: A Review of Labelling, Hallucination, Feature Stores, Code Retrieval, Cybersecurity, and Analytical Databases. DATAMIND, 3(4), 86-102. https://doi.org/10.63646/datamind.2025.030406

Download Citation

Article sidebar

Main article

Abstract

Article details

How to Cite