<?xml version="1.0"?>
<article xlink="http://www.w3.org/1999/xlink" mml="http://www.w3.org/1998/Math/MathML" xsi="http://www.w3.org/2001/XMLSchema-instance" ali="http://www.niso.org/schemas/ali/1.0/" noNamespaceSchemaLocation="http://jats.nlm.nih.gov/publishing/1.1/xsd/JATS-journalpublishing1-mathml3.xsd" article-type="research-article" dtd-version="1.1" lang="en"><front><journal-meta><journal-id journal-id-type="publisher-id">isrdo-SRJSET</journal-id><journal-id journal-id-type="pmc">isrdo-SRJSET</journal-id><journal-id journal-id-type="nlm-ta">isrdo-SRJSET</journal-id><journal-title-group><journal-title>Scientific Research Journal of Science, Engineering and Technology</journal-title><abbrev-journal-title abbrev-type="publisher" pub-type="epub">SRJSET</abbrev-journal-title></journal-title-group><issn>2584-0584</issn><publisher><publisher-name>ISRDO</publisher-name><publisher-loc>Gujarat,India</publisher-loc></publisher></journal-meta><article-meta><article-id pub-id-type="publisher-id">M-10132</article-id><article-id pub-id-type="doi"/><article-categories><subj-group subj-group-type="categories"><subject>Information Technology</subject></subj-group></article-categories><title-group><article-title>Transforming Data Warehouses into Dynamic Knowledge Bases for RAG</article-title></title-group><contrib-group content-type="authors"><contrib id="186" contrib-type="author" corresp="yes"><name><given-names>Gerry Hosea</given-names></name><xref ref-type="aff" rid="aff-1">1</xref><aff id="aff-1"><label>0</label><institution>University of North Sumatra, Medan</institution><country>Indonesia</country></aff></contrib><contrib id="187" contrib-type="author" corresp="yes"><name><given-names>Hari Sudrajat</given-names></name><xref ref-type="aff" rid="aff-2">2</xref><aff id="aff-2"><label>1</label><institution>TRT Solution Limited</institution><country>Indonesia</country></aff></contrib></contrib-group><contrib-group content-type="editors"><contrib contrib-type="editor"/></contrib-group><pub-date pub-type="epub" data-type="pub" iso-8601-date="2024-07-30"><day>30</day><month>07</month><year iso-8601-date="2">2024</year></pub-date><volume>2</volume><elocation-id>V2-I1-2024</elocation-id><history><date date-type="received" iso-8601-date="2024-07-25"><day>25</day><month>07</month><year iso-8601-date="2024">2024</year></date><date date-type="revised" iso-8601-date="2024-07-28"><day>28</day><month>07</month><year iso-8601-date="2024"/></date><date date-type="accepted" iso-8601-date="2024-07-28"><day>28</day><month>07</month><year iso-8601-date="2024"/></date></history><permissions><copyright-statement>&#xA9;2024 Hari Sudrajat Year Corresponding Author</copyright-statement><copyright-year>2024</copyright-year><copyright-holder>Hari Sudrajat</copyright-holder><license href="https://creativecommons.org/licenses/by/4.0/"><license-p>This is an open access article distributed under the terms of the, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (ISRDO) and either DOI or URL of the article must be cited.<ext-link ext-link-type="uri" href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution License</ext-link></license-p></license></permissions><self-uri href="https://isrdo.org/journal/SRJSET/currentissue/transforming-data-warehouses-into-dynamic-knowledge-bases-for-rag"/><abstract><p>It is necessary to include data warehouses in contemporary data processing frameworks to provide comprehensive support for efficient decision-making procedures. This study aims to evaluate the exploitation of data warehouses as a knowledge base inside a Retrieval-Augmented Generation (RAG) model. This model combines retrieval mechanisms with generative models to improve information retrieval and response generation in artificial intelligence systems. Several necessary procedures are the subject of this research. These procedures include the preparation of data via the use of Databricks, the production of online tables, and the transformation of these tables into embeddings. Databricks offers a robust data engineering platform, enabling practical data input, cleaning, and structuring into Delta tables. This is followed by building online tables, making it easier to get data quickly. They are then converted into embeddings, which can capture the semantic substance of the data. These online tables are subsequently altered. The embeddings are kept in a repository, and the RAG model makes use of them to create replies consistent with the context in which they are being used. RAG models can efficiently harness enormous data repositories, as shown by the findings of this research, which reveal considerable increases in the speed at which data is retrieved and the precision of responses. By implementing best practices and using Databricks' capabilities, businesses can improve their AI-driven decision-making processes. This approach is advantageous for various purposes, including customer assistance, data analysis, and strategic planning. By doing more study in the future, it will be possible to investigate the applicability of this technique across a variety of domains and the incorporation of sophisticated generative models to enhance performance.</p></abstract><kwd-group kwd-group-type="author"><kwd>Data Warehouses</kwd><kwd> Knowledge Base</kwd><kwd> Retrieval-Augmented Generation</kwd><kwd> Databricks</kwd><kwd> Embeddings</kwd><kwd> AI Response Generation</kwd></kwd-group><funding-group><funding-statement>This research, including authorship and publication, did not receive any specific grant from funding agencies in the public, commercial, or non-profit sectors.</funding-statement></funding-group></article-meta></front><back><sec sec-type="data-availability"><title>Data Availability</title><p>Data sharing is not applicable to this article.</p></sec><sec sec-type="COI-statement"><title>Conflicts of Interest</title><p>All authors declare the absence of any conflicts of interest.</p></sec><sec sec-type="author-contributions"><title>Authors&#x2019; Contributions</title><p>All study-related tasks, from conception and design to data analysis and manuscript creation, were solely managed by the author.</p></sec><sec sec-type="funding-statement"><title>Funding Statement</title><p>This research, including authorship and publication, did not receive any specific grant from funding agencies in the public, commercial, or non-profit sectors.</p></sec><sec sec-type="software-information"><title>software-information</title><p>No specific software or tools were used in the research.</p></sec><ack><title>Acknowledgments</title><p>I appreciate the support and expertise of everyone who contributed to this research and manuscript writing, as well as the insightful comments from anonymous reviewers.</p></ack><ref-list content-type="authoryear"><ref id="1"><label>1</label><element-citation publication-type="journal"><p>-</p></element-citation></ref></ref-list></back></article>
