A digital display showcasing a graphrag over a knowledge graph with interconnected nodes and data visualizations in a modern office environment.

Graphwise and its partners have done quite a bit of deep thinking about how to justify your investment in the most trustworthy form of AI. In this post, I’ll highlight a few of the main points and make the case for how important a semantics technology vantage point is to data quality and overall enterprise AI success.

Want to avoid hallucinations? Then you’ll want to make sure the data + content + knowledge foundation you’ve built internally to feed your AI is logically connected and fully disambiguated. When it comes to transactions and regulatory compliance in particular, there is no alternative reality. That’s why your data sourcing should be holistic and deterministic. When it comes to data, you’ll need certainty.

Prioritize Interconnected, Hybrid AI above Monolithic, Statistical Machine Learning

Generative AI has triggered a groundswell of interest in AI, but early adopters have learned that large language models hallucinate often–they assert information fabricated as factual when it’s not.

Want to avoid hallucinations? Then you’ll want to make sure the data + content + knowledge foundation you’ve built internally to feed your AI  is logically connected and fully disambiguating. When it comes to transactions and regulatory compliance in particular, there is no alternative reality. That’s why your data sourcing should be holistic and deterministic. When it comes to data, you’ll need certainty.  

Determinism is actually a benefit built into what academics call “neurosymbolic” AI (NSAI), the dominant form of hybrid AI. NSAI blends the probabilistic approach of statistical neural networks—essential for the dynamic chat and agentic interface large language models (LLMs) are known for—with deterministic retrieval from databases. Graph databases and semantic, contextualizing metadata–the symbolic side of the AI–make a trustworthy data foundation across all data, content and knowledge both possible and scalable. 

Deterministic and probabilistic methods both have their place. Both belong together in a single, unified AI system because they’re symbiotic, with the probabilistic side able to guess the next word or predict a failure, and the deterministic side retrieving facts and enforcing rules that demand certainty. The right metadata provides the glue to articulate and unify business contexts in such a system.

The Symbiotic Nature of a Hybrid AI System

GraphRAG as Graphwise defines it—semantic graph database-oriented retrieval augmented generation—is by definition an NSAI approach. It’s been the most effective means of hybrid AI, for good reason.

What is Retrieval Augmented Generation?

A diagram illustrating the process of graph Retrieval Augmented Generation (graphRAG), showing interactions between a User, a Language Learning Model (LLM), and a Data Source, highlighting the flow of information and context used to enhance model outputs.

“Successful GenAI implementations,” says Sumit Pal, Strategic Technology Director   at Graphwise, “are fueled with necessary domain knowledge and context, and knowledge graphs provide the ideal platform to connect disparate data across systems semanIf your enterprise is like most, there will be multiple AI initiatives afoot. 

Want to make sure you’re not wasting budget on frivolous AI ventures? Then eliminate the monolithic, ill-informed approaches that merely use statistical machine learning and vector-based RAG. Your budget should be spent on a method that delivers trustworthy AI. The alternatives will hallucinate and, by doing so, will waste your employees’ time.tically.”

Keep in mind that almost all of the thousands of AI startups who are venture funded aren’t worth the investment.

Make your Data Interconnections Machine- and Human-Readable across all Formats

Growth VP Andreas Blumauer of Graphwise makes a great point about the value of bringing different kinds of data together, context by context, and then harnessing the power of the disambiguation and interrelatedness across contexts. “It’s a very heterogeneous landscape,” he says. “Databases are often only consumable without error with background knowledge.” As he points out, Microsoft uses Graphwise for its own helpdesk portal.

Create More Relevant KPIs with the Help of Agentic AI’s Dynamic Interfaces and a Transformed Data Infrastructure

As airline manufacturing industry knowledge and enterprise architect Mara Inglezakis Owens points out, generative AI interface innovation opens new opportunities for knowledge graph builders to “interrogate” and scrutinize what’s under the hood in enterprise systems. 

In a similar vein, information architect Helyx Horwitz at Graphwise partner Enterprise Knowledge identified whole categories of key performance indicators (KPIs) that can be developed to assess the value of knowledge-based systems. 

Consider the notion of system scale and when leadership can demand larger scale (number of users or systems integrated)versus when they’d seek to reduce scale  (deduplication for cost or carbon footprint reduction, for example). 

In general, enterprises should revisit utility and usability metrics to be able to obtain a clear view of how semantic layer graphRAG transformation impacts AI system performance. 

A table illustrating KPIs for the semantic layer with categories for objectives, scale, content, and time, highlighting metrics for improving onboarding and internal learning efficiency.

Helyx Horwitz,“Measuring the Value of Your Semantic Layer,” https://enterprise-knowledge.com/measuring-the-value-of-your-semantic-layer-kpis-for-taxonomies-ontologies-and-knowledge-graphs, February 13, 2024.

Harnessing the power of agents and a true semantic layer togethe

Semantics in a transformed system implies harmony and cohesiveness across departmental and dataset boundaries, which leads to insights that would have been previously unachievable. Now consider adding intelligent agents to the mix to extend the reach of your transformation, making it possible to tackle system upgrades that weren’t previously feasible. 

In a 2022 blogpost, Dave McComb, President of Graphwise partner SemanticArts, pondered the dependencies of legacy systems and the “risk of violating some of those dependencies.” For decades now, enterprises have been fearful of innovation initiatives that seek to retire legacy subsystems. 

However, in today’s world, agents assuming a robust semantic layer and GraphRAG-oriented AI system could make your dependency identification and remediation efforts much more comprehensive. You could identify lower-risk subsystems for replacement and surface helpful ancillary techniques that in previous years might not have occurred to you. 

In that sense, graphRAG makes it possible, not only to start building out advanced AI capabilities, but also to reduce your legacy footprint, a piece at a time.

4 responses to “How to Boost the ROI of your AI Investment  with GraphRAG”

  1. You are stretching the limits of my understanding. I bit about controlling and managing the content gathered for an enterprise LLM makes sense. But what about those of us who are not able to manage the content of the LLM we we use with ChatGPT, Claud and others. Do we just need to put up with the hallucinations and analyze results for clarity and accuracy or is there another alternative? I recent was working on a piece where much of the content came from published books. If I am correct, ChatGPT can not really access the books directly but have to determine the book content from second hand sources. So the responses I get may be good but also may not be complete or totally accurate. Am I getting this right?

    1. These are good questions, Terry.

      The idea with semantic graph RAG from an enterprise point of view is to use a language model (large, medium or small) as a front-end interpreter and agent, but to retrieve answers from internal sources using semantic search (generated SPARQL queries to a deterministic graph database) or similarity search (Lucene-like query to a vector database like Elasticsearch).

      Graphwise streamed a webinar yesterday that explains its Talk to Your Graph (TTYG) product. At about 27 minutes in, Semantic Partners (a boutique consultancy and Graphwise partner) provides a demo of a KM use case. You can see how the end user makes a choice about the method based on the kind of question posed. Closed-end questions, for example, would use a SPARQL query to GraphDB. If the question is more open ended and synonyms are important, the user can choose similarity search.

      https://graphwise.ai/event/webinar-talk-to-your-graph/

      It’s still early days with graph RAG, and the trusted computing capabilities inherent in standard semantic graphs need to be built up and fleshed out. But the use of the LM in this context is mainly (as I understand it) as a UX coupled with a direct database retrieval capability that’s inherently more trustworthy.

  2. Love this comment
    “Keep in mind that almost all of the thousands of AI startups who are venture funded aren’t worth the investment.”
    Will have to bring this up with my Sand Hill Road friends (if I have any left)

    1. I saw a report from Traxn (a research company that uses automated data mining methods to generate statistics) that estimated there were 8,600+ venture-funded AI startups globally. I’m guessing there are <86 funded startups that are NeSy (neurosymbolic) and semantic standards based in their use of knowledge graphs.

Leave a Reply

Trending

Discover more from The GraphRAG Curator

Subscribe now to keep reading and get access to the full archive.

Continue reading