Retrieval Augmented Generation (RAG) is a key technique in various generative AI applications, such as chatbots or AI assistants, for retrieving information from a massive corpus and generating comprehensive answers. Traditional RAG approaches rely heavily on vector similarity search, which often result in incorrect, incomplete, or out-of-date answers. Graph RAG leverages knowledge graphs to retrieve contextually relevant nodes and paths for LLM reasoning, which can improve the accuracy and quality of the answers to a certain degree, but creating and maintaining knowledge graphs can be quite costly and time-consuming, and new errors may be introduced if there’s any inaccurate information in knowledge graphs.

We designed and deployed an innovative end-to-end Hybrid RAG solution to enhance our chatbot’s performance, which integrates key entities from our knowledge graph with vector similarity based RAG, plus other LLM based techniques, including RAG based Name Entity Recognition (NER), Date Standardization, Orchestration/Intent Classification, Key Intent prompting, Solr + Vector similarity search, and multi-step reranking. The new solution boosts the accuracy of the chatbot by almost 100%, from ~40% to ~ 80%. This solution also sets up a fundamental framework for the future agentic infrastructure.

Ken Zhang is a Distinguished Engineer and the global lead of Data Science and Engineering within Research Technology at Morgan Stanley. He is responsible for driving AI/ML related initiatives for Research by leveraging cutting edge techniques such as machine learning, deep learning, natural language processing, and Large Language Models / Generative AI. With his team, Ken has created over 20 AI/ML products for Morgan Stanley Research and external clients.

Ken has been granted three patents in the last three years, and his work has been used in over 30 Morgan Stanley Research reports, with readership of over 170k.

Ken actively shares his AI/ML knowledge in various internal and external events, conferences, summits, and client facing meetings. His contributions and influence span across the Firm as a noted expert in the field of AI/ML, and passionate about his work and sharing knowledge.

He joined Morgan Stanley in 2017 and has 25 years of professional experience across multiple industries including banking, academia, and healthcare. He has a PhD in Mathematics from Rutgers University and is an adjunct professor for Financial Engineering at New York University.

An Innovative Hybrid Rag Solution for Boosting the Performance of an AI Assistant in Finance

Speaker

Ken Zhang

Speaker

Ken Zhang

Date

Location

Video

Slides

Share

InfoQ Resources

Social Media Links

Conference

Helpful Resources

InfoQ & QCon Events