An Innovative Hybrid Rag Solution for Boosting the Performance of an AI Assistant in Finance

code red

Retrieval Augmented Generation (RAG) is a key technique in various generative AI applications, such as chatbots or AI assistants, for retrieving information from a massive corpus and generating comprehensive answers. Traditional RAG approaches rely heavily on vector similarity search, which often result in incorrect, incomplete, or out-of-date answers. Graph RAG leverages knowledge graphs to retrieve contextually relevant nodes and paths for LLM reasoning, which can improve the accuracy and quality of the answers to a certain degree, but creating and maintaining knowledge graphs can be quite costly and time-consuming, and new errors may be introduced if there’s any inaccurate information in knowledge graphs.

We designed and deployed an innovative end-to-end Hybrid RAG solution to enhance our chatbot’s performance, which integrates key entities from our knowledge graph with vector similarity based RAG, plus other LLM based techniques, including RAG based Name Entity Recognition (NER), Date Standardization, Orchestration/Intent Classification, Key Intent prompting, Solr + Vector similarity search, and multi-step reranking. The new solution boosts the accuracy of the chatbot by almost 100%, from ~40% to ~ 80%. This solution also sets up a fundamental framework for the future agentic infrastructure. 


Speaker

Ken Zhang

Distinguished Engineer, global lead of Data Science and Engineering within Research Technology @Morgan Stanley

Ken Zhang is a Distinguished Engineer and the global lead of Data Science and Engineering within Research Technology at Morgan Stanley. He is responsible for driving AI/ML related initiatives for Research by leveraging cutting edge techniques such as machine learning, deep learning, natural language processing, and Large Language Models / Generative AI. With his team, Ken has created over 20 AI/ML products for Morgan Stanley Research and external clients.

Ken has been granted three patents in the last three years, and his work has been used in over 30 Morgan Stanley Research reports, with readership of over 170k.

Ken actively shares his AI/ML knowledge in various internal and external events, conferences, summits, and client facing meetings. His contributions and influence span across the Firm as a noted expert in the field of AI/ML, and passionate about his work and sharing knowledge.

He joined Morgan Stanley in 2017 and has 25 years of professional experience across multiple industries including banking, academia, and healthcare. He has a PhD in Mathematics from Rutgers University and is an adjunct professor for Financial Engineering at New York University.

Read more

Date

Tuesday Dec 16 / 03:40PM EST ( 50 minutes )

Location

Hosack Hall

Video

Video is not available

Slides

Slides are not available

Share