LLM Project | End to End Gen AI Project Using Langchain, OpenAI in Finance Domain
Updated: October 26, 2025
Summary
The video showcases the development of a news research tool for equity research analysis using Lang chain OpenAI and Streamlit. It explores the creation of a chatbot tailored for research analysts to extract information from news articles efficiently. The technical process involves building a knowledge base from articles, utilizing chatbot capabilities, text splitting methods, and vector databases for faster searches. The tool's end-to-end coding demonstration highlights its potential for data science and NLP applications in the industry, emphasizing long-term project planning for a comprehensive tool.
TABLE OF CONTENTS
- Building an End-to-End LLM Project 
- Storytelling 
- Creating a Chatbot for Equity Research 
- Research Analysts' Daily Job 
- Challenges in Using Chat GPT 
- Technical Architecture and Knowledge Base 
- Text Loaders 
- Text Splitting Techniques 
- Vector Databases and Phase Index 
- Introduction to Phase Library 
- Retrieval QA with Sources Chain 
- Chunk Combination Using Stuff Method 
- Map Reduce Method 
- Coding Demonstration - Setting up Libraries 
- Coding Demonstration - Loading Articles 
- Coding Demonstration - Creating Embeddings 
- Coding Demonstration - Saving Vector Index 
- Coding Demonstration - Creating Retrieval Chain 
- User Interface Development - UI Setup 
- User Interface Development - Loading and Splitting Data 
- User Interface Development - Building Embeddings 
- User Interface Development - Saving Vector Index 
- User Interface Development - Asking Sample Questions 
- Combining Answers and Summary 
- End-to-End Coding Demonstration 
- Finalizing Coding and User Interface 
- Real-Life Use Case and Industry Relevance 
- Long-Term Project Planning 
- Project Completion and Appreciation 
Building an End-to-End LLM Project
The video introduces the project of building a news research tool using Lang chain openai and streamlit for equity research analysis.
Storytelling
A storytelling segment about Peter Pandey, an equity research analyst, and Rocky Boys recruitment team exploring the need for a chatbot like chat GPT for investments.
Creating a Chatbot for Equity Research
The process of creating a chatbot for equity research analysts, like Peter Pandey, and a real-life example of mutual funds and individual stock research.
Research Analysts' Daily Job
Detailing the daily tasks of research analysts focusing on reading news articles, reports, and conducting research on specific stocks like Tata Motors and Reliance.
Challenges in Using Chat GPT
Discussing the limitations of using chat GPT for equity research, such as copy-pasting articles, lack of aggregate knowledge base, and word limit restrictions.
Technical Architecture and Knowledge Base
Explaining the technical architecture involving building a knowledge base from news articles and utilizing chatbot capabilities to retrieve answers based on the knowledge base.
Text Loaders
Introducing text loaders in Lang chain for loading data from text files and CSV files to analyze and process text data effectively.
Text Splitting Techniques
Explaining text splitting methods using character text splitter and recursive character text splitter to break down text data into manageable chunks for processing.
Vector Databases and Phase Index
Introduction to vector databases and the use of the 'phase' in-memory vector database for faster similarity searches based on encoded vectors.
Introduction to Phase Library
A quick demo of the Phase Library usage in the news research tool project.
Retrieval QA with Sources Chain
Explanation of the retrieval QA process including asking questions, retrieving relevant chunks, and forming LLm prompts.
Chunk Combination Using Stuff Method
Discussion on combining chunks to stay within LLm token limits using the stuff method.
Map Reduce Method
Explanation of the map reduce method for individual LLm calls per chunk in the retrieval QA process.
Coding Demonstration - Setting up Libraries
Importing necessary libraries and initializing LLm object for the retrieval QA process.
Coding Demonstration - Loading Articles
Loading articles using unstructured URL loader and splitting them into individual chunks.
Coding Demonstration - Creating Embeddings
Generating embeddings from the individual chunks using the open API embedding method.
Coding Demonstration - Saving Vector Index
Saving the Vector Index into a file for future use and loading it into memory.
Coding Demonstration - Creating Retrieval Chain
Creating a retrieval QA chain and using the map reduce method for LLm calls per chunk.
User Interface Development - UI Setup
Setting up the UI for the news research tool application including input fields and buttons.
User Interface Development - Loading and Splitting Data
Loading and splitting data using unstructured URL loader and recursive text splitter in the UI.
User Interface Development - Building Embeddings
Creating embeddings from the chunks and using open API embedding in the UI.
User Interface Development - Saving Vector Index
Saving the Vector Index and processing the URLs in the news research tool UI.
User Interface Development - Asking Sample Questions
Asking sample questions related to the articles loaded in the news research tool UI.
Combining Answers and Summary
Combining answers from multiple LLm calls and creating a summary chunk in the UI.
End-to-End Coding Demonstration
End-to-end coding demonstration of the news research tool project in the UI environment.
Finalizing Coding and User Interface
Finalizing the code for the news research tool project in the UI and testing question-answering functionality.
Real-Life Use Case and Industry Relevance
Discussion on the real-life use case of the project and its application in the industry as a data science or NLP tool.
Long-Term Project Planning
Long-term project planning involving data ingestion, web scraping, embedding creation, and UI development for a comprehensive tool.
Project Completion and Appreciation
Final remarks on completing the project and encouraging sharing and usage of the developed news research tool code.
FAQ
Q: What is the purpose of the project introduced in the video?
A: The project aims to build a news research tool using Lang chain openai and streamlit for equity research analysis.
Q: What are some tasks involved in the daily work of research analysts?
A: Research analysts focus on tasks such as reading news articles, reports, and conducting research on specific stocks like Tata Motors and Reliance.
Q: What are some limitations discussed about using chat GPT for equity research?
A: Limitations of using chat GPT for equity research include copy-pasting articles, lack of aggregate knowledge base, and word limit restrictions.
Q: What is the technical architecture involved in building the knowledge base for the news research tool?
A: The technical architecture involves building a knowledge base from news articles and utilizing chatbot capabilities to retrieve answers based on the knowledge base.
Q: What are some methods used for text splitting in the project?
A: Text splitting methods used include character text splitter and recursive character text splitter to break down text data into manageable chunks for processing.
Q: What is the Phase Library used for in the project?
A: The Phase Library is used as an in-memory vector database for faster similarity searches based on encoded vectors.
Q: How is the retrieval QA process explained?
A: The retrieval QA process includes asking questions, retrieving relevant chunks, forming LLm prompts, combining chunks to stay within LLm token limits, and using the map reduce method for individual LLm calls per chunk.
Q: What is the purpose of setting up the UI for the news research tool application?
A: The UI is set up for loading data from unstructured URLs, splitting data using text splitters, creating embeddings from chunks, saving the Vector Index, and processing URLs.
Q: What are the steps involved in the end-to-end coding demonstration of the news research tool project?
A: Steps involve loading articles using unstructured URL loader, creating embeddings from individual chunks using open API embedding, saving the Vector Index into a file, and using the map reduce method for LLm calls per chunk.
Q: What is the long-term project planning discussed for the news research tool project?
A: Long-term project planning involves data ingestion, web scraping, embedding creation, and UI development to create a comprehensive tool.
Get your own AI Agent Today
Thousands of businesses worldwide are using Chaindesk Generative
              AI platform.
Don't get left behind - start building your
              own custom AI chatbot now!
