FolioLLM - ETF Portfolio Construction Using LLM

Description

FolioLLM is a project for Stanford University's CS224n course, aimed at developing a domain-specific large language model (LLM) to assist investors and financial professionals in constructing optimal portfolios of ETFs. Leveraging advancements in LLM technology, FolioLLM is designed to interpret user preferences and market conditions, providing personalized and actionable investment advice.

Objectives

Primary Goal: Investigate the capability of a fine-tuned LLM to effectively understand user preferences and market conditions to offer personalized ETF portfolio suggestions.
Secondary Goals:
- Evaluate FolioLLM's performance across various metrics including financial knowledge, portfolio optimization, and personalized recommendations.
- Compare the efficacy of FolioLLM with existing baselines, such as traditional portfolio optimization methods and other financial LLMs like FinGPT.

Methodology

Data Sources: Utilizes a blend of macroeconomic data, descriptive data of investment funds and ETFs, and securities information from financial databases like Bloomberg.
Approach:
- Pre-training: Begins with a pre-trained model, enhancing its financial understanding through domain-specific data and texts.
- Fine-tuning: Adapts the model specifically for ETF and portfolio management using curated datasets.
- Retrieval-Augmented Generation: Enhances the model's ability to provide relevant responses based on user queries.
- Portfolio Optimization: Integrates both traditional and modern optimization techniques to formulate optimal ETF allocations.

Evaluation

Financial Metrics: Uses Sharpe Ratio and Information Ratio to assess the risk-adjusted performance and benchmark comparisons.
NLP Metrics: Evaluates the coherence and relevance of the model's responses to ensure quality and accuracy.

Ethical Considerations

Bias and Fairness: Ensures a balanced training dataset to minimize biases in recommendations.
Transparency and Explainability: Aims to enhance model transparency by implementing methods that clarify the decision-making process, supported by visualizations and detailed explanations.

Additional Information

The project also explores innovative approaches like Low-Rank Adaptation (LoRA) and Kolmogorov-Arnold Network (KAN) techniques to further enhance the model's performance. The integration of these methods aims to improve the model's understanding of complex interactions in ETF data and deliver more accurate and relevant financial advice.

Links to Project Outputs

Final Report: This document provides a comprehensive overview of the FolioLLM project, including the methodology, experiments, results, and analysis.
Project Poster: A visual summary of the project, highlighting key objectives, methods, and findings.

These documents offer detailed insights into the development and evaluation of FolioLLM, showcasing its potential in the financial domain.

Name		Name	Last commit message	Last commit date
Latest commit History 211 Commits
Presentation		Presentation
data		data
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
env-cpu.yml		env-cpu.yml
env-gpu.yml		env-gpu.yml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FolioLLM - ETF Portfolio Construction Using LLM

Description

Objectives

Methodology

Evaluation

Ethical Considerations

Additional Information

Links to Project Outputs

About

Releases

Packages

Contributors 3

Languages

License

olegroshka/FolioLLM

Folders and files

Latest commit

History

Repository files navigation

FolioLLM - ETF Portfolio Construction Using LLM

Description

Objectives

Methodology

Evaluation

Ethical Considerations

Additional Information

Links to Project Outputs

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages