- Add the following models:
- Provide the chat history in the `context_aware_answer`.
- Experiment with agentic patterns:
- https://weaviate.io/blog/what-is-agentic-rag
- https://github.com/neural-maze/agentic_patterns
- https://www.youtube.com/watch?v=ApoDzZP8_ck
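As a starting point for the agentic-pattern experiments above, here is a minimal sketch of the reflection pattern (generate, critique, revise). The `llm` callable and the "OK" approval signal are assumptions for illustration; swap in a real chat client (e.g. Ollama) for actual use.

```python
# Minimal sketch of the "reflection" agentic pattern: a generator step
# followed by a critic step, looping until the critic approves.
# `llm` is a stand-in for any chat-completion call.

def reflect_loop(llm, task: str, max_rounds: int = 3) -> str:
    draft = llm(f"Answer the task:\n{task}")
    for _ in range(max_rounds):
        critique = llm(f"Critique this answer to '{task}':\n{draft}")
        if "OK" in critique:  # critic signals approval (assumed convention)
            break
        draft = llm(
            f"Task: {task}\nAnswer: {draft}\n"
            f"Critique: {critique}\nRevise the answer."
        )
    return draft

# Toy LLM stub so the loop runs without a model server.
def toy_llm(prompt: str) -> str:
    return "OK" if "Critique" in prompt else "42"

print(reflect_loop(toy_llm, "What is 6*7?"))  # -> 42
```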
- Function calling in LLMs: https://medium.com/@danushidk507/function-calling-in-llm-e537b286a4fd
- Ollama Tool support
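A rough sketch of what Ollama tool support could look like from Python. The tool schema follows the OpenAI-style JSON shape that the Ollama client's `chat(..., tools=[...])` accepts; the actual call is left commented out since it needs a running Ollama server, and the model name is an assumption.

```python
# Sketch of function calling with the Ollama Python client.

def get_weather(city: str) -> str:
    """Toy tool implementation."""
    return f"Sunny in {city}"

# OpenAI-style tool schema, as accepted by ollama.chat(tools=[...]).
weather_tool = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Get the current weather for a city",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}

# import ollama  # requires a running Ollama server
# resp = ollama.chat(model="llama3.1", messages=[...], tools=[weather_tool])
# for call in resp["message"].get("tool_calls", []):
#     if call["function"]["name"] == "get_weather":
#         print(get_weather(**call["function"]["arguments"]))

print(get_weather("Oulu"))  # -> Sunny in Oulu
```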
- Google Search with LLM:
- https://huggingface.co/blog/nand-tmp/google-search-with-llm
- https://blog.nextideatech.com/how-to-use-google-search-with-langchain-openai/
- https://medium.com/@reynxzz/rag-with-gemini-google-search-and-bq-vector-search-for-content-personalization-08fe7dab6b33
- https://newspaper.readthedocs.io/en/latest/
- https://github.com/AstraBert/PrAIvateSearch
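For the search-grounded answering idea above, a small sketch: pull page text with `newspaper` (newspaper3k) and assemble a grounding prompt. The prompt format is an assumption; the fetch step is kept in a function because it needs the package installed and network access.

```python
# Sketch: fetch a page with newspaper3k and build a grounding prompt.

def build_grounded_prompt(question: str, snippets: list[str]) -> str:
    # Number each snippet so the model can cite sources as [1], [2], ...
    context = "\n\n".join(f"[{i + 1}] {s}" for i, s in enumerate(snippets))
    return (
        "Answer the question using only the sources below.\n\n"
        f"{context}\n\nQuestion: {question}"
    )

def fetch_article_text(url: str) -> str:
    # Requires `pip install newspaper3k` and network access.
    from newspaper import Article
    article = Article(url)
    article.download()
    article.parse()
    return article.text

prompt = build_grounded_prompt("Who won?", ["Team A won 2-1."])
print(prompt.splitlines()[0])
```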
- System-level safety:
- https://huggingface.co/meta-llama/Llama-Guard-3-1B (GGUF: https://huggingface.co/tensorblock/Llama-Guard-3-1B-GGUF)
- Llama Guard 3-1B is a Llama-3.2-1B model fine-tuned for content safety classification.
- https://huggingface.co/meta-llama/Llama-Guard-3-8B
- Llama Guard 3-8B is a Llama-3.1-8B model fine-tuned for content safety classification.
- https://huggingface.co/meta-llama/Llama-Guard-3-11B-Vision
- Llama Guard 3 Vision is a Llama-3.2-11B model fine-tuned for content safety classification.
- Experiment with multimodal LLMs using Llama 3.2 Vision 11B (text + images in / text out):
  - The model is currently not supported by `llama.cpp` (ggerganov/llama.cpp#9643). It is supported by Ollama, so we need to use the Python API to create an additional client.
  - Llama 3.2 Vision 11B requires at least 8 GB of VRAM, and the 90B model requires at least 64 GB of VRAM.
  - Take also a look here: https://huggingface.co/unsloth
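A sketch of the additional Ollama client mentioned above: the Python client's chat messages take an `images` field for multimodal models. The model tag `llama3.2-vision` is an assumption; the network call is kept in a function since it needs a running server.

```python
# Sketch of a multimodal request through the Ollama Python client.

def vision_message(prompt: str, image_path: str) -> dict:
    # Ollama chat messages carry images via the `images` field.
    return {"role": "user", "content": prompt, "images": [image_path]}

def describe_image(image_path: str) -> str:
    import ollama  # requires `ollama pull llama3.2-vision` (~8 GB VRAM)
    resp = ollama.chat(
        model="llama3.2-vision",
        messages=[vision_message("Describe this image.", image_path)],
    )
    return resp["message"]["content"]

msg = vision_message("Describe this image.", "photo.png")
print(msg["images"])  # -> ['photo.png']
```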
- Explore long term memory:
- https://help.openai.com/en/articles/8590148-memory-faq
- https://ai.gopubby.com/long-term-memory-for-agentic-ai-systems-4ae9b37c6c0f
- https://github.com/mem0ai/mem0
- Explore also the structure of the repo https://github.com/mem0ai/mem0/tree/main/mem0 and the vector store implementation.
- https://github.com/letta-ai/letta
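To get a feel for the `add`/`search` surface that long-term-memory libraries like mem0 expose, here is a toy stand-in with naive token-overlap retrieval instead of a real vector store. Only the interface shape mirrors those libraries; everything else is illustrative.

```python
# Toy long-term memory: add facts, retrieve the most relevant ones.

class TinyMemory:
    def __init__(self) -> None:
        self._items: list[str] = []

    def add(self, text: str) -> None:
        self._items.append(text)

    def search(self, query: str, k: int = 3) -> list[str]:
        # Rank stored items by token overlap with the query
        # (a real implementation would use vector similarity).
        q = set(query.lower().split())
        scored = sorted(
            self._items,
            key=lambda t: len(q & set(t.lower().split())),
            reverse=True,
        )
        return scored[:k]

mem = TinyMemory()
mem.add("User prefers answers in Italian")
mem.add("User is allergic to peanuts")
print(mem.search("language the user prefers", k=1))
# -> ['User prefers answers in Italian']
```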
- Investigate Chroma batch querying: https://github.com/langchain-ai/langchain/blob/907c758d67764385828c8abad14a3e64cf44d05b/libs/community/langchain_community/vectorstores/chroma.py#L42
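On the batch-querying point: Chroma's `collection.query` already accepts a list of `query_texts` and returns results per query, so batching is mostly about chunking a large query list into reasonably sized calls. A sketch, with the Chroma call kept in a function since it needs `chromadb` installed:

```python
# Sketch of batch querying against a Chroma collection.

def chunked(items: list[str], size: int) -> list[list[str]]:
    """Split a list into consecutive chunks of at most `size` items."""
    return [items[i : i + size] for i in range(0, len(items), size)]

def batch_query(collection, queries: list[str],
                n_results: int = 4, batch_size: int = 32):
    results = []
    for batch in chunked(queries, batch_size):
        # One call per chunk; Chroma handles the per-query fan-out.
        results.append(collection.query(query_texts=batch, n_results=n_results))
    return results

print(chunked(["a", "b", "c"], 2))  # -> [['a', 'b'], ['c']]
```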
- Make a Docker container.
- Test Flash attention:
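One way to test Flash Attention with Hugging Face transformers is to pass `attn_implementation="flash_attention_2"` to `from_pretrained` (this needs the `flash-attn` package and a supported GPU); with llama.cpp/Ollama it is a runtime flag instead. A sketch, with the kwargs builder separated out so it can be checked without loading a model:

```python
# Sketch: toggling Flash Attention 2 when loading a transformers model.

def load_kwargs(use_flash: bool) -> dict:
    kwargs = {"torch_dtype": "auto", "device_map": "auto"}
    if use_flash:
        # Requires `pip install flash-attn` and an Ampere+ GPU.
        kwargs["attn_implementation"] = "flash_attention_2"
    return kwargs

def load_model(name: str, use_flash: bool = True):
    from transformers import AutoModelForCausalLM  # heavy import, kept local
    return AutoModelForCausalLM.from_pretrained(name, **load_kwargs(use_flash))

print(load_kwargs(True)["attn_implementation"])  # -> flash_attention_2
```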
- Investigate V-RAG (Vision RAG) https://github.com/Softlandia-Ltd/vision-is-all-you-need