- GitHub Copilot: Code assistant developed by GitHub, based on OpenAI models
[assistant co-pilots]
[page]
- Cursor: Chat and edit in an editor built for pair-programming with AI
[assistant co-pilots]
[page]
- Metabob: Generative AI for refactoring and debugging code
[assistant co-pilots]
[page]
- Bard for Development:
[assistant co-pilots]
[tweet]
- The Ethics of GitHub Copilot: Are Code Generating Models like Copilot and ChatGPT ethical or useful? Viewed from the lens of a software developer.
[assistant co-pilots]
[post]
- Created Developer Assistant: Created Developer Assistant is a platform designed to assist developers in efficiently analyzing, understanding, and fixing code within GitHub repositories.
[assistant co-pilots]
[tweet]
- GPTutor: A tool created with ChatGPT to assist in explaining code
[assistant co-pilots]
[paper]
- The Impact of AI on Developer Productivity: Evidence from GitHub Copilot:
[assistant co-pilots]
[]
- Build a Full Stack Application Using ChatGPT: https://www.freecodecamp.org/news/build-a-full-stack-application-using-chatgpt/
[assistant co-pilots, others, prompt engineering]
[video]
- Alpaca: A Strong, Replicable Instruction-Following Model: We introduce Alpaca 7B, a model fine-tuned from the LLaMA 7B model on 52K instruction-following demonstrations.
[fine-tuning]
[paper]
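For context, a minimal sketch of the instruction-following prompt format used by the Alpaca project (reproduced from memory of the Stanford Alpaca repo's template; double-check the repo before relying on it):

```python
# Alpaca-style instruction prompt template (illustrative; verify against the repo).
ALPACA_TEMPLATE = """\
Below is an instruction that describes a task, paired with an input that provides \
further context. Write a response that appropriately completes the request.

### Instruction:
{instruction}

### Input:
{input}

### Response:
"""

# Example usage:
# prompt = ALPACA_TEMPLATE.format(instruction="Translate to Portuguese", input="Good morning")
```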
- LLaVA Lightning: a new way to train a lightweight, multimodal GPT-4-style model in just three hours for only $40.
[fine-tuning]
[post]
- Finetuned Language Models Are Zero-Shot Learners: This paper explores a simple method for improving the zero-shot learning abilities of language models. We show that instruction tuning—finetuning language models on a collection of datasets described via instructions—substantially improves zero-shot performance on unseen tasks.
[fine-tuning]
[paper]
- Learn How To Finetune The Redpajama LLM: An open-source LLaMA clone suitable for commercial development.
[fine-tuning]
[tweet]
- QLoRA 4-bit finetuning on a single GPU: (see the sketch below)
[fine-tuning]
[tweet]
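A minimal sketch of the QLoRA recipe (not the paper's training script): load the base model in 4-bit NF4 via bitsandbytes and attach LoRA adapters with peft. The model name, target modules, and hyperparameters are illustrative assumptions.

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

# 4-bit NF4 quantization with double quantization, as introduced by QLoRA.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = AutoModelForCausalLM.from_pretrained(
    "huggyllama/llama-7b", quantization_config=bnb_config, device_map="auto"
)
model = prepare_model_for_kbit_training(model)

# Only the low-rank adapter weights are trained; the 4-bit base stays frozen.
lora = LoraConfig(r=16, lora_alpha=32, lora_dropout=0.05,
                  target_modules=["q_proj", "v_proj"], task_type="CAUSAL_LM")
model = get_peft_model(model, lora)
model.print_trainable_parameters()
```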
- Improving LLMs with chain of thought: we can drastically improve LLMs reasoning performance using simple techniques that require no fine-tuning or task-specific verifiers
[llms plugins & tools]
[tweet]
- ReAct: Synergizing Reasoning and Acting in Language Models: In this paper, we explore the use of LLMs to generate both reasoning traces and task-specific actions in an interleaved manner
[llms plugins & tools]
[paper]
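A bare-bones sketch of the ReAct loop the paper describes (not the authors' code): the model emits Thought/Action/Observation lines, we parse the Action, run a tool, and feed the Observation back. The `llm` function and the tool registry are hypothetical stand-ins.

```python
import re

def llm(prompt: str) -> str:
    """Stand-in for a call to a language model."""
    raise NotImplementedError

TOOLS = {"search": lambda q: f"(search results for {q!r})"}

def react(question: str, max_steps: int = 5) -> str:
    prompt = (
        "Answer the question by interleaving Thought, Action and Observation lines.\n"
        "Actions look like: Action: search[<query>]. Finish with: Final Answer: <answer>.\n"
        f"Question: {question}\n"
    )
    for _ in range(max_steps):
        step = llm(prompt)
        prompt += step + "\n"
        if "Final Answer:" in step:
            return step.split("Final Answer:")[-1].strip()
        match = re.search(r"Action:\s*(\w+)\[(.*?)\]", step)
        if match:
            tool, arg = match.group(1), match.group(2)
            observation = TOOLS.get(tool, lambda a: "unknown tool")(arg)
            prompt += f"Observation: {observation}\n"
    return "(no answer within step budget)"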
- Transformers Agents: provides a natural language API on top of Hugging Face's transformers library
[llms plugins & tools]
[page]
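A quickstart sketch following the Transformers Agents announcement (requires transformers >= 4.29 and a Hugging Face token for the hosted endpoint); the StarCoder inference endpoint and the prompt are examples from the docs, not verified here.

```python
from transformers import HfAgent

# Use a hosted code model as the agent's planner.
agent = HfAgent("https://api-inference.huggingface.co/models/bigcode/starcoder")

# The agent turns the natural-language request into tool calls and runs them.
agent.run("Draw me a picture of rivers and lakes.")
```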
- FlowiseAI: Open-source visual UI tool to build your customized LLM flow using LangchainJS, written in Node.js TypeScript/JavaScript
[llms plugins & tools]
[page]
- e2b: Your own virtual developer
[llms plugins & tools]
[github]
- flowiseai - Build LLM Apps Easily:
[llms plugins & tools]
[]
- MiniChain: Aims to implement the core prompt chaining functionality in a tiny digestible library
[llms plugins & tools, prompt engineering]
[github]
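This is not MiniChain's actual API, just a bare-bones illustration of the prompt-chaining idea it implements: each step is a template filled with the previous step's output. The `llm` function is a hypothetical stand-in for a model call.

```python
def llm(prompt: str) -> str:
    """Stand-in for a call to a language model."""
    raise NotImplementedError

def chain(steps, initial_input: str) -> str:
    """Run a list of prompt templates in sequence, feeding each output forward."""
    value = initial_input
    for template in steps:
        value = llm(template.format(input=value))
    return value

summary_then_title = [
    "Summarize the following text in two sentences:\n{input}",
    "Write a short title for this summary:\n{input}",
]
# chain(summary_then_title, long_document)
```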
- Kor: This is a half-baked prototype that “helps” you extract structured data from text using large language models (LLMs)
[llms plugins & tools, prompt engineering]
[github]
- Promptify: Solve NLP problems with LLMs and easily generate different NLP task prompts for popular generative models like GPT, PaLM, and more with Promptify
[llms plugins & tools, prompt engineering]
[github]
- Pandas AI: Python library that integrates generative artificial intelligence capabilities into Pandas, making dataframes conversational
[others]
[github]
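A usage sketch based on the PandasAI quickstart at the time of writing; the API may have changed in later releases, and the OpenAI key is a placeholder.

```python
import pandas as pd
from pandasai import PandasAI
from pandasai.llm.openai import OpenAI

df = pd.DataFrame({"country": ["pt", "br"], "gdp": [250, 1900]})

# Ask questions about the dataframe in natural language.
pandas_ai = PandasAI(OpenAI(api_token="sk-..."))
pandas_ai.run(df, prompt="Which country has the highest gdp?")
```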
- Goodbye ChatGPT: These (New) AI Tools Will Leave You Speechless:
[others]
[post]
- New Programming Abstraction: Specify the structured schema that your app desires; wrap it in an LLM decorator to parse unstructured user input (prompting fades into the background, the LLM is now in "ambient mode"); use the object seamlessly in the rest of the Software 1.0 stack. (A sketch follows below.)
[others]
[tweet]
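A hedged sketch of the pattern described in the tweet: declare the schema your app wants, wrap it in a decorator that asks an LLM to parse free-form user input into that schema, then use the result as a plain object. `call_llm` and the `Order` schema are hypothetical.

```python
import json
from dataclasses import dataclass, fields

def call_llm(prompt: str) -> str:
    """Stand-in for a model call that returns a JSON string."""
    raise NotImplementedError

def llm_parsed(schema):
    """Decorator: the wrapped function receives a `schema` instance parsed from raw text."""
    def wrap(fn):
        def inner(user_text: str):
            names = [f.name for f in fields(schema)]
            raw = call_llm(f"Extract {names} as JSON from: {user_text}")
            return fn(schema(**json.loads(raw)))
        return inner
    return wrap

@dataclass
class Order:
    item: str
    quantity: int

@llm_parsed(Order)
def place_order(order: Order):
    print(f"Ordering {order.quantity} x {order.item}")

# place_order("hey can I get three lattes please")
```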
- AutoPR: Fix issues with AI-generated pull requests, powered by ChatGPT
[others]
[github]
- Introducing smol-developer: ▸ Human-centric, coherent whole program synthesis ▸ your own junior developer ▸ develop, debug, decompile ▸ open source: https://t.co/HNveJvgo7M ▸ 200 LOC, half english
[others]
[tweet]
- Context-Free Grammar Parsing with LLMs: Last week, I open-sourced a method to coerce LLMs into only generating a specific structure for a given regex pattern (https://matt-rickard.com/rellm on Github). The library has proven extremely useful for a handful of tasks I’ve been doing with LLMs (everything from categorization to simple agent automation).
[others]
[post]
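A simplified sketch of the constrained-generation idea behind rellm (not the library's code): at each step, only keep candidate tokens whose addition still partially matches the target regex, using the third-party `regex` module's partial matching. The `score_tokens` function is a hypothetical stand-in for the model's token scores.

```python
import regex  # third-party "regex" module; supports partial matches

def constrained_decode(score_tokens, vocab, pattern, max_len=20):
    """Greedy decoding that only emits text consistent with `pattern`."""
    out = ""
    while len(out) < max_len:
        # Keep tokens whose addition can still be extended into a full match.
        allowed = [t for t in vocab
                   if regex.fullmatch(pattern, out + t, partial=True)]
        if not allowed:
            break
        out += max(allowed, key=score_tokens)  # pick the model's favorite allowed token
        if regex.fullmatch(pattern, out):      # complete match -> stop
            break
    return out
```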
- Teaching Large Language Models to Self-Debug: In this work, we propose Self-Debugging, which teaches a large language model to debug its predicted program via few-shot demonstrations.
[others]
[paper]
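A generic self-debugging loop in the spirit of the paper (not the authors' code): generate a program, run it against a test, and feed any error message back to the model for another attempt. The `llm` function is a hypothetical stand-in.

```python
def llm(prompt: str) -> str:
    """Stand-in for a call to a language model."""
    raise NotImplementedError

def self_debug(task: str, test_code: str, max_rounds: int = 3) -> str:
    prompt = f"Write a Python function for this task:\n{task}\n"
    program = llm(prompt)
    for _ in range(max_rounds):
        try:
            exec(program + "\n" + test_code, {})  # run the candidate and its test
            return program                        # tests passed
        except Exception as err:
            prompt += (f"\nYour program:\n{program}\nfailed with: {err!r}.\n"
                       "Explain the bug and return a corrected program.")
            program = llm(prompt)
    return program
```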
- k8sgpt: Tool for scanning your Kubernetes clusters, diagnosing and triaging issues in simple English.
[others]
[page]
- A guide to prompting AI (for what it is worth): A brief guide to prompting
[prompt engineering]
[post]
- Chain-of-Thought Prompting Elicits Reasoning in Large Language Models: chain of thought (a series of intermediate reasoning steps) significantly improves the ability of large language models to perform complex reasoning.
[prompt engineering]
[paper]
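A few-shot chain-of-thought prompt in the style of the paper: the exemplar answer spells out intermediate reasoning steps before the final answer, which nudges the model to do the same on the new question.

```python
cot_prompt = """\
Q: Roger has 5 tennis balls. He buys 2 more cans of tennis balls. Each can has 3 \
tennis balls. How many tennis balls does he have now?
A: Roger started with 5 balls. 2 cans of 3 tennis balls each is 6 tennis balls. \
5 + 6 = 11. The answer is 11.

Q: The cafeteria had 23 apples. If they used 20 to make lunch and bought 6 more, \
how many apples do they have?
A:"""
# Send `cot_prompt` to any LLM; the worked exemplar elicits step-by-step reasoning.
```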
- Masterclass: AI-driven Development for Programmers: By Fireship. Learn how to build a React app with TypeScript using AI prompting and other tricks to speed up development.
[prompt engineering]
[video]
- Twitter Algo Explainer -- by Matt Shumer: I used GPT-4-32K (+ other models) to analyze hundreds of files and explain how https://twitter.com/Twitter's open-source algorithm works
[prompt engineering]
[tweet]
- Automatic Prompt Engineering Techniques: automated prompt engineering via continuous updates (e.g., via SGD) to a prompt’s embedding
[prompt engineering]
[tweet]
- Advanced Prompt Engineering: Covers three important topics in prompt engineering: chain of thought prompting, integration with external databases, and automatic prompt engineering approaches.
[prompt engineering]
[tweet]
- Prompt Engineering for ChatGPT: A Quick Guide to Techniques, Tips, and Best Practices: Let Genie (ChatGPT) teach you how to make wise Wishes (Prompts)
[prompt engineering]
[paper]
- Prompt Engineering Guide: 🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
[prompt engineering]
[page]
- The Power of Scale for Parameter-Efficient Prompt Tuning: proposes a mechanism for learning soft prompts through backpropagation.
[prompt engineering]
[paper]
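A minimal PyTorch sketch of the soft-prompt idea: a small matrix of trainable embeddings is prepended to the frozen model's input embeddings and learned by backpropagation (prefix-tuning, listed next, is similar but inserts trainable vectors at every layer). Shapes and hyperparameters are illustrative.

```python
import torch
import torch.nn as nn

class SoftPrompt(nn.Module):
    """Trainable prompt embeddings prepended to a frozen model's input embeddings."""
    def __init__(self, n_tokens: int, d_model: int):
        super().__init__()
        self.prompt = nn.Parameter(torch.randn(n_tokens, d_model) * 0.02)

    def forward(self, input_embeds: torch.Tensor) -> torch.Tensor:
        # input_embeds: (batch, seq_len, d_model)
        batch = input_embeds.size(0)
        prompt = self.prompt.unsqueeze(0).expand(batch, -1, -1)
        return torch.cat([prompt, input_embeds], dim=1)

# Usage sketch: freeze the backbone, train only the soft prompt.
# for p in model.parameters(): p.requires_grad_(False)
# soft_prompt = SoftPrompt(n_tokens=20, d_model=768)
# optimizer = torch.optim.AdamW(soft_prompt.parameters(), lr=3e-2)
```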
- Prefix-Tuning: Optimizing Continuous Prompts for Generation: a lightweight alternative to fine-tuning that prepends a trainable continuous prefix for NLG tasks.
[prompt engineering]
[paper]
- Large Language Models Are Human-Level Prompt Engineers: APE for automatic instruction generation and selection, achieving better task performance than previous language models and comparable performance to human-generated prompts on most tasks.
[prompt engineering]
[paper]
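A simplified APE-style loop (not the authors' implementation): ask an LLM to propose candidate instructions from a few input/output demonstrations, then keep the candidate that scores best on a held-out set. `llm` and `accuracy` are hypothetical stand-ins.

```python
def llm(prompt: str) -> str:
    """Stand-in for a call to a language model."""
    raise NotImplementedError

def accuracy(instruction: str, dev_set) -> float:
    """Run the instruction on dev examples and return the fraction answered correctly."""
    raise NotImplementedError

def ape(demos, dev_set, n_candidates: int = 8) -> str:
    demo_text = "\n".join(f"Input: {x}\nOutput: {y}" for x, y in demos)
    candidates = [
        llm(f"Here are input/output pairs:\n{demo_text}\nThe instruction was:")
        for _ in range(n_candidates)
    ]
    return max(candidates, key=lambda ins: accuracy(ins, dev_set))
```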
- AutoPrompt: Eliciting Knowledge from Language Models with Automatically Generated Prompts: proposes an approach to automatically create prompts for a diverse set of tasks based on gradient-guided search.
[prompt engineering]
[paper]
- ChatGPT Prompt Engineering for Developers: • Learn prompt engineering best practices for application development • Discover new ways to use LLMs, including how to build your own custom chatbot • Gain hands-on practice writing and iterating on prompts yourself using the OpenAI API
[prompt engineering]
[video]
- Chat with any Github Repo: 1) Scrapes @github repo 🤖 2) 🗂️ Embeds codebase using @LangChainAI, stores embeddings in @activeloopai 3) Chat with the codebase w/ @streamlit 👨💻
[semantic search & Q&A]
[tweet]
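A library-free sketch of the pipeline in the tweet (scrape → embed → retrieve → chat); the tweet uses LangChain, Activeloop Deep Lake and Streamlit, while `embed` and `llm` here are hypothetical stand-ins for any embedding model and chat model.

```python
import numpy as np

def embed(text: str) -> np.ndarray:
    raise NotImplementedError

def llm(prompt: str) -> str:
    raise NotImplementedError

def build_index(files: dict):
    """files: path -> source text. Returns (paths, chunk texts, embedding matrix)."""
    paths, chunks = zip(*files.items())
    return paths, chunks, np.stack([embed(c) for c in chunks])

def ask(question: str, index, top_k: int = 3) -> str:
    paths, chunks, matrix = index
    q = embed(question)
    # Cosine similarity between the question and every code chunk.
    scores = matrix @ q / (np.linalg.norm(matrix, axis=1) * np.linalg.norm(q))
    context = "\n\n".join(chunks[i] for i in np.argsort(-scores)[:top_k])
    return llm(f"Answer using this code:\n{context}\n\nQuestion: {question}")
```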
- I built a ChatGPT app that lets you chat with any codebase:
[semantic search & Q&A]
[tweet]
- Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasks: a novel framework called Search-in-the-Chain (SearChain) to improve the accuracy, credibility and traceability of LLM-generated content for multi-hop question answering
[semantic search & Q&A]
[paper]
- Sabiá: Portuguese Large Language Models: In this paper, we add to the growing body of evidence that challenges this practice, demonstrating that monolingual pretraining on the target language significantly improves models already extensively trained on diverse corpora.
[training]
[paper]
- "Google ""We Have No Moat, And Neither Does OpenAI""": Leaked Internal Google Document Claims Open Source AI Will Outcompete Google and OpenAI
[training]
[post]
- Introducing StarCoder: StarCoder is a 15B LLM for code with 8k context and trained only on permissive data in 80+ programming languages.
[training]
[tweet]
- MosaicML MPT-7B: MPT-7B is a transformer trained from scratch on 1T tokens of text and code. It is open source, available for commercial use, and matches the quality of LLaMA-7B.
[training]
[page]
- Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes: However, finetuning and distillation require large amounts of training data to achieve comparable performance to LLMs. We introduce Distilling step-by-step, a new mechanism that (a) trains smaller models that outperform LLMs, and (b) achieves this with less training data than is needed by standard finetuning or distillation.
[training]
[paper]
- How to run LLaMA 13B with a 6GB card:
[training]
[tweet]
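The tweet's exact recipe isn't reproduced here; as one common way to fit a 13B model next to a small GPU, the sketch below loads 8-bit weights and lets accelerate spill layers onto CPU RAM (requires transformers, accelerate and bitsandbytes; the model name is an example).

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained(
    "huggyllama/llama-13b",
    load_in_8bit=True,   # quantize weights to 8-bit
    device_map="auto",   # offload layers that don't fit onto CPU RAM
)
tokenizer = AutoTokenizer.from_pretrained("huggyllama/llama-13b")
```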
- Formal Algorithms for Transformers: This document aims to be a self-contained, mathematically precise overview of transformer architectures and algorithms (not results). It covers what transformers are, how they are trained, what they are used for, their key architectural components, and a preview of the most prominent models. The reader is assumed to be familiar with basic ML terminology and simpler neural network architectures such as MLPs.
[training]
[]
- CodeT5+: Open Code Large Language Models for Code Understanding and Generation:
[training]
[paper]
- The Stack: The Stack contains over 6TB of permissively-licensed source code files covering 358 programming languages.
[training]
[github]
- Evaluating Large Language Models Trained on Code: We introduce Codex, a GPT language model fine-tuned on publicly available code from GitHub, and study its Python code-writing capabilities. A distinct production version of Codex powers GitHub Copilot. On HumanEval, a new evaluation set we release to measure functional correctness for synthesizing programs from docstrings, our model solves 28.8% of the problems, while GPT-3 solves 0% and GPT-J solves 11.4%. Furthermore, we find that repeated sampling from the model is a surprisingly effective strategy for producing working solutions to difficult prompts. Using this method, we solve 70.2% of our problems with 100 samples per problem. Careful investigation of our model reveals its limitations, including difficulty with docstrings describing long chains of operations and with binding operations to variables. Finally, we discuss the potential broader impacts of deploying powerful code generation technologies, covering safety, security, and economics.
[training]
[paper]
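The paper evaluates with the unbiased pass@k estimator: with n samples per task and c of them correct, pass@k = 1 - C(n-c, k) / C(n, k). Below is the numerically stable form the paper gives.

```python
import numpy as np

def pass_at_k(n: int, c: int, k: int) -> float:
    """Probability that at least one of k samples drawn from n passes, given c correct."""
    if n - c < k:
        return 1.0
    return 1.0 - np.prod(1.0 - k / np.arange(n - c + 1, n + 1))

# Example: 100 samples per problem, 20 correct -> pass@10 for that problem.
# pass_at_k(100, 20, 10)
```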
- How to train your own LLMs:
[training]
[post]
- Training language models to follow instructions with human feedback:
[training]
[]
- HumanEval: Hand-Written Evaluation Set:
[training]
[]