A Comprehensive Overview of Large Language Models.pdf
A Comprehensive Survey on Transfer Learning.pdf
A Decade Survey of Transfer Learning (2010–2020).pdf
A General Language Assistant as aLaboratory for ALignment.pdf
A Survey of Large Language Models.pdf
A Survey on Transfer Learning.pdf
An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language model.pdf
Context Tuning for Retrieval Augmented Generation.pdf
FAIRTUNE- OPTIMIZING PARAMETER EFFICIENT FINE FOR FAIRNESS IN MEDICAL IMAGING.pdf
Failure Modes of Learning Reward Models.pdf
Fine-Tuning Language Models from Human Preferences.pdf
Fine-Tuning Pretrained Language Models- Weight Initializatioins, Data Orders and Early Stopping.pdf
Fine-tuning Language Models for Factuality.pdf
Fine-tuning language models to find agreement among humans with diverse preferences.pdf
IMPROVING LARGE LANGUAGE MODEL FINE-TUNING FOR SOLVING MATH PROBLEMS.pdf
INSTRUCTION TUNING LARGE LANGUAGE MODEL ON REGION OF INTEREST.pdf
INTRINSIC DIMENSIONALITY EXPLAINS THE EFFECTIVENESS OF LANGUAGE MODEL FINE-TUNING.pdf
Instruction Tuning for Large Language Models A Survey.pdf
LORA-- LOW-RANK ADAPTATION OF LARGE LANGUAGE MODELS.pdf
Learning Reward for Physical Skills using Large Language Model.pdf
Learning to summarize from human feedback.pdf
Learning_the_Reward_Model_of_Dialogue_POMDPs_from_data.pdf
On the Effectiveness of Parameter-Efficient Fine-Tuning.pdf
PAIRWISE PROXIMAL POLICY OPTIMIZATION-- Harnessing relative feedback for LLM alignment.pdf
Parameter-Efficient Fine-Tuning Methods for Pretrained Language Model.pdf
Parameter-Efficient Transfer Learning for NLP.pdf
Prefix-Tuning- Optimizing Continuous Prompts for Generation.pdf
Proximal Policy Optimization Algorithms.pdf
REWARD DESIGN WITH LANGUAGE MODELS.pdf
Retrieval-Augmented Generation for Knowledge Intensive NLP Tasks.pdf
Retrieval-Augmented Generation for Large Language Models-- A Survey.pdf
Revisiting Parameter-Efficient Tuning-Are We Really There Yet.pdf
STANDING ON THE SHOULDERS OF GIANT FROZEN LANGUAGE MODELS.pdf
Scalable agent alignment via reward modeling-- A research direction.pdf
Scaling Laws for Reward Model Overoptimization.pdf
Scaling laws for LLMs.pdf
Secrets of RLHF in Large Language Models.pdf
Self-Refined Large Language Model as Automated Reward Function Designer for Deep Reinforcement Learning in Robotics.pdf
Small Pre-trained Language Models Can be Fine-tuned as Large Models via Overparameterization.pdf
Talking About Large Language Models.pdf
The Power of Scale for Parameter-Efficient Prompt Tuning.pdf
Towards Better Parameter-Efficient Fine-Tuning for Large Language Models.pdf
Training a Helpful and Harmless Assistant with RLHF.pdf
Transfer Learning Toolkit--Primers and Benchmarks.pdf
Truly Proximal Policy Optimization.pdf
Trust Region Policy Optimization.pdf
Tuning Large language model for End-to-end Speech Translation.pdf
WebGPT-- Browser-assisted question-answering with human feedback.pdf
Your Language Model is Secretly a Reward Model.pdf
You can’t perform that action at this time.