Series

Paper Reviews

1.[Paper Review] Taking Notes on the Fly Helps Language Pre-Training (TNF)

A method for improving the pre-training efficiency of masked language modeling

September 20, 2023

2.[Paper Review] Don’t Stop Pretraining: Adapt Language Models to Domains and Tasks (DAPT + TAPT)

ㅇㅅㅇ

September 20, 2023

3.[Paper Review] GPT Understands, Too (P-Tuning)

September 20, 2023

4.[Paper Review] Exploiting Cloze Questions for Few Shot Text Classification and Natural Language Inference (PET)

September 20, 2023

5.[Paper Review] Shortformer: Better Language Modeling Using Shorter Inputs

September 20, 2023

6.[Paper Review] Diversifying Dialog Generation via Adaptive Label Smoothing (AdaLabel)

September 20, 2023

7.[Paper Review] ALBERT: A Lite BERT For Self-Supervised Learning of Language Representations

September 20, 2023

8.[Paper Review] BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension

September 20, 2023

9.[Paper Review] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining

September 20, 2023

10.[Paper Review] NExT-GPT: Any-to-Any Multimodal LLM

ㅇㅅㅇ

October 1, 2023

11.[Paper Review] Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs (SignRound)

ㅇㅅㅇ

October 1, 2023

12.[Paper Review] Neurons in Large Language Models: Dead, N-gram, Positional

ㅇㅅㅇ

October 1, 2023

13.[Paper Review] Mobile V-MoEs: Scaling Down Vision Transformers via Sparse Mixture-of-Experts

ㅇㅅㅇ

October 1, 2023

14.[Paper Review] Statistical Rejection Sampling Improves Preference Optimization (RSO)

ㅇㅅㅇ

October 1, 2023

15.[Paper Review] Explaining Grokking Through Circuit Efficiency

ㅇㅅㅇ

October 1, 2023

16.[Paper Review] Generative Image Dynamics

October 1, 2023

17.[Paper Review] Compositional Foundation Models for Hierarchical Planning (HiP)

October 1, 2023

18.[Paper Review] Contrastive Decoding Improves Reasoning in Large Language Models (CD)

ㅇㅅㅇ

October 1, 2023

19.[Paper Review] Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions

October 1, 2023

20.[Paper Review] Chain-of-Verification Reduces Hallucination in Large Language Models (CoVe)

October 1, 2023

21.[Paper Review] OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models

October 1, 2023

22.[Paper Review] Decision Transformer: Reinforcement Learning via Sequence Modeling

November 14, 2023

23.[Paper Review] MOReL: Model-Based Offline Reinforcement Learning

November 14, 2023

24.[Paper Review] A Game Theoretic Framework for Model Based Reinforcement Learning

November 14, 2023

25.[Paper Review] When to Replan? An Adaptive Replanning Strategy for Autonomous Navigation using Deep Reinforcement Learning

November 14, 2023

26.[Paper Review] Individual Reward Assisted Multi-Agent Reinforcement Learning

November 14, 2023

27.[Paper Review] Offline Multi-Agent Reinforcement Learning with Knowledge Distillation

November 14, 2023

28.[Paper Review] Hierarchical Planning Through Goal-Conditioned Offline Reinforcement Learning

November 14, 2023

29.[Paper Review] Deep Reinforcement Learning for UAV Intelligent Mission Planning

November 14, 2023

30.[Paper Review] High-Dimensional Continuous Control using Generalized Advantage Estimation (GAE)

January 17, 2024

31.[Paper Review] Making Large Language Models A Better Foundation For Dense Retrieval (LLaRA)

January 17, 2024

32.[Paper Review] SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling

ㅇㅅㅇ

January 17, 2024

33.[Paper Review] Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement Learning

ㅇㅅㅇ

January 17, 2024

34.[Paper Review] Dolphins: Multimodal Language Model for Driving

ㅇㅅㅇ

January 17, 2024

35.[Paper Review] TinyLlama: An Open-Source Small Language Model

ㅇㅅㅇ

January 17, 2024

36.[Paper Review] Mixtral of Experts (Mixtral 8x7B)

ㅇㅅㅇ

January 17, 2024

37.[Paper Review] Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation

ㅇㅅㅇ

January 17, 2024

38.[Paper Review] Patchscopes: A Unifying Framework for Inspecting Hidden Representations of Language Models

ㅇㅅㅇ

January 17, 2024

39.[Paper Review] LLMs Cannot Find Reasoning Errors, but Can Correct Them!

ㅇㅅㅇ

January 17, 2024

40.[Paper Review] Rethinking the Role of Demonstrations: What Makes In-Context Learning Work?

ㅇㅅㅇ

January 17, 2024

41.[Paper Review] Training Language Models to Follow Instructions with Human Feedback (InstructGPT)

ㅇㅅㅇ

January 17, 2024

42.[Paper Review] Llama 2: Open Foundation and Fine-Tuned Chat Models

ㅇㅅㅇ

January 17, 2024

43.AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotics Agents

ㅇㅅㅇ

February 18, 2024