Wang Yuqi's Blog

NLP Paper Notes-2


CodeBERT


SemSup-XC: Semantic Supervision for Zero and Few-shot Extreme Classification


Deduplicating Training Data Makes Language Models Better


Toolformer: Language Models Can Teach Themselves to Use Tools


Learning Performance Improving Code Edits


Symbolic Discovery of Optimization Algorithms


Neural Machine Translation of Rare Words with Subword Units


Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt Tuning and Discovery


PromptCap: Prompt-Guided Task-Aware Image Captioning


A parallel corpus of Python functions and documentation strings for automated code documentation and code generation


AlphaCode


BERT


Language Models are Few-Shot Learners(GPT3)


SELECTIVE ANNOTATION MAKES LANGUAGE MODELS BETTER FEW-SHOT LEARNERS

Parameter-Efficient Transfer Learning for NLP

Parameter-Efficient Transfer Learning for NLP

Scaling Transformer to 1M tokens and beyond with RMT

1

Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics