This repo is the official implementation of Incubating Text Classifiers Following User Instruction with Nothing but LLM. It allows users to obtain a personalized classifier with nothing but an instruction as input. The incubation is based on a llama-2-7b model fine-tuned on Hugging Face metadata together with self-diversification.
You can use the script `incubate.sh` to incubate your own classifiers:
```bash
python incubate.py --n_epoch 16 \
    --batch_size 4 \
    --device 1 \
    --n_sample 16 \
    --max_new_tokens 64 \
    --instruction "Build a classifier that can categorize text messages by 'about food' and 'about movie'." \
    --incubator "KomeijiForce/Incubator-llama-2-7b" \
    --classifier "roberta-base" \
    --save_path "roberta-base-incubated"
```
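
For intuition, the incubation pipeline boils down to two steps: the incubator LLM synthesizes labeled examples for the labels named in the instruction, and a small classifier (here `roberta-base`) is then fine-tuned on that synthetic data. The sketch below illustrates only the first step with standard `transformers` generation; the actual prompt template, output parsing (`parse_samples` is a hypothetical placeholder), and the self-diversification step are handled inside `incubate.py` and will differ in detail.

```python
# Minimal sketch of the data-synthesis step, assuming a standard causal-LM
# interface; the real prompt template and parsing live inside incubate.py.
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

instruction = ("Build a classifier that can categorize text messages "
               "by 'about food' and 'about movie'.")

tokenizer = AutoTokenizer.from_pretrained("KomeijiForce/Incubator-llama-2-7b")
incubator = AutoModelForCausalLM.from_pretrained(
    "KomeijiForce/Incubator-llama-2-7b",
    torch_dtype=torch.float16,
    device_map="auto",  # requires accelerate; loads on GPU if available
)

# Hypothetical prompt: incubate.py defines the actual template around the instruction.
inputs = tokenizer(instruction, return_tensors="pt").to(incubator.device)
outputs = incubator.generate(
    **inputs,
    max_new_tokens=64,        # corresponds to --max_new_tokens
    do_sample=True,
    num_return_sequences=16,  # corresponds to --n_sample
)
generations = tokenizer.batch_decode(outputs, skip_special_tokens=True)

# parse_samples() is a hypothetical helper that would turn the generations
# into (text, label) pairs used to fine-tune the classifier:
# texts, labels = parse_samples(generations)
```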
Running the default incubation script, you should see output like the following on the default test cases:
```
Input: I love 'Spiderman 2'!
Predicted Label: about movie

Input: I ate a delicious pudding!
Predicted Label: about food
```
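
Once incubation finishes, the classifier written to `--save_path` can be used like any other fine-tuned sequence-classification model. The snippet below is a minimal usage sketch, assuming `incubate.py` saves both the weights and the tokenizer to that directory in the standard Hugging Face format (an assumption, not something spelled out in this README):

```python
from transformers import pipeline

# Load the incubated classifier from the --save_path directory.
classifier = pipeline("text-classification", model="roberta-base-incubated")

for text in ["I love 'Spiderman 2'!", "I ate a delicious pudding!"]:
    pred = classifier(text)[0]
    # The label strings depend on the id2label mapping written during incubation.
    print(text, "->", pred["label"], f"({pred['score']:.2f})")
```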