Skip to content

Add support for transformer models#3761

Draft
marco-c wants to merge 5 commits intomozilla:masterfrom
marco-c:transformers
Draft

Add support for transformer models#3761
marco-c wants to merge 5 commits intomozilla:masterfrom
marco-c:transformers

Conversation

@marco-c
Copy link
Copy Markdown
Collaborator

@marco-c marco-c commented Oct 27, 2023

Add a simple example model for detecting defects, both with finetuning and with using embeddings to train a xgboost model

Fixes #39

Train on Taskcluster: defect_finetuning

@marco-c
Copy link
Copy Markdown
Collaborator Author

marco-c commented Nov 8, 2023

It would also be nice to support LoRa (or at least add an example on how to use it). https://huggingface.co/blog/Lora-for-sequence-classification-with-Roberta-Llama-Mistral

@marco-c marco-c changed the title WIP: Add support for transformer models Add support for transformer models Nov 24, 2023
@marco-c marco-c marked this pull request as ready for review November 24, 2023 17:51
@marco-c marco-c requested a review from suhaibmujahid August 4, 2025 10:05
@suhaibmujahid suhaibmujahid marked this pull request as draft December 23, 2025 20:37
@marco-c marco-c closed this Apr 1, 2026
@marco-c marco-c reopened this Apr 1, 2026
@marco-c
Copy link
Copy Markdown
Collaborator Author

marco-c commented Apr 1, 2026

The docker build task is hanging at:

Successfully tagged registry.hub.docker.com/mozilla/bugbug-base:latest
b3cd317af722d52d61ce0214c7a95ecc046171da421bbce0732f09745d06bd7d
INFO:taskboot.docker:Built image registry.hub.docker.com/mozilla/bugbug-base:latest
INFO:taskboot.docker:Saving image with tags registry.hub.docker.com/mozilla/bugbug-base:latest to /images/bugbug-base.tar

Maybe the image is too big?

@suhaibmujahid
Copy link
Copy Markdown
Member

The docker build task is hanging

It is moving now!

@marco-c
Copy link
Copy Markdown
Collaborator Author

marco-c commented Apr 1, 2026

It is super slow though!
I see:

/images/bugbug-base.tar : 45.96% ( 6.61 GiB => 3.04 GiB, /images/bugbug-base.tar.zst)

while on master:

/images/bugbug-base.tar : 37.84% ( 2.23 GiB => 864 MiB, /images/bugbug-base.tar.zst)

@marco-c
Copy link
Copy Markdown
Collaborator Author

marco-c commented Apr 23, 2026

I'm not sure why, but the situation seems far better now that we use uv.

@marco-c
Copy link
Copy Markdown
Collaborator Author

marco-c commented Apr 23, 2026

It looks like train on tc task is broken though, probably not due to this PR:

+ python -m scripts.trainer_extract_args
/usr/local/bin/python: Error while finding module specification for 'scripts.trainer_extract_args' (ModuleNotFoundError: No module named 'scripts')
+ python -m scripts.trainer
/usr/local/bin/python: Error while finding module specification for 'scripts.trainer' (ModuleNotFoundError: No module named 'scripts')

Maybe due to the uv migration?

@marco-c
Copy link
Copy Markdown
Collaborator Author

marco-c commented Apr 23, 2026

I'm not sure why, but the situation seems far better now that we use uv.

My bad, I didn't update uv.lock. It's still super slow.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Try transformer-based models

2 participants