Pull requests: NVIDIA/NeMo-Curator
Minor edits to instructions in readme, requirements, fuzzy and semantic dedupe flags
#548
opened Feb 14, 2025 by
ruchaa-apte
Hard negative mining for Retriever fine-tuning
#523
opened Feb 5, 2025 by
vinay-raman
3 tasks done
Clean up Pandas, cuDF, Dask, and Dask-cuDF DocumentDataset type logic
gpuci
Run GPU CI/CD on PR
#494
opened Jan 23, 2025 by
sarahyurick
Standardize text_field and id_field terminology
gpuci
Run GPU CI/CD on PR
#485
opened Jan 17, 2025 by
sarahyurick
Add nemo-toolkit dependency to gpuCI
gpuci
Run GPU CI/CD on PR
#480
opened Jan 10, 2025 by
sarahyurick
Support dask_expr migration into dask.dataframe
#477
opened Jan 9, 2025 by
rjzamora
3 tasks
[WIP] Add RAPIDS Nightly to GPU CI
gpuci
Run GPU CI/CD on PR
#436
opened Dec 17, 2024 by
praateekmahajan
Draft
3 tasks
Bump nltk from 3.8.1 to 3.9 in /tutorials/dapt-curation/code
dependencies
Pull requests that update a dependency file
#429
opened Dec 13, 2024 by
dependabot (bot)
Fix GPU error messages for fuzzy deduplication
gpuci
Run GPU CI/CD on PR
#387
opened Nov 22, 2024 by
sarahyurick
2 tasks done
Remove max_text_bytes_per_part
gpuci
Run GPU CI/CD on PR
#385
opened Nov 20, 2024 by
sarahyurick
Draft
Create Cache class for exact, fuzzy, and semantic deduplication
gpuci
Run GPU CI/CD on PR
#384
opened Nov 19, 2024 by
sarahyurick
4 tasks done
Convert translation_example.py into a Jupyter Notebook tutorial
#336
opened Oct 29, 2024 by
sarahyurick
Added example notebook for translation with ct2 model.
documentation
Improvements or additions to documentation
Adding an example for executing NeMo modules using kubernetes Python …
documentation
Improvements or additions to documentation
#148
opened Jul 9, 2024 by
dpadmanabhan03
2 of 3 tasks