Skip to content

Actions: NVIDIA/NeMo-Curator

Test Python package

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
1,650 workflow runs
1,650 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Add TrafilaturaExtractor class
Test Python package #1760: Pull request #431 synchronize by sarahyurick
December 24, 2024 18:48 4m 59s sarahyurick:trafilatura_extractor
December 24, 2024 18:48 4m 59s
Skip flaky PyTests in test_download
Test Python package #1759: Pull request #454 synchronize by sarahyurick
December 24, 2024 18:44 4m 33s sarahyurick:flaky_test_common_crawl_news_urls
December 24, 2024 18:44 4m 33s
Change filename column name to file_name (#449)
Test Python package #1757: Commit 4fb7f54 pushed by praateekmahajan
December 24, 2024 03:42 5m 2s main
December 24, 2024 03:42 5m 2s
Add tests/test_classifiers.py PyTests (#421)
Test Python package #1754: Commit b8ff71e pushed by sarahyurick
December 23, 2024 20:48 4m 39s main
December 23, 2024 20:48 4m 39s
[DO NOT MERGE] Check gpuCI on non-main branch
Test Python package #1753: Pull request #455 opened by sarahyurick
December 23, 2024 20:15 4m 38s sarahyurick/ci/gpuci_target_branches
December 23, 2024 20:15 4m 38s
Fix GPU error messages for fuzzy deduplication
Test Python package #1752: Pull request #387 synchronize by sarahyurick
December 23, 2024 20:00 4m 28s sarahyurick:fuzzy_gpu_error
December 23, 2024 20:00 4m 28s
Skip flaky PyTests in test_download
Test Python package #1751: Pull request #454 synchronize by sarahyurick
December 23, 2024 19:10 4m 30s sarahyurick:flaky_test_common_crawl_news_urls
December 23, 2024 19:10 4m 30s
Add tests/test_classifiers.py PyTests
Test Python package #1750: Pull request #421 synchronize by sarahyurick
December 23, 2024 18:55 4m 47s sarahyurick:test_classifiers
December 23, 2024 18:55 4m 47s
Global cache_dir variable for exact, fuzzy, and semantic deduplication
Test Python package #1748: Pull request #384 synchronize by sarahyurick
December 23, 2024 18:17 4m 31s sarahyurick:global_cache_dir
December 23, 2024 18:17 4m 31s
Remove max_text_bytes_per_part
Test Python package #1747: Pull request #385 synchronize by sarahyurick
December 23, 2024 18:17 4m 40s sarahyurick:remove_max_text_bytes_per_part
December 23, 2024 18:17 4m 40s
Fix GPU error messages for fuzzy deduplication
Test Python package #1746: Pull request #387 synchronize by sarahyurick
December 23, 2024 18:17 5m 1s sarahyurick:fuzzy_gpu_error
December 23, 2024 18:17 5m 1s
Create separate files for each deduplication class
Test Python package #1745: Pull request #409 synchronize by sarahyurick
December 23, 2024 18:17 4m 53s sarahyurick:split_dedupe_files2
December 23, 2024 18:17 4m 53s
Create notebook tutorials for distributed data classifiers
Test Python package #1744: Pull request #415 synchronize by sarahyurick
December 23, 2024 18:17 4m 40s sarahyurick:distributed_notebooks
December 23, 2024 18:17 4m 40s
Update get_all_files_paths_under examples to include keep_extensions
Test Python package #1743: Pull request #450 synchronize by sarahyurick
December 23, 2024 18:16 4m 30s sarahyurick:filter_by_docs
December 23, 2024 18:16 4m 30s
Convert translation_example.py into a Jupyter Notebook tutorial
Test Python package #1742: Pull request #336 synchronize by sarahyurick
December 23, 2024 18:16 4m 33s sarahyurick:translation_tutorial
December 23, 2024 18:16 4m 33s
Add tests/test_classifiers.py PyTests
Test Python package #1741: Pull request #421 synchronize by sarahyurick
December 23, 2024 18:13 4m 38s sarahyurick:test_classifiers
December 23, 2024 18:13 4m 38s
Add TrafilaturaExtractor class
Test Python package #1740: Pull request #431 synchronize by sarahyurick
December 23, 2024 18:07 4m 28s sarahyurick:trafilatura_extractor
December 23, 2024 18:07 4m 28s
Add TrafilaturaExtractor class
Test Python package #1739: Pull request #431 synchronize by sarahyurick
December 23, 2024 18:01 4m 29s sarahyurick:trafilatura_extractor
December 23, 2024 18:01 4m 29s
Bug fix in dockerfile ARG vs ENV var (#446)
Test Python package #1738: Commit 35b5993 pushed by praateekmahajan
December 23, 2024 10:16 4m 33s main
December 23, 2024 10:16 4m 33s
ci: Update release.yml
Test Python package #1737: Pull request #452 opened by ko3n1g
December 21, 2024 19:30 4m 22s ko3n1g-patch-2
December 21, 2024 19:30 4m 22s
chore: Bump to 0.6.0rc2
Test Python package #1736: Pull request #451 opened by ko3n1g
December 20, 2024 22:57 4m 36s ko3n1g-patch-1
December 20, 2024 22:57 4m 36s