Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

数据集在本地使用校验命令可以通过,在模型产线进行数据校验却报错 #3352

Open
3 of 4 tasks
lyc12569 opened this issue Feb 14, 2025 · 1 comment
Open
3 of 4 tasks
Assignees

Comments

@lyc12569
Copy link

lyc12569 commented Feb 14, 2025

Checklist:

描述问题

数据集在本地使用校验命令可以通过,
Image
在模型产线进行数据校验却报错

2025-02-14 21:43:58,415 - pp-pipeline-exec - INFO - u-2579921-pp-p-843963b3af15-tt-84417e3cac32
2025-02-14 21:43:58,415 - pp-pipeline-exec - INFO - 执行命令:python /home/aistudio/PaddleX/main.py -c '/home/aistudio/PaddleX/paddlex/configs/text_detection/PP-OCRv4_mobile_det.yaml' -o Global.mode=check_dataset -o Global.dataset_dir=/home/aistudio/data -o Global.output=/home/aistudio/output
2025-02-14 21:44:01,931 - pp-pipeline-exec - INFO - Traceback (most recent call last):
File "/home/aistudio/PaddleX/paddlex/utils/result_saver.py", line 29, in wrap
result = func(self, *args, **kwargs)
File "/home/aistudio/PaddleX/paddlex/engine.py", line 38, in run
return self._model.check_dataset()
File "/home/aistudio/PaddleX/paddlex/model.py", line 90, in check_dataset
return dataset_checker.check()
File "/home/aistudio/PaddleX/paddlex/modules/base/dataset_checker/dataset_checker.py", line 67, in check
dataset_dir = self.get_dataset_root(self.global_config.dataset_dir)
File "/home/aistudio/PaddleX/paddlex/modules/text_detection/dataset_checker/init.py", line 44, in get_dataset_root
assert len(anno_dirs) == 1
AssertionError

复现

  1. 您使用的命令是?
    python main.py -c PP-OCRv4_mobile_det.yaml -o Global.mode=check_dataset -o Global.dataset_dir=C:\Users\20583\Desktop\群维新控\数据集\text_localization(在本地使用的命令,没有问题)
  2. 您使用的数据集是?
    自建数据集
  3. 请提供您出现的报错信息及相关log

2025-02-14 21:43:58,415 - pp-pipeline-exec - INFO - u-2579921-pp-p-843963b3af15-tt-84417e3cac32
2025-02-14 21:43:58,415 - pp-pipeline-exec - INFO - 执行命令:python /home/aistudio/PaddleX/main.py -c '/home/aistudio/PaddleX/paddlex/configs/text_detection/PP-OCRv4_mobile_det.yaml' -o Global.mode=check_dataset -o Global.dataset_dir=/home/aistudio/data -o Global.output=/home/aistudio/output
2025-02-14 21:44:01,931 - pp-pipeline-exec - INFO - Traceback (most recent call last):
File "/home/aistudio/PaddleX/paddlex/utils/result_saver.py", line 29, in wrap
result = func(self, *args, **kwargs)
File "/home/aistudio/PaddleX/paddlex/engine.py", line 38, in run
return self._model.check_dataset()
File "/home/aistudio/PaddleX/paddlex/model.py", line 90, in check_dataset
return dataset_checker.check()
File "/home/aistudio/PaddleX/paddlex/modules/base/dataset_checker/dataset_checker.py", line 67, in check
dataset_dir = self.get_dataset_root(self.global_config.dataset_dir)
File "/home/aistudio/PaddleX/paddlex/modules/text_detection/dataset_checker/init.py", line 44, in get_dataset_root
assert len(anno_dirs) == 1
AssertionError

环境

  1. 请提供您使用的PaddleX的版本号
    3.0.0
  2. 请提供您使用的操作系统信息,如Linux/Windows/MacOS
    windows
  3. 请问您使用的Python版本是
    3.8
@lyc12569
Copy link
Author

这是check_dataset_result.json文件
{ "done_flag": true, "check_pass": true, "attributes": { "train_samples": 761, "train_sample_paths": [ "..\\数据集\\text_localization\\image\\177.jpg", "..\\数据集\\text_localization\\image\\178.jpg", "..\\数据集\\text_localization\\image\\179.jpg", "..\\数据集\\text_localization\\image\\180.jpg", "..\\数据集\\text_localization\\image\\181.jpg", "..\\数据集\\text_localization\\image\\182.jpg", "..\\数据集\\text_localization\\image\\183.jpg", "..\\数据集\\text_localization\\image\\184.jpg", "..\\数据集\\text_localization\\image\\185.jpg", "..\\数据集\\text_localization\\image\\186.jpg" ], "val_samples": 356, "val_sample_paths": [ "..\\数据集\\text_localization\\image\\1.jpg", "..\\数据集\\text_localization\\image\\2.jpg", "..\\数据集\\text_localization\\image\\3.jpg", "..\\数据集\\text_localization\\image\\4.jpg", "..\\数据集\\text_localization\\image\\5.jpg", "..\\数据集\\text_localization\\image\\6.jpg", "..\\数据集\\text_localization\\image\\7.jpg", "..\\数据集\\text_localization\\image\\8.jpg", "..\\数据集\\text_localization\\image\\9.jpg", "..\\数据集\\text_localization\\image\\10.jpg" ] }, "analysis": { "histogram": "check_dataset\\histogram.png" }, "dataset_path": "C:\\Users\\20583\\Desktop\\群维新控\\数据集\\text_localization", "show_type": "image", "dataset_type": "TextDetDataset" }

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants