Skip to content

Data Corruption #30

@Zhenhan-Huang

Description

@Zhenhan-Huang

Hi, thank you for your good work. When I tried to use the dataset, I found some data seem to be corrupted. The corrupted data I encounted are

PIL.UnidentifiedImageError: cannot identify image file 'df40_dataset/MidJourney/fake/13592447708_Clean_East_Asian_male_face_only_face_shown_close-up_3ad609fd-9d87-4242-9d5b-d2776a3c2c7e.png

PIL.UnidentifiedImageError: cannot identify image file 'df40_dataset/starganv2/real/547.jpg'

When I used file, it showed that these files were empty. Can you check it?

Besides, REAME file says testing data including fake images only, but some datasets have subdirectories such as fake and real (e.g. heygen_new). Do images in real subdirectories are real images?

When I tried to unzip some zip files, it failed. The error message was error [pixart.zip]: start of central directory not found; zipfile corrupt.. I encountered error when using e4e.zip, pixart.zip and sd2.1.zip.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions