Shard Quality¶
A shard refers to dividing larger data into multiple segments, which facilitates model training by making it easier to locate and hit the data. d.run supports viewing the quality of shards. The specific steps are as follows:
-
In the Data Analysis section, click Shard Quality, and use the Search function to find the shard you are interested in. Click the shard to enter the details page, where you can view detailed information about this shard.
-
You can view the following content:
- Corpus: Which corpus the shard belongs to.
- Update Time: The last update time of the shard file.
- Shard ID: The unique identification code of the shard.
- Shard Content: The specific content of the shard after slicing.
- Additional Information: Additional content related to this shard.
-
When new shard files are evaluated, you can click the Refresh button in the upper right corner to view the latest shard files.