Skip to content

Commit

Permalink
Mod: Update README.
Browse files Browse the repository at this point in the history
  • Loading branch information
Labbeti committed Jan 9, 2024
1 parent 38c5473 commit d9db8bd
Showing 1 changed file with 5 additions and 3 deletions.
8 changes: 5 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -67,7 +67,7 @@ aac-datasets-download --root "." clotho --subsets "dev"
```

## Datasets information
Here is the statistics for each dataset :
`aac-datasets` package contains 4 different datasets :

<!-- | | AudioCaps | Clotho | MACS | WavCaps |
|:---:|:---:|:---:|:---:|:---:|
Expand All @@ -85,7 +85,7 @@ Here is the statistics for each dataset :

For Clotho, the **dev** subset should be used for training, val for validation and eval for testing.

Here is additional statistics on the train subset for AudioCaps, Clotho and MACS:
Here is additional statistics on the train subset for AudioCaps, Clotho, MACS and WavCaps:

| | AudioCaps/train | Clotho/dev | MACS/full | WavCaps/full |
|:---:|:---:|:---:|:---:|:---:|
Expand All @@ -96,7 +96,9 @@ Here is additional statistics on the train subset for AudioCaps, Clotho and MACS
| Nb captions | 49,838 | 19,195 | 17,275 | 403,050 |
| Total nb words<sup>2</sup> | 402,482 | 217,362 | 160,006 | 3,161,823 |
| Sentence size<sup>2</sup> | 2-52 | 8-20 | 5-40 | 2-38 |
| Vocabulary<sup>2</sup> | 4724 | 4369 | 2721 | 24600 |
| Vocabulary<sup>2</sup> | 4724 | 4369 | 2721 | 24,600 |
| Annotated by | Human | Human | Human | Machine |
| Corrected by | Human | Human | None | None |

<sup>1</sup> This duration is estimated on the total duration of 46230/49838 files of 126.7h.

Expand Down

0 comments on commit d9db8bd

Please sign in to comment.