Skip to content

Commit

Permalink
Moving image again and correcting reference.
Browse files Browse the repository at this point in the history
  • Loading branch information
crlandsc committed May 16, 2024
1 parent b100c13 commit 4fcc009
Show file tree
Hide file tree
Showing 4 changed files with 10 additions and 5 deletions.
5 changes: 5 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -18,3 +18,8 @@ Error in tensor shapes. Was processing as the incorrect [batch, channels, stem,

#### Include Image
Moved frequency weighting image to be included in the package.

## 0.1.4 (2024-05-16)

#### Image Bug
Moving image again and correcting reference.
6 changes: 3 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -65,9 +65,9 @@ The goal of this metric is to account for several factors not present in current
##### Frequency Weighting
To measure the frequencies of a signal closer to that of human hearing, the following frequency weighting is applied. This helps the model effectively pay less attention to errors at frequencies that humans are not sensitive to (e.g. 50 Hz) and give more weight to those that we are acutely tuned to (e.g. 3kHz).

![Frequency Weighting](torch_log_wmse_audio_quality/images/frequency_weighting.png)
![Frequency Weighting](https://raw.githubusercontent.com/crlandsc/torch-log-wmse-audio-quality/main/images/frequency_weighting.png)

This metric has been constructed with high fidelity audio in mind (sample rates ≥ 44.1kHz). It theoretically could work for lower sample rates, like 16kHz, but the metric performs an internal resampling to 44.1kHz for consistency across any input sample rates.
This metric has been constructed with high-fidelity audio in mind (sample rates ≥ 44.1kHz). It theoretically could work for lower sample rates, like 16kHz, but the metric performs an internal resampling to 44.1kHz for consistency across any input sample rates.

##### Inputs
Unlike many audio quality metrics, logWMSE accepts 3 audio inputs rather than 2:
Expand Down Expand Up @@ -95,5 +95,5 @@ Contributions are welcome! Please open an issue or submit a pull request if you
This project is licensed under the Apache License 2.0. See [LICENSE](LICENSE) for details.


## Acknowledgements
## Acknowledgments
Thanks to [Whitebalance](https://www.whitebalance.co/) for backing this project.
File renamed without changes
4 changes: 2 additions & 2 deletions setup.cfg
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
[metadata]
name = torch-log-wmse-audio-quality
version = 0.1.3
version = 0.1.4
author = Christopher Landschoot
author_email = [email protected]
license = Apache License 2.0
Expand All @@ -25,7 +25,7 @@ install_requires =
numpy>=1.23.5

[options.package_data]
torch_log_wmse_audio_quality = filter_ir.pkl, images/frequency_weighting.png
torch_log_wmse_audio_quality = filter_ir.pkl

[options.packages.find]
where = .

0 comments on commit 4fcc009

Please sign in to comment.