Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use 16bits arithmetic for F_4 #265

Open
vrancurel opened this issue Nov 2, 2018 · 3 comments
Open

Use 16bits arithmetic for F_4 #265

vrancurel opened this issue Nov 2, 2018 · 3 comments
Assignees

Comments

@vrancurel
Copy link
Contributor

Instead of using 32bits arithmetic we can use 16 bits arithmetic.

@vrancurel
Copy link
Contributor Author

vrancurel commented Nov 2, 2018

Here is a simulation: https://github.com/vrancurel/f4mul

@vrancurel
Copy link
Contributor Author

Lâm, can you give it a try ?

@vrancurel
Copy link
Contributor Author

vrancurel commented Nov 2, 2018

We can probably further optimize the loop we do to adjust with SIMD operations themselves, e.g. with cmpeq_epu16(), etc

Bitfields operation are in fact unlikely so we can process them only if the bitmap != 0.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants