Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[TASK] Support for Apache Tika 2+ #218

Open
dkd-kaehm opened this issue Jan 31, 2024 · 4 comments
Open

[TASK] Support for Apache Tika 2+ #218

dkd-kaehm opened this issue Jan 31, 2024 · 4 comments

Comments

@dkd-kaehm
Copy link
Contributor

What should be done in the scope of this task?
We should provide compatibility to Apache Tika 2+

@cyberelk
Copy link

What is the current status of this issue?

We have noticed that on a TYPO3 V11 installation with Tika app version 1.27, minor problems occur when reading the file contents: From time to time the Tika processes hang for no apparent reason and sporadically warnings are returned when reading out files (J2KImageReader not loaded. JPEG2000 files will not be processed. or org.xerial's sqlite-jdbc is not loaded.). The same occurs with TYPO3 V12 and Tika 1.28.

For test purposes, we have simply placed Tika version 2.9.2 next to it. The warnings no longer exist here. We were also unable to detect any hanging processes. However, the tests in this regard were also rather superficial.

So the question now arises:
From your point of view, what are the current arguments against using the Tika app in version 2.9.2 productively for versions 11 and 12 of TYPO3?

Kind regards, Jari

@dkd-kaehm
Copy link
Contributor Author

dkd-kaehm commented Jun 28, 2024

Apache Solr uses Tika 1.x.
We can chose any version of TIKA if Apache Solr drops TIKA CELL. See: #180

@dkd-kaehm
Copy link
Contributor Author

@cyberelk
Are you using Tika App or Tika Server mode?

@cyberelk
Copy link

At the moment we are using the app Version.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants