soft timeout for long running text extractions #397

hi-ko · 2022-04-04T09:21:58Z

as discusse on Discord the new transformer framework degrades scalability/stability due to more long-running threads.

The only work around by today is to increase timeouts for the http client but that will pile up the number of threads which is not a good idea. e.g.

solr.http.socket.timeout=30000
solr.http.connection.timeout=10000

To fix this, the tracker or repo web script should support a soft timeout that offloads the threads and triggers a mechanism as discussed in #396 to mark a node so that it is not captured by the content tracker and that automatically restores visibility to the tracker once the content has been transformed by a T-Engine.

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

soft timeout for long running text extractions #397

soft timeout for long running text extractions #397

hi-ko commented Apr 4, 2022 •

edited

Loading

soft timeout for long running text extractions #397

soft timeout for long running text extractions #397

Comments

hi-ko commented Apr 4, 2022 • edited Loading

hi-ko commented Apr 4, 2022 •

edited

Loading