Process is not killing itself on fatal error (docker) #690

ozbarshalom · 2017-03-30T19:55:40Z

I am running the latest versions of Mongo Connector and elastic2-doc-manager in Docker (via OpenShift).
Everything works until a ReadTimeoutError occurs because a request to Elasticsearch took longer than 10 seconds.
In that case, mongo-connector should send itself a signal and exit. That would let Docker restart the container automatically (recover) and resume from the last oplog entry.

In my case, the connector was "stuck" on this error for about 20 hours. When I restarted it manually, it could not catch up with the changes made in MongoDB in the meantime (about 20,000 updates per hour).


ShaneHarvey · 2017-03-30T21:20:01Z

Can you post the traceback you're seeing in the logs? Also, what versions of mongo-connector and elastic-doc-manager are you running?

Seems related to yougov/elastic2-doc-manager#18

ozbarshalom · 2017-03-30T21:35:13Z

I am using elastic2-doc-manager version 0.3.0 and mongo-connector from the latest commit on the master branch.
It looks like ConnectionTimeout is missing from the exceptions handled by wrap_exceptions.
I just created a related issue, mongodb-labs/elastic2-doc-manager#44, and opened a pull request.

That should handle this.

ShaneHarvey · 2017-03-31T00:44:38Z

Mapping the error to OperationFailed would cause mongo-connector to ignore the error and not exit. The real issue is that the elastic doc manager does not handle timeouts on its Elasticsearch client. That error should currently be uncaught and mongo-connector should exit. It should not be "stuck". Can you post the ReadTimeoutError error message or stacktrace that you're seeing?
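
To illustrate the distinction, here is a minimal sketch, not mongo-connector's actual code. The classes `OperationFailed` and `ReadTimeout` and the functions `run_sync` and `main` are hypothetical stand-ins defined only for this example. It assumes a sync loop that skips errors of a "recoverable" type and lets anything else propagate, so the process ends with a non-zero exit status.

```python
import sys


class OperationFailed(Exception):
    """Hypothetical stand-in for an error type the sync loop treats as recoverable."""


class ReadTimeout(Exception):
    """Hypothetical stand-in for a client timeout that is not mapped to anything."""


def run_sync(operations):
    """Run each operation; skip recoverable failures, let all other errors propagate."""
    skipped = 0
    for op in operations:
        try:
            op()
        except OperationFailed:
            # A mapped error is ignored and the loop keeps going.
            skipped += 1
    return skipped


def main(operations):
    """Return an exit status: 0 on success, 1 when an unhandled error escapes the loop."""
    try:
        run_sync(operations)
    except Exception as exc:
        print(f"fatal: {exc!r}", file=sys.stderr)
        return 1
    return 0
```

Calling `sys.exit(main(operations))` would then hand the status to the container runtime, which can restart the process if it is configured to restart on failure. If the timeout were mapped to the recoverable type instead, the loop would swallow it and the process would never exit.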

ozbarshalom · 2017-04-02T09:11:37Z

I can't export the traceback, but the error is: ReadTimeoutError: HTTPConnectionPool(host='hostname', port=9200): Read Time out. (read timeout=10)

Maybe it does not exit because I used --continue-on-error? But it did not even continue; it just got stuck.

nzapponi · 2017-08-30T17:31:49Z

Have there been any updates on this? I'm still seeing this issue!
