feat: improved logging #89

adubovik · 2024-12-27T14:22:47Z

Removed Batches: 100%|███████████████| 1/1 [00:00<00:00, 1.44it/s] tqdm progress bars printed on each request
Removed Chat completion response length {int} info logs printed on each request
Added logger and timings to the individual stages of message processing: topic, lang_id, influx
Used the same log format that is used in adapters
Added LOG_LEVEL env var that resolves Log level management #54

Before:

INFO:     Loading environment from '.env'
2024-12-27 14:20:29,709 [INFO] - Started server process [83620]
2024-12-27 14:20:29,709 [INFO] - Waiting for application startup.
2024-12-27 14:20:30,255 [INFO] - Load pretrained SentenceTransformer: all-mpnet-base-v2
2024-12-27 14:20:31,684 [INFO] - Use pytorch device_name: mps
Batches[encode]: 100%|███████████████████████████████████████████████████████████████| 1/1 [00:00<00:00,  1.74it/s]
2024-12-27 14:20:32,640 [INFO] - Application startup complete.
2024-12-27 14:20:32,640 [INFO] - Uvicorn running on http://127.0.0.1:5006 (Press CTRL+C to quit)
...
2024-12-27 14:20:58,702 [INFO] - Chat completion response length 4
Batches[encode]: 100%|███████████████████████████████████████████████████████████████| 1/1 [00:00<00:00,  3.35it/s]
2024-12-27 14:20:59,028 [INFO] - Chat completion response length 4
Batches[encode]: 100%|███████████████████████████████████████████████████████████████| 1/1 [00:00<00:00,  4.93it/s]
2024-12-27 14:20:59,244 [INFO] - Chat completion response length 4
Batches[encode]: 100%|███████████████████████████████████████████████████████████████| 1/1 [00:00<00:00,  3.33it/s]
2024-12-27 14:20:59,552 [INFO] - Chat completion response length 4
Batches[encode]: 100%|███████████████████████████████████████████████████████████████| 1/1 [00:00<00:00,  2.89it/s]
2024-12-27 14:20:59,943 [INFO] - Chat completion response length 4
Batches[encode]: 100%|███████████████████████████████████████████████████████████████| 1/1 [00:00<00:00,  6.47it/s]
2024-12-27 14:21:00,116 [INFO] - 127.0.0.1:62705 - "POST /data HTTP/1.1" 200
2024-12-27 14:21:00,117 [INFO] - 127.0.0.1:62708 - "POST /data HTTP/1.1" 200
2024-12-27 14:21:00,117 [INFO] - 127.0.0.1:62699 - "POST /data HTTP/1.1" 200
2024-12-27 14:21:00,117 [INFO] - 127.0.0.1:62701 - "POST /data HTTP/1.1" 200
2024-12-27 14:21:00,130 [INFO] - 127.0.0.1:62704 - "POST /data HTTP/1.1" 200
2024-12-27 14:21:00,130 [INFO] - 127.0.0.1:62703 - "POST /data HTTP/1.1" 200
2024-12-27 14:21:00,131 [INFO] - 127.0.0.1:62700 - "POST /data HTTP/1.1" 200
2024-12-27 14:21:00,131 [INFO] - 127.0.0.1:62709 - "POST /data HTTP/1.1" 200
2024-12-27 14:21:00,133 [INFO] - 127.0.0.1:62702 - "POST /data HTTP/1.1" 200
2024-12-27 14:21:00,134 [INFO] - 127.0.0.1:62710 - "POST /data HTTP/1.1" 200

After:

INFO:     Loading environment from '.env'
INFO:     | 2024-12-27 14:15:27,123 | 40334 | uvicorn.error | Started server process [40334]
INFO:     | 2024-12-27 14:15:27,124 | 40334 | uvicorn.error | Waiting for application startup.
INFO:     | 2024-12-27 14:15:28,204 | 40334 | sentence_transformers.SentenceTransformer | Load pretrained SentenceTransformer: all-mpnet-base-v2
INFO:     | 2024-12-27 14:15:29,233 | 40334 | sentence_transformers.SentenceTransformer | Use pytorch device_name: mps
INFO:     | 2024-12-27 14:15:30,463 | 40334 | uvicorn.error | Application startup complete.
INFO:     | 2024-12-27 14:15:30,480 | 40334 | uvicorn.error | Uvicorn running on http://127.0.0.1:5006 (Press CTRL+C to quit)
...
INFO:     | 2024-12-27 14:18:42,464 | 40334 | app | number of messages: 2
DEBUG:    | 2024-12-27 14:18:42,567 | 40334 | app | [2/2][topic] 0.151s
DEBUG:    | 2024-12-27 14:18:42,568 | 40334 | app | [2/2][langid] 0.004s
DEBUG:    | 2024-12-27 14:18:42,647 | 40334 | app | [1/2][influx] 2.377s
INFO:     | 2024-12-27 14:18:42,879 | 40334 | app | [1/2] success
DEBUG:    | 2024-12-27 14:18:42,934 | 40334 | app | [1/2] 2.721s
...
DEBUG:    | 2024-12-27 14:18:43,242 | 40334 | app | [2/2][influx] 0.558s
INFO:     | 2024-12-27 14:18:43,256 | 40334 | app | [2/2] success
DEBUG:    | 2024-12-27 14:18:43,345 | 40334 | app | [2/2] 0.758s
DEBUG:    | 2024-12-27 14:18:43,345 | 40334 | app | 2.676s
DEBUG:    | 2024-12-27 14:18:43,566 | 40334 | app | response: [{"status": "success"}, {"status": "success"}]
INFO:     | 2024-12-27 14:18:43,678 | 40334 | uvicorn.access | 127.0.0.1:62587 - "POST /data HTTP/1.1" 200

After V2:

INFO:     Loading environment from '.env'
INFO:     | 2024-12-27 14:15:27,123 | 40334 | uvicorn.error | Started server process [40334]
INFO:     | 2024-12-27 14:15:27,124 | 40334 | uvicorn.error | Waiting for application startup.
INFO:     | 2024-12-27 14:15:30,463 | 40334 | uvicorn.error | Application startup complete.
INFO:     | 2024-12-27 14:15:30,480 | 40334 | uvicorn.error | Uvicorn running on http://127.0.0.1:5006 (Press CTRL+C to quit)
...
INFO:     | 2024-12-27 14:18:42,464 | 40334 | app | number of messages: 2
DEBUG:    | 2024-12-27 14:18:42,567 | 40334 | app.topic | [2/2] 0.151s
DEBUG:    | 2024-12-27 14:18:42,568 | 40334 | app.langid | [2/2] 0.004s
DEBUG:    | 2024-12-27 14:18:42,647 | 40334 | app.influx | [1/2] 2.377s
INFO:     | 2024-12-27 14:18:42,879 | 40334 | app | [1/2] success
DEBUG:    | 2024-12-27 14:18:42,934 | 40334 | app | [1/2] 2.721s
...
DEBUG:    | 2024-12-27 14:18:43,242 | 40334 | app.influx | [2/2] 0.558s
INFO:     | 2024-12-27 14:18:43,256 | 40334 | app | [2/2] success
DEBUG:    | 2024-12-27 14:18:43,345 | 40334 | app | [2/2] 0.758s
DEBUG:    | 2024-12-27 14:18:43,345 | 40334 | app | 2.676s
DEBUG:    | 2024-12-27 14:18:43,566 | 40334 | app | response: [{"status": "success"}, {"status": "success"}]
INFO:     | 2024-12-27 14:18:43,678 | 40334 | uvicorn.access | 127.0.0.1:62587 - "POST /data HTTP/1.1" 200

Allob · 2025-01-17T17:33:22Z

INFO:     | 2024-12-27 14:18:42,464 | 40334 | app | number of messages: 2
DEBUG:    | 2024-12-27 14:18:42,567 | 40334 | app | [2/2][topic] 0.151s
DEBUG:    | 2024-12-27 14:18:42,568 | 40334 | app | [2/2][langid] 0.004s
DEBUG:    | 2024-12-27 14:18:42,647 | 40334 | app | [1/2][influx] 2.377s
INFO:     | 2024-12-27 14:18:42,879 | 40334 | app | [1/2] success
DEBUG:    | 2024-12-27 14:18:42,934 | 40334 | app | [1/2] 2.721s

Why do we want [langid] and others to be a message prefix instead of the name of the logger?
I think standard approach like logger = logging.getLogger("langid") or even logger = logging.getLogger(__name__) should work well enough here.

Allob · 2025-01-17T17:39:51Z

aidial_analytics_realtime/topic_model.py

        text = text.strip()
        if not text:
            return None

-        topics, _ = self.model.transform([text])
-        topic = self.model.get_topic_info(topics[0])
+        with Timer(with_prefix(logger, "[topic]").debug):


It might be more convenient to have a timer as a decorator for the function.

If I'm going to use a function decorator here, I will have to introduce a new function instead of using the code block.

How is it more convenient?

Allob · 2025-01-17T18:09:45Z

aidial_analytics_realtime/utils/log_config.py

-    # Setting up log levels
-    logger.setLevel(logging.DEBUG)
+    # Setting log levels for the analytics application
+    app_logger.setLevel(LOG_LEVEL)


Why is the LOG_LEVEL set for the app logger and not for the root logger here?

Setting it for the root logger enables logging in the depedency packages (like urllib3 and sentence_transformers). I'm not sure that's what we want. Typically LOG_LEVEL is understood as a log level of the application itself.

Moreover, we could not be sure that LOG_LEVEL=INFO in the dependency packages do not expose sensitive information. So I'm actually disinclined to enable any LOG_LEVEL for the root logger, even INFO.

adubovik · 2025-01-20T12:45:04Z

aidial_analytics_realtime/influx_writer.py

-    async def influx_writer_impl(record: Point):
-        await influx_write_api.write(bucket=influx_bucket, record=record)
+    async def influx_writer_impl(logger: Logger, record: Point):
+        with Timer(with_prefix(logger, "[influx]").debug):


Reply to: #89 (comment)

langid, influx and topic are the names of the tasks.
They do not map to the module names straightforwardly.
In which module a task is defined is random.
A simple module renaming breaks the format of logs.

fixed - used individual loggers

Allob · 2025-01-20T18:53:26Z

aidial_analytics_realtime/analytics.py

@@ -54,12 +51,18 @@ async def detect_lang_by_text(text: str) -> str | None:
    if not text:
        return None

+    logger = logging.getLogger("app.langid")


Let's move it outside of the function to a module level?

Allob · 2025-01-20T18:55:51Z

aidial_analytics_realtime/analytics.py

-    except Exception:
-        pass
+    except Exception as e:
+        logger.error(f"error: {str(e)}")


Why not logger.exception(e) to have stack trace included?

Allob · 2025-01-20T19:02:37Z

aidial_analytics_realtime/app.py

+        async def _task(i: int, message_str: str) -> dict:
+            add_logger_prefix(f"[{i}/{n}]")
+
+            async with Timer(logger.debug):


This timer results will likely to be skewed, because you will have a lot of tasks run concurrently due to asyncio.gather.
I'm not sure that this is what was expected here.

Allob · 2025-01-20T19:09:18Z

aidial_analytics_realtime/app.py

+        logger.info("success")
+        return {"status": "success"}
+    except starlette.requests.ClientDisconnect:
+        return _error("client disconnect")


Why not just use starlette.requests.ClientDisconnect here?
I'm not sure that all this extra code (including defining a function within a function) worth it since the error is not returned to the response anyway.

feat: improved logs

f0554f7

adubovik self-assigned this Dec 27, 2024

adubovik requested a review from Allob as a code owner December 27, 2024 14:22

adubovik added 5 commits December 30, 2024 11:21

Merge branch 'development' into feat/improved-logging

3d48014

feat: sending time's logs to debug

719e017

feat: added milliseconds to the logs

8e0370f

Merge branch 'development' into feat/improved-logging

6d06667

fix: minor fix

2add83f

Allob reviewed Jan 17, 2025

View reviewed changes

fix: removed log level setting for the root logger

9d5a86d

adubovik commented Jan 20, 2025

View reviewed changes

adubovik added 3 commits January 20, 2025 17:40

feat: migrated from LoggerAdapter to Custom filter

e3ca8a7

fix: removed logger argument everywhere

f153e85

fix: fixed issues with lost Context

950585d

Allob approved these changes Jan 21, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: improved logging #89

feat: improved logging #89

adubovik commented Dec 27, 2024 •

edited

Loading

Allob commented Jan 17, 2025

Allob Jan 17, 2025

adubovik Jan 20, 2025

Allob Jan 17, 2025 •

edited

Loading

adubovik Jan 20, 2025 •

edited

Loading

adubovik Jan 20, 2025 •

edited

Loading

adubovik Jan 20, 2025

Allob Jan 20, 2025

Allob Jan 20, 2025

Allob Jan 20, 2025

Allob Jan 20, 2025

feat: improved logging #89

Are you sure you want to change the base?

feat: improved logging #89

Conversation

adubovik commented Dec 27, 2024 • edited Loading

Allob commented Jan 17, 2025

Allob Jan 17, 2025

Choose a reason for hiding this comment

adubovik Jan 20, 2025

Choose a reason for hiding this comment

Allob Jan 17, 2025 • edited Loading

Choose a reason for hiding this comment

adubovik Jan 20, 2025 • edited Loading

Choose a reason for hiding this comment

adubovik Jan 20, 2025 • edited Loading

Choose a reason for hiding this comment

adubovik Jan 20, 2025

Choose a reason for hiding this comment

Allob Jan 20, 2025

Choose a reason for hiding this comment

Allob Jan 20, 2025

Choose a reason for hiding this comment

Allob Jan 20, 2025

Choose a reason for hiding this comment

Allob Jan 20, 2025

Choose a reason for hiding this comment

adubovik commented Dec 27, 2024 •

edited

Loading

Allob Jan 17, 2025 •

edited

Loading

adubovik Jan 20, 2025 •

edited

Loading

adubovik Jan 20, 2025 •

edited

Loading