Enable cache access #169

t-nil · 2024-01-01T16:31:53Z

Hey,

I really want to be able to edit the cache. So I started refactoring a bit. Two things:

As you can see, I started to replace the Arc<> with plain clones of String and PathBuf. Usage was inconsistent across the files and it seems unnecessary. I assume you wanted to gain speed or save RAM? Copying the FFmpeg params struct, even over something like 20 samples and 10 CRF search runs, should use negligible RAM compared to what the video encoder takes, and speed should not be anywhere near a bottleneck for any use case. If anything, the thread-safe access across tokio threads and the mutex locking could make things slower compared to cheap memcpys (at least that's my understanding). Would you be OK with me simplifying those structs?
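To make the comparison concrete, here is a minimal sketch of the two styles. The struct and field names are hypothetical and not taken from the codebase. Cloning the owned version copies the string and path data, while cloning the `Arc` version only bumps atomic reference counts; with a handful of short strings per sample, either cost is small.

```rust
use std::path::PathBuf;
use std::sync::Arc;

/// Hypothetical stand-in for the encoder parameters; the field names are
/// illustrative and not taken from the actual codebase.
#[derive(Debug, Clone, PartialEq)]
pub struct ParamsOwned {
    pub input: PathBuf,
    pub vcodec: String,
    pub preset: String,
}

/// The same data with shared, reference-counted fields.
#[derive(Debug, Clone)]
pub struct ParamsShared {
    pub input: Arc<PathBuf>,
    pub vcodec: Arc<str>,
    pub preset: Arc<str>,
}

/// Builds one owned copy of the parameters per sample by plain cloning.
/// Each clone allocates and copies the underlying bytes.
pub fn per_sample_owned(base: &ParamsOwned, samples: usize) -> Vec<ParamsOwned> {
    (0..samples).map(|_| base.clone()).collect()
}

/// Builds one handle per sample. Each clone only increments the atomic
/// reference counts; the underlying data is shared.
pub fn per_sample_shared(base: &ParamsShared, samples: usize) -> Vec<ParamsShared> {
    (0..samples).map(|_| base.clone()).collect()
}
```

The owned variant is simpler to pass around and compare, which is the simplification being proposed here.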
Can I ask why you use sled? Again, performance should not be an issue (a few ms of lookup vs. minutes to hours of encoding), and with JSON we could
1. edit the cache with text editors (primarily to delete entries),
2. version it in Git (when experimenting or tracking down bugs), and best of all
3. save cleartext params instead of only hashes (so we can precisely look up past runs).
Would you allow me to change the backend to JSON (or similar)? It would probably be easier than maintaining two formats, and should also make development a lot easier since you can just look at the runs directly.

I don't want to come off as criticizing. I'm sure you noticed I enjoy working with and on your program, and design decisions at the start are hard, if not impossible, to get right :) So thanks again!

alexheretic · 2024-01-02T18:39:42Z

I do intend to replace sled, I think with SQLite. Having multiple small JSON files is very inefficient on most filesystems, and having one or a few big JSON files is kind of just rolling our own embedded DB.

> I really want to be able to edit the cache

This isn't something I would generally envision users doing. What's your motivation for doing so?

For example, perhaps we could store a time with each cache entry and offer a cache pruning command and a cache clearing command in the binary.
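A rough sketch of what pruning and clearing could look like, assuming each entry carries the time it was written. The `Entry` type, the `u64` key, and the function names are hypothetical, and an in-memory `HashMap` stands in for whatever storage backend is used.

```rust
use std::collections::HashMap;
use std::time::{Duration, SystemTime};

/// Hypothetical cache entry: a stored result plus the time it was written.
#[derive(Debug, Clone, PartialEq)]
pub struct Entry {
    pub vmaf: f32,
    pub stored_at: SystemTime,
}

/// Removes entries older than `max_age` relative to `now`.
/// Returns the number of entries removed.
pub fn prune(cache: &mut HashMap<u64, Entry>, now: SystemTime, max_age: Duration) -> usize {
    let before = cache.len();
    cache.retain(|_, entry| match now.duration_since(entry.stored_at) {
        Ok(age) => age <= max_age,
        // `stored_at` is later than `now` (e.g. the clock was changed); keep the entry.
        Err(_) => true,
    });
    before - cache.len()
}

/// Removes every entry. Returns the number of entries removed.
pub fn clear(cache: &mut HashMap<u64, Entry>) -> usize {
    let removed = cache.len();
    cache.clear();
    removed
}
```

Passing `now` in as a parameter, rather than calling `SystemTime::now()` inside `prune`, keeps the function deterministic and easy to test.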

> saving cleartext params instead of only hashes (so we can precisely look up past runs).

It would be interesting to more eagerly look up info for crf-searches, but I do like the simplicity of the current approach and the determinism of the search itself. I will also probably keep hashing filenames, as it makes the cache less sensitive privacy-wise.
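As an illustration of keying by a hashed filename, here is a minimal std-only sketch. It is not the hashing the project uses, which is not shown in this thread. FNV-1a is chosen only because it is stable across runs and fits in a few lines; it is not cryptographic, so a real cache would want a stronger hash. Hashing only the file name (not the full path) and the `cache_key` function itself are assumptions for the example.

```rust
use std::path::Path;

// Standard 64-bit FNV-1a constants.
const FNV_OFFSET: u64 = 0xcbf2_9ce4_8422_2325;
const FNV_PRIME: u64 = 0x0000_0100_0000_01b3;

/// 64-bit FNV-1a over a byte slice. Stable across runs and platforms,
/// but NOT cryptographic; used here purely for illustration.
fn fnv1a(bytes: &[u8]) -> u64 {
    let mut hash = FNV_OFFSET;
    for b in bytes {
        hash ^= u64::from(*b);
        hash = hash.wrapping_mul(FNV_PRIME);
    }
    hash
}

/// Hypothetical helper: derives a cache key from the file name only, so the
/// cache stores a hex digest instead of the readable name.
/// Returns `None` if the path has no file name component.
pub fn cache_key(input: &Path) -> Option<String> {
    let name = input.file_name()?;
    Some(format!("{:016x}", fnv1a(name.to_string_lossy().as_bytes())))
}
```

The trade-off discussed above is visible here: the key no longer reveals the file name, but it also cannot be read back or edited meaningfully by hand.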

WIP

9057dd8
