
NEXT-37310 - Added single row import strategy on import error #35

Conversation

@CR0YD (Contributor) commented Aug 21, 2024

When encountering an error that we cannot handle (for now) during the import of a chunk, we currently return the whole error but skip the entire chunk.
I've added a new strategy where we try to import the chunk piece by piece to specifically filter out the rows that are invalid (the error is still returned to the user, cf. import.rs ll. 172 - 173).
There was also a small bug in remove_invalid_entries_from_chunk: if a row contained multiple invalid fields, not only the row itself but also some of the following rows were filtered (2 errors -> 1 extra row after the invalid one, and so on). To prevent that, I added a filter for duplicates in import.rs l. 255.
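The index-shift bug described above can be illustrated with a minimal sketch. The function name and types here are hypothetical; the real remove_invalid_entries_from_chunk in import.rs operates on the crate's own row type:

```rust
// Hypothetical sketch of the index-shift bug; not the crate's real code.
fn remove_invalid_entries(chunk: &mut Vec<String>, mut error_rows: Vec<usize>) {
    // A row with multiple invalid fields produces multiple errors that
    // point at the same index. Deduplicate first: otherwise every
    // duplicate index removes one additional row *after* the invalid
    // one, because earlier removals shift the remaining rows down.
    error_rows.sort_unstable();
    error_rows.dedup();
    // Remove from the back so the earlier indices stay valid.
    for &i in error_rows.iter().rev() {
        chunk.remove(i);
    }
}
```

Without the sort/dedup step, a row reported twice (two invalid fields) would also delete the valid row that slides into its position after the first removal.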

@LarsKemper (Member) left a comment

LGTM 👍

@MalteJanz (Contributor) left a comment

Great work 💪, I only added some small notes 🙂

src/data/import.rs (review thread, resolved)
CHANGELOG.md

```diff
@@ -1,4 +1,5 @@
 # NEXT-RELEASE
+- NEXT-37310 - Added single row import strategy when encountering an unhandleable error during a chunk import.
```

Review comment (Contributor) with suggested change:

```diff
-- NEXT-37310 - Added single row import strategy when encountering an unhandleable error during a chunk import.
+- NEXT-37310 - Added single row import strategy when encountering an error without a reference to a specific row during a chunk import.
```

@CR0YD (Author) replied:

I would not say that this formulation is better, as there are errors, e.g. a product with a duplicate product number, that reference a specific row but still exit the sync call with an error.
In my opinion it might be better to write something like "[...] when encountering an error that cannot be handled automatically during a chunk import".

Reply (Contributor):

> I would not say that this formulation is better

Of course I also wouldn't say it is better, just a suggestion my mind came up with. Feel free to discard it 🙂.

> as there are errors, e.g. a product with a duplicate product number, that reference a specific row but also exit the sync call with an error.

Does it? I thought that in case we have an "error pointer" we would just remove that invalid entry from the chunk and retry. Of course the user still sees the error.

And in the case where we don't have specific "error pointers", we fall back to a one-by-one import and report any errors encountered there, which still shouldn't fail the whole import.
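The one-by-one fallback described here can be sketched roughly as follows. The names and signatures are illustrative, under the assumption that a chunk import either succeeds or fails as a whole; this is not the crate's real API:

```rust
// Illustrative sketch of the single-row fallback strategy.
fn import_one_by_one<F>(rows: &[String], import: F) -> Vec<String>
where
    F: Fn(&[String]) -> Result<(), String>,
{
    let mut errors = Vec::new();
    for row in rows {
        // Retry each row alone so that only the truly invalid rows are
        // skipped; their errors are still collected and reported.
        if let Err(e) = import(std::slice::from_ref(row)) {
            errors.push(e);
        }
    }
    errors
}
```

Valid rows still get imported, and the whole import no longer fails just because one row in a chunk is rejected.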

@CR0YD (Author) commented Aug 28, 2024:

We only filter the rows that have a corresponding SwError::WriteError. But the error thrown when we have, for example, a duplicate product number is of type SwError::GenericError, so it won't be filtered by remove_invalid_entries_from_chunk.

And with my current implementation these faulty rows will still be skipped during the "single row import", as every row that triggers an automatically unhandleable error, e.g. the "duplicate product number" error (not deadlocks and the like!), will simply be ignored.
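A hypothetical mirror of that error split (the real SwError type in this repository may look different) shows why only WriteError rows can be filtered from the chunk directly:

```rust
// Hypothetical mirror of the error variants discussed above.
enum SwError {
    // Carries a pointer to the offending row, so a function like
    // remove_invalid_entries_from_chunk can filter that row out.
    WriteError { row: usize, message: String },
    // No row reference, e.g. a duplicate product number; errors like
    // this trigger the single-row fallback instead.
    GenericError(String),
}

// Collect only the row indices that the chunk filter can act on.
fn filterable_rows(errors: &[SwError]) -> Vec<usize> {
    errors
        .iter()
        .filter_map(|e| match e {
            SwError::WriteError { row, .. } => Some(*row),
            SwError::GenericError(_) => None,
        })
        .collect()
}
```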

src/data/import.rs (review thread, resolved)
@LarsKemper added the "bug" (Something isn't working) label on Aug 21, 2024
@CR0YD force-pushed the next-37310/added-single-import-strategy-on-unhandleable-error-on-import branch from 6e4a5ba to d41610a on August 28, 2024 at 12:05

Summary of the total line code coverage for the whole codebase:

| Total lines | Covered | Skipped | % (PR) | % (main) |
|---|---|---|---|---|
| 2154 | 1363 | 791 | 63.28 | 63.93 |

Summary of each file:

| File | Total lines | Covered | Skipped | % |
|---|---|---|---|---|
| src/api/filter.rs | 171 | 168 | 3 | 98.25 |
| src/api/mod.rs | 477 | 293 | 184 | 61.43 |
| src/cli.rs | 45 | 31 | 14 | 68.89 |
| src/config_file.rs | 76 | 56 | 20 | 73.68 |
| src/data/export.rs | 136 | 0 | 136 | 0.00 |
| src/data/import.rs | 184 | 0 | 184 | 0.00 |
| src/data/transform/mod.rs | 336 | 259 | 77 | 77.08 |
| src/data/transform/script.rs | 292 | 258 | 34 | 88.36 |
| src/data/validate.rs | 298 | 298 | 0 | 100.00 |
| src/main.rs | 139 | 0 | 139 | 0.00 |

Download full HTML report

You can download the full HTML report from this comment.
Hint: extract it locally and open index.html; there you can see which lines are not covered in each file.

You can also generate these reports locally

For that, you need to install cargo-llvm-cov, then you can run:

```shell
cargo llvm-cov --all-features --no-fail-fast --open
```

Hint: There are also other ways to see code coverage in Rust. For example with RustRover, you can execute tests with coverage generation directly in the IDE.

Remember

Your tests should be meaningful, not written just to raise the coverage.
Coverage is a tool to detect forgotten code paths you may want to think about, not an instructor telling you to write tests.

@CR0YD CR0YD merged commit 20940fd into main Aug 28, 2024
3 checks passed
@CR0YD CR0YD deleted the next-37310/added-single-import-strategy-on-unhandleable-error-on-import branch August 28, 2024 12:28
3 participants