fix(api): keep proto contents in bytes for longer #14446

sfoster1 · 2024-02-07T21:37:01Z

When we parse Python protocols, we were doing it by (1) making the contents into a string with decode('utf-8') and then (2) passing the string to ast.parse(). The problem with this is that decode('utf-8') does not apply "universal newlines", which means that the code object created by compiling the AST will have line numbers that are around twice what they should be under certain circumstances (Windows machine, CRLF file, mercury in the seventh house, etc). Then, when we go and display a nice error message about a syntax error or whatever, the user says "why is this error message pointing to a place past the end of my protocol".

This should fix that by keeping the protocol contents in bytes form all the way through, so that ast.parse() receives a bytes object that has never been through bytes.decode('utf-8'), which should preserve everything.
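To make the distinction concrete, here is a minimal sketch (not the code from this PR) of the three things the description relies on: a plain decode keeps the CRLF pairs, a text-mode read with universal newlines turns them into LF, and ast.parse() accepts bytes directly. The sample source and the helper names `decode_only`, `decode_with_universal_newlines`, and `parse_from_bytes` are made up for illustration. It does not try to reproduce the doubled line numbers, which depend on the platform and on how the file reached the parser.

```python
import ast
import io

# Hypothetical protocol source with Windows (CRLF) line endings.
RAW = b"x = 1\r\ny = 2\r\nz = x + y\r\n"


def decode_only(contents: bytes) -> str:
    """Plain decoding: the CRLF pairs survive untouched."""
    return contents.decode("utf-8")


def decode_with_universal_newlines(contents: bytes) -> str:
    """Text-mode reading with newline=None translates CRLF and CR to LF."""
    wrapper = io.TextIOWrapper(
        io.BytesIO(contents), encoding="utf-8", newline=None
    )
    return wrapper.read()


def parse_from_bytes(contents: bytes, filename: str = "<protocol>") -> ast.Module:
    """Hand the bytes straight to ast.parse(), with no intermediate str."""
    return ast.parse(contents, filename=filename)


if __name__ == "__main__":
    print(repr(decode_only(RAW)))
    print(repr(decode_with_universal_newlines(RAW)))
    tree = parse_from_bytes(RAW)
    # Line numbers of the three top-level statements.
    print([node.lineno for node in tree.body])
```

Passing bytes lets the parser do its own newline handling instead of depending on whatever happened to the text before it arrived.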

Testing

Use a protocol like the attached flex_mag_mod.py.zip (unzip it first) that has (1) an error and (2) CRLF newlines.
On Windows, simulate with opentrons_simulate and in the app.
Check that it says the error is on the right line, and not on roughly twice that line number.
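If the attached file is not to hand, a similar fixture can be generated. The following is a rough sketch under stated assumptions: the file name `crlf_protocol.py`, the helpers `write_crlf_fixture` and `reported_error_line`, and the file contents are all hypothetical, and the file is plain Python rather than a real protocol. It is written in binary mode so that no platform newline translation is applied on the way to disk.

```python
import ast
import tempfile
from pathlib import Path
from typing import Optional

# Line on which the deliberate syntax error sits (1-indexed).
ERROR_LINE = 4


def write_crlf_fixture(directory: Path) -> Path:
    """Write a small Python file with CRLF newlines and one syntax error."""
    lines = [
        "volume = 10",
        "",
        "def run(ctx):",
        "    total = = volume",  # deliberate error on line 4
        "    return total",
    ]
    path = directory / "crlf_protocol.py"
    # write_bytes avoids text-mode newline translation.
    path.write_bytes("\r\n".join(lines).encode("utf-8") + b"\r\n")
    return path


def reported_error_line(path: Path) -> Optional[int]:
    """Parse the file from bytes and return the SyntaxError line, if any."""
    try:
        ast.parse(path.read_bytes(), filename=str(path))
    except SyntaxError as error:
        return error.lineno
    return None


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        fixture = write_crlf_fixture(Path(tmp))
        print("CRLF count:", fixture.read_bytes().count(b"\r\n"))
        print("reported line:", reported_error_line(fixture))
```

This only checks the bytes path through ast.parse(); the real test is still to run the file through opentrons_simulate and the app on Windows.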

codecov · 2024-02-07T22:21:07Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (8398c83) 68.10% compared to head (dbe8417) 68.03%.
Report is 7 commits behind head on chore_release-7.2.0.

Additional details and impacted files

@@                   Coverage Diff                   @@
##           chore_release-7.2.0   #14446      +/-   ##
=======================================================
- Coverage                68.10%   68.03%   -0.08%     
=======================================================
  Files                     2518     2518              
  Lines                    72031    72027       -4     
  Branches                  9245     9245              
=======================================================
- Hits                     49059    49003      -56     
- Misses                   20770    20822      +52     
  Partials                  2202     2202

Flag	Coverage Δ
g-code-testing	`92.43% <ø> (-4.06%)`	⬇️

Flags with carried forward coverage won't be shown.

Files	Coverage Δ
api/src/opentrons/protocols/parse.py	`96.32% <ø> (-0.14%)`	⬇️
api/src/opentrons/protocols/types.py	`89.74% <ø> (ø)`
api/src/opentrons/util/entrypoint_util.py	`90.69% <ø> (ø)`

... and 5 files with indirect coverage changes

SyntaxColoring

Wow, juicy.

Very happy to have less encoding/decoding. Thank you!

api/src/opentrons/protocols/parse.py

SyntaxColoring

Thank you! CI is failing but it looks like some Python environment thing unrelated to this PR. This LGTM if tests and linting pass locally.

sfoster1 requested a review from a team as a code owner February 7, 2024 21:37

sfoster1 added 4 commits February 7, 2024 16:43

some little fixups

0a85029

i've become the joker

c9bcbec

format

0a4f918

the fool's debugger

f76f6c1

SyntaxColoring reviewed Feb 7, 2024

api/src/opentrons/protocols/parse.py (outdated)

comments

dbe8417

SyntaxColoring approved these changes Feb 9, 2024

sfoster1 merged commit 8dc6f79 into chore_release-7.2.0 Feb 9, 2024
18 of 22 checks passed

sfoster1 deleted the fix-py-error-numbers-crlf branch February 9, 2024 15:56
