Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add structured field and rule paths to Violation #265

Merged
merged 15 commits into from
Nov 26, 2024

Conversation

jchadwick-buf
Copy link
Member

@jchadwick-buf jchadwick-buf commented Oct 16, 2024

This PR introduces a new structured field path format, and uses it to provide a structured path to the field and rule of a violation.

  • The new message buf.validate.FieldPathElement is added.
    • It describes a single path segment, e.g. equivalent to a string like repeated_field[1]
    • Both the text name and field number of the field is provided; this allows the field path to be rendered into a string trivially without the need for descriptor lookups, and will work for e.g. unknown fields. (Example: A new field is marked required; old clients can still print the field path, even if they do not have the new field in their schema.)
    • It also contains the kind of field, to make it possible to interpret unknown field values.
    • Finally, it contains a subscript oneof. This contains either a repeated field index or a map key. This is needed because maps in protobuf are unordered. There are multiple map key entries, one for each distinctly encoded valid kind of map key.
  • The new message buf.validate.FieldPath is added. It just contains a repeated field of buf.validate.FieldPathElement
    • It would be possible to just have repeated buf.validate.FieldPathElement anywhere a path is needed to save a level of pointer chasing, but it is inconvenient for certain uses, e.g. comparing paths with proto.Equal.
  • Two new buf.validate.Violation fields are added: field and rule, both of type buf.validate.FieldPath. The old field_path field is left for now, but deprecated.
  • The conformance tests are updated to match the expectations.

Note that there are a number of very subtle edge cases:

  • In one specific case, field paths point to oneofs. In this case, the last element of the fieldpath will contain only the field name, set to the name of the oneof. The field number, field type and subscript fields will all be unset. This is only intended to be used for display purposes.
  • Only field constraints will output rule paths, because it is a relative path to the FieldConstraints message. (In other cases, constraint_id is always sufficient anyways, but we can change this behavior later.)
  • Custom constraints will not contain rule paths, since they don't have a corresponding rule field. (Predefined constraints will contain rule paths, of course.)

Implementations:

Copy link

github-actions bot commented Oct 16, 2024

The latest Buf updates on your PR. Results from workflow Buf CI / buf (pull_request).

BuildFormatLintBreakingUpdated (UTC)
✅ passed✅ passed✅ passed✅ passedNov 25, 2024, 5:23 PM

@jchadwick-buf jchadwick-buf marked this pull request as ready for review October 17, 2024 15:34
@jchadwick-buf jchadwick-buf marked this pull request as draft October 17, 2024 15:36
@jchadwick-buf
Copy link
Member Author

I finished adding rule_path to all violation cases in the conformance test suite, but unfortunately doing so did reveal a mistaken assumption I've been making: I've been thinking that we can just follow the field path and find the rule on its options, but, somewhat obviously in retrospect, the field the violation is on might not be the field that has the rule. Particularly in the repeated.items, map.keys and map.values cases. It might be possible to still work backwards by parsing the rule path and using it to back track from the field path...

@jchadwick-buf jchadwick-buf changed the title Add rule path to Violation message Add structured field and rule paths to Violation Nov 6, 2024
@jchadwick-buf jchadwick-buf marked this pull request as ready for review November 6, 2024 23:59
proto/protovalidate/buf/validate/validate.proto Outdated Show resolved Hide resolved
proto/protovalidate/buf/validate/validate.proto Outdated Show resolved Hide resolved
proto/protovalidate/buf/validate/validate.proto Outdated Show resolved Hide resolved
proto/protovalidate/buf/validate/validate.proto Outdated Show resolved Hide resolved
@jchadwick-buf
Copy link
Member Author

jchadwick-buf commented Nov 25, 2024

JH realized that our idea for handling the unknown map key types was not quite enough so I made a slight tweak: now FieldPathElement doesn't have a special case for sint* types and instead the map key and value protobuf kinds are conveyed directly. This is a little simpler anyways since it closer mirrors what you would want for both the reflection and unknown wire data cases. Adding map value type isn't strictly necessary, but it brings paths referring to primitives inside map values to parity with paths referring to primitives inside messages and repeated fields: it will be possible to display them even if the field is unknown. Seems like it's worth it for 2 bytes of wire data.

@jchadwick-buf jchadwick-buf merged commit 41573d9 into main Nov 26, 2024
6 checks passed
// value inside unknown fields through wire data.
optional google.protobuf.FieldDescriptorProto.Type value_type = 5;

// `subscript` contains a repeated index or map key, if this path element nests into a repeated or map field.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

repeated?

Copy link
Member Author

@jchadwick-buf jchadwick-buf Nov 27, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

FieldPathElement is meant to be logically equivalent to a path segment, e.g. repeated[1] or map["key"]. The subscript contains either a repeated field index or a map key. If this wording is confusing, I'm happy to update the documentation.

WDYT about this? (would "list index" be better?)

Suggested change
// `subscript` contains a repeated index or map key, if this path element nests into a repeated or map field.
// `subscript` contains a repeated field index or map key, if this path element nests into a repeated or map field.

jchadwick-buf added a commit to bufbuild/protovalidate-python that referenced this pull request Nov 27, 2024
Implements the changes necessary to propagate field and rule paths in
Violation messages, passing conformance.

Depends on bufbuild/protovalidate#265.

DO NOT MERGE until the following is done:
- [x] bufbuild/protovalidate#265 is merged
- [x] Dependencies updated to stable version release
jchadwick-buf added a commit to bufbuild/protovalidate-cc that referenced this pull request Nov 27, 2024
Implements the changes needed to pass conformance after
bufbuild/protovalidate#265.

DO NOT MERGE until the following is done:
- [x] bufbuild/protovalidate#265 is merged
- [x] Dependencies updated to stable version release
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants