Automatically add action entities to `Entities`; validate entities against schema #360

cdisselkoen · 2023-10-13T18:29:10Z

Description of changes

Entities::from_*() now automatically adds action entities from the schema, if a schema is provided
Entities::from_*() now validates entities against the schema, if a schema is provided
Entities::add_*() now validates entities against the schema, if a schema is provided
Adds Entity::validate() to the public API, which validates a single entity against a schema
Entities::from_entities() and Entities::add_entities() now take an optional schema argument
Significant refactoring of entity-validation code in Core to facilitate all of the above changes
includes refactoring of some errors and tweaks to some error messages
some small refactors are also included in this PR that could technically be independent, e.g. passing schemas by reference to EntityJsonParser. If folks feel it's important, I can split those into a separate PR to declutter this one slightly. But this one is probably going to be a big PR regardless

Issue #, if available

Fixes #286

Checklist for requesting a review

The change in this PR is (choose one, and delete the other options):

A breaking change requiring a major version bump to cedar-policy (e.g., changes to the signature of an existing API).

I confirm that this PR (choose one, and delete the other options):

Updates the "Unreleased" section of the CHANGELOG with a description of my change (required for major/minor version bumps).

I confirm that cedar-spec (choose one, and delete the other options):

I'm not sure how my change impacts cedar-spec. (Post your PR anyways, and we'll discuss in the comments.)

Disclaimer

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

cdisselkoen · 2023-10-16T18:25:27Z

Pushed some tests and I'm marking this ready for review.

cedar-policy-core/src/entities.rs

cedar-policy-core/src/entities/json/jsonvalue.rs

cedar-policy-core/src/entities/conformance.rs

cedar-policy/src/api.rs

cedar-policy-core/src/entities/json/entities.rs

cedar-policy/src/api.rs

Co-authored-by: Andrew Wells <130512013+andrewmwells-amazon@users.noreply.github.com>

cdisselkoen · 2023-10-24T16:21:37Z

I remembered another reason why it might make sense to do it the way this PR currently has it (optional schema argument for the Entities constructors, standalone validation function for Entity). If you do the optional schema argument in both places, and pass the schema argument both times (which we should encourage users to do, if they have a schema), then this would result in double-validation on the Entities::from_entities() path, wasting work. We should try to have validation happen only once. Either it happens when constructing Entities, as currently in this PR, or it happens when constructing Entity and then Entities::from_entities() doesn't need to revalidate. The latter might make more sense, for the reasons you bring up. It might be a little more confusing; Entities::from_entities() probably still needs the optional schema argument for the purpose of adding schema actions, but won't be using that schema to perform validation. Nonetheless, it allows us to have an arguably better interface and avoid Entity::validate()? Thoughts?

khieta · 2023-10-24T16:58:38Z

Thinking about it some more... adding a schema argument to Entity::new might cause more problems than it solves. Like you said, we don't want to re-validate every Entity when constructing an Entities object. But in order to know whether an entity was already validated (and what schema it was validated with), we'd have to store a pointer to a (optional) schema in every entity.

So I retract my suggestion. I think it's ok to only do validation on constructing an Entities object, because this is the object that will be passed to the top-level is_authorized function. Users can use Entity::validate as a debugging tool in case they get confused when validation of a full Entities object fails.

New suggestion: update the comment for the public-facing validate function to say that this operation will be automatically performed when constructing an Entities object, so users don't wonder if they need to do Entity::validate in addition to passing the schema to Entities::from_*.

Re #191: now I'm torn on whether we want a separate validate function or an optional schema argument. The first would be consistent with Entity::new while the second would be consistent with Entities::from_*. Request::new is similar to Entities::from_* because it constructs an input for the top-level is_authorized function. But Entity::new and Request::new have the same name 🤔

cdisselkoen · 2023-10-24T17:45:11Z

in order to know whether an entity was already validated (and what schema it was validated with), we'd have to store a pointer to a (optional) schema in every entity.

I don't think this is necessarily the case. Since validation is optional anyway, we can just say we have an internal invariant that if you have an Entity object, it's already been validated to the extent you want it validated (and we ensure that you have the option to validate on every path to constructing an Entity object). Then Entities::from_entities() could skip validation based on this internal invariant.

Alternately if we want to be a little more explicit, we could just have a bool on each Entity indicating whether it's been validated, without referencing the schema it was validated with. This would work for the (vast majority of?) users who only have one schema, and who won't update their schema without restarting their app (and thus rebuilding the in-memory entity objects). It's a little unclear how we would use this bool, probably Entities::from_entities() could perform validation on anything not already validated (which only helps if Entities::from_entities() was given a schema but Entity::new() wasn't), maybe we could also expose a getter so users could check if a given entity was validated (but we don't currently offer that for Entities either; but we could).

Users can use Entity::validate() as a debugging tool

Yeah, I think this is the motivating use case for Entity::validate(), in the version in which it currently exists in this PR.

update the comment for the public-facing validate function ...

Yeah, this is certainly easy and good to do, if we stay with the way it currently is in this PR.

I'm still not convinced we don't want to add an optional schema argument to Entity::new() after all, and just have the internal invariant described above. As a bonus, users might get more targeted or immediate errors -- we can fail creating the Entity long before the user attempts to add it to an Entities.

Re #191

This is another reason why adding the optional schema argument to Entity::new() might make more sense.

cdisselkoen · 2023-10-26T14:44:42Z

I note that in the discussion on RFC 11 we decided not to add an optional schema argument to the existing Request::new(), but we also did not decide to write a separate Request::validate(). We proposed a middle ground, a Request::new_with_validation(). In the same spirit, in this PR we could have Entity::new_with_validation() instead of Entity::validate() or adding an optional argument to Entity::new(). This would still have the double-validation problem unless we removed validation from Entities::from_entities().

mwhicks1 · 2023-10-26T17:03:22Z

To add my 2c: I like the idea of just supporting validation on Entities, and leaving Entity::new() alone. @khieta's reasoning is persuasive to me:

I think it's ok to only do validation on constructing an Entities object, because this is the object that will be passed to the top-level is_authorized function.

On a different topic: While it might be useful to have an Entity::validate() for debugging purposes, adding it just increases the APIs that we need to support, going forward. I think there's something to be said keeping the number of APIs as small and simple as possible, unless there is a compelling need. (If you want to validate a single entity, can you not create a singleton Entities and validate that?) I'm not getting the sense there is a compelling need; if one emerges, we can add this later.

cdisselkoen · 2023-10-27T13:24:20Z

Makes sense to me. I have now removed Entity::validate(), and left everything else as it already was in the PR.

…ainst schema (cedar-policy#360) Co-authored-by: Andrew Wells <130512013+andrewmwells-amazon@users.noreply.github.com> Co-authored-by: Kesha Hietala <khieta@amazon.com>

cdisselkoen added 7 commits October 13, 2023 17:46

for entities parsed from json, add action entities from schema

2dca4c3

Entities::from_entities adds actions from schema

e439d17

refactor entity-schema-conformance errors

0b083d8

validate entities against schema

a741f9e

more

9ee479a

update changelog and add Entity::validate()

6916717

tests

f1f9040

cdisselkoen marked this pull request as ready for review October 16, 2023 18:25

cdisselkoen requested review from john-h-kastner-aws, khieta and shaobo-he-aws October 16, 2023 18:25

cdisselkoen added 2 commits October 16, 2023 18:43

cargo fmt

4c80128

changelog tweaks

84be668

cdisselkoen mentioned this pull request Oct 16, 2023

Bugfix/aaronjeline/358 #365

Merged

3 tasks

cdisselkoen added 2 commits October 17, 2023 14:08

fix clippy

debbb89

Merge remote-tracking branch 'origin/main' into add-action-entities

ca4987e

john-h-kastner-aws reviewed Oct 18, 2023

View reviewed changes

cedar-policy-core/src/entities.rs Outdated Show resolved Hide resolved

cedar-policy-core/src/entities/json/jsonvalue.rs Outdated Show resolved Hide resolved

cdisselkoen added 3 commits October 18, 2023 15:17

adjust wording in comments

4bee8c4

fix bad merge

38ba22b

cargo fmt

dd5082f

andrewmwells-amazon approved these changes Oct 18, 2023

View reviewed changes

cdisselkoen and others added 8 commits October 18, 2023 15:52

Apply suggestions from code review

e07d833

Co-authored-by: Andrew Wells <130512013+andrewmwells-amazon@users.noreply.github.com>

renames suggested in code review

59e2b2f

Update cedar-policy-core/src/entities/json/err.rs

84ad2b9

Co-authored-by: Andrew Wells <130512013+andrewmwells-amazon@users.noreply.github.com>

tweak doc comment

ee1acb6

Update cedar-policy-validator/src/schema.rs

03274ae

Co-authored-by: Andrew Wells <130512013+andrewmwells-amazon@users.noreply.github.com>

fix tests

bf23cf3

change wording in some docs

0eac125

change wording in some more docs

3f85344

fix doc comment

5426eea

cdisselkoen mentioned this pull request Oct 24, 2023

Better entity type suggestions when encountering unexpected entity types #380

Open

2 tasks

update doc comment on private field

ea002dc

cdisselkoen mentioned this pull request Oct 24, 2023

Should type_of_restricted_expr() mark all record attributes as optional? #382

Closed

2 tasks

cdisselkoen added 2 commits October 24, 2023 20:40

Merge branch 'main' into add-action-entities

6a59916

cargo fmt

1aef41d

shaobo-he-aws approved these changes Oct 24, 2023

View reviewed changes

reorg/simplify errors

de4e2ee

khieta approved these changes Oct 25, 2023

View reviewed changes

cdisselkoen added 3 commits October 26, 2023 14:14

Merge branch 'main' into add-action-entities

30b3e2c

Merge branch 'main' into add-action-entities

2186ae5

fix bad merge

8037b64

cdisselkoen added 2 commits October 26, 2023 18:16

Merge branch 'main' into add-action-entities

cce8ba8

remove Entity::validate()

588e76b

cdisselkoen added 3 commits October 27, 2023 13:41

fix tests

f1fc487

fix warning

e654278

Merge branch 'main' into add-action-entities

22ca435

khieta approved these changes Oct 27, 2023

View reviewed changes

cdisselkoen merged commit c4bbbb3 into main Oct 30, 2023

cdisselkoen deleted the add-action-entities branch October 30, 2023 13:08

khieta mentioned this pull request Oct 30, 2023

Optionally validate requests #393

Merged

3 tasks

sarahcec added the 3.0 label Nov 27, 2023

shaobo-he-aws pushed a commit that referenced this pull request May 20, 2025

updates for cedar#970 (#360)

6c4f954

Automatically add action entities to Entities; validate entities against schema #360

Automatically add action entities to Entities; validate entities against schema #360

Uh oh!

Conversation

cdisselkoen commented Oct 13, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description of changes

Issue #, if available

Checklist for requesting a review

Disclaimer

Uh oh!

cdisselkoen commented Oct 16, 2023

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cdisselkoen commented Oct 24, 2023

Uh oh!

khieta commented Oct 24, 2023

Uh oh!

cdisselkoen commented Oct 24, 2023

Uh oh!

cdisselkoen commented Oct 26, 2023

Uh oh!

mwhicks1 commented Oct 26, 2023

Uh oh!

cdisselkoen commented Oct 27, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

8 participants

Automatically add action entities to `Entities`; validate entities against schema #360

Automatically add action entities to `Entities`; validate entities against schema #360

cdisselkoen commented Oct 13, 2023 •

edited

Loading