Enable expressions in lists #537

NicoLaval · 2025-01-26T21:42:38Z

vpinna80 · 2025-01-27T07:56:42Z

Hi @NicoLaval, can you distinguish between "lists" and "listsComponent"?
To avoid introducing unwanted paths in the grammar, if "lists" uses "expr" in "inNotInExpr", then "listsComponent" should use "exprComponent" in "inNotInExprComponent".

Also, I think we must change the documentation to clarify that:

All items in the set must be scalars.
All items in the set must be convertible to the value domain of the value being tested,
or all items in the set must belong to the same value domain and the value being tested must be convertible to that domain.

This ensures that value domain errors can be detected at the semantic validation step.
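
A minimal sketch of how such a check might look at the semantic validation step. The conversion table, the (kind, domain) item representation, and the function names are assumptions made for illustration; they are not taken from the VTL specification or from any engine.

```python
# Hypothetical implicit conversions between value domains, as (source, target)
# pairs. Illustration only; these are not the VTL conversion rules.
IMPLICIT_CONVERSIONS = {("integer", "number")}


def convertible(source, target):
    """True if a value of domain `source` can be used where `target` is expected."""
    return source == target or (source, target) in IMPLICIT_CONVERSIONS


def check_in_set(tested_domain, items):
    """Validate the set of an `in` / `not_in` expression.

    `items` is a list of (kind, domain) pairs, where kind is "scalar" for a
    scalar item (hypothetical representation). Returns a list of error
    messages; an empty list means the set is valid.
    """
    if any(kind != "scalar" for kind, _ in items):
        return ["all items in the set must be scalars"]

    domains = [domain for _, domain in items]
    # Rule A: every item converts to the domain of the tested value.
    items_fit_tested = all(convertible(d, tested_domain) for d in domains)
    # Rule B: items share one domain and the tested value converts to it.
    tested_fits_items = len(set(domains)) == 1 and convertible(tested_domain, domains[0])

    if items_fit_tested or tested_fits_items:
        return []
    return ["value domain mismatch between the tested value and the set"]
```

Because both rules depend only on declared domains, not on data, the check can run before any dataset is read.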

NicoLaval · 2025-01-30T14:13:08Z

Hi @vpinna80,

The G4 grammar is updated, with two branches: one for scalars and one for components.

I also renamed lists to list: I think the plural didn't make sense.

Regarding the docs, please provide your updates.

Shouldn't we also add examples with components?

javihern98 · 2025-01-30T14:34:14Z

I do not understand the use of listComponent. I believe the original proposal was fine if there are no ambiguities.

NicoLaval · 2025-01-30T14:51:48Z

> I do not understand the use of listComponent. I believe the original proposal was fine if there are no ambiguities.

Maybe @vpinna80 wants to be able to write something like:

ds_out := ds_in [calc ind_me1 := me1 in { a, b }] where a and b are components of ds_in?

NicoLaval · 2025-01-30T14:54:42Z

> > I do not understand the use of listComponent. I believe the original proposal was fine if there are no ambiguities.
>
> Maybe @vpinna80 wants to be able to write something like:
>
> ds_out := ds_in [calc ind_me1 := me1 in { a, b }] where a and b are components of ds_in?

On the other hand, this no longer allows us to compare a component to constant scalars.

vpinna80 · 2025-01-30T15:12:05Z

Yes, only variables which are components can be used inside the square brackets, no other.

@NicoLaval Regarding the docs, I cannot update your pull request, but you should be able to edit the doc file directly on GitHub:
https://github.com/sdmx-twg/vtl/blob/Fix/improve-lists/v2.1/docs/reference_manual/operators/Comparison%20operators/Element%20of/content.rst

javihern98 · 2025-01-30T19:02:34Z

> Yes, only variables which are components can be used inside the square brackets, no other.

Just to clarify: as I understand your suggestion, the in operator would therefore accept:

A set of scalars
A list of components, when used inside a calc and the names in the list are also components of the dataset

Having seen both use cases, I still think it makes no sense to have both list and listComponent. In any case, supporting component names inside the in operator is outside the scope of this issue.

hadrienk · 2025-01-31T09:06:58Z

I think adding listComponent is the only way to ensure that expr and exprComp are similar.

Why don't you think it makes sense, @javihern98?

javihern98 · 2025-01-31T09:11:20Z

> I think adding listComponent is the only way to ensure that expr and exprComp are similar.
>
> Why don't you think it makes sense, @javihern98?

Well, this token will be used the same way and have the same syntax whether or not we use it inside a calc, and we cannot use it outside of the in operator. So I find separating it into list and listComponent redundant.

hadrienk · 2025-01-31T09:11:45Z

> Yes, only variables which are components can be used inside the square brackets, no other.

Why do you think external variables should not be possible?

In any case, I think this can easily be a runtime check, not a grammar check.
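
A rough sketch of what such an engine-side check could look like if the restriction were kept. The tagged-tuple representation of list items and the function name are hypothetical and not taken from any VTL engine.

```python
def unknown_identifiers(component_names, list_items):
    """Return the identifiers in an `in` list that are not dataset components.

    `list_items` is a list of ("id", name) or ("const", value) tuples, a
    hypothetical representation of the parsed list. Constants are ignored;
    only identifiers are checked against the dataset structure.
    """
    known = set(component_names)
    return [value for kind, value in list_items
            if kind == "id" and value not in known]


# ds_in has components id_1, me1 and a; b is not one of them.
print(unknown_identifiers(["id_1", "me1", "a"],
                          [("id", "a"), ("id", "b"), ("const", "x")]))
```

With a check like this, the grammar can accept any expression inside the braces and the engine decides, once the dataset structure is known, whether each identifier is allowed.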

hadrienk · 2025-01-31T09:16:21Z

> > I think adding listComponent is the only way to ensure that expr and exprComp are similar.
> > Why don't you think it makes sense, @javihern98?
>
> Well, this token will be used the same way and have the same syntax whether or not we use it inside a calc, and we cannot use it outside of the in operator. So I find separating it into list and listComponent redundant.

Ah, I see. I think this boils down to the duplicated expr branches then. I think we touched upon the subject in Salamanca. The whole exprComp rule branch only exists to account for a few disparities that only apply to components. I don't remember what they were, but this really should be fixed.

@NicoLaval, didn't we create an issue about this?

We ended up patching the grammar in Trevas to only have one expr rule.

NicoLaval · 2025-01-31T09:30:46Z

> > > I think adding listComponent is the only way to ensure that expr and exprComp are similar.
> > > Why don't you think it makes sense, @javihern98?
> >
> > Well, this token will be used the same way and have the same syntax whether or not we use it inside a calc, and we cannot use it outside of the in operator. So I find separating it into list and listComponent redundant.
>
> Ah, I see. I think this boils down to the duplicated expr branches then. I think we touched upon the subject in Salamanca. The whole exprComp rule branch only exists to account for a few disparities that only apply to components. I don't remember what they were, but this really should be fixed.
>
> @NicoLaval, didn't we create an issue about this?
>
> We ended up patching the grammar in Trevas to only have one expr rule.

Hi @hadrienk, @javihern98,
No, we didn't; other TF members were not really in favor of this change.

Beyond producing a tree twice as big, the duplication does not allow mixing the types (I will open a PR for the Levenshtein distance later today, and I will have the same problem there).

In this context, for instance, consider:

ds_out := ds_in [calc a := me_1 in { me_2, "default" }][drop me_1, me_2];

With ds_in like:

id_1	me_1	me_2
1	"foo"	"foo"
2	"foo"	"bar"
3	"default"	"baz"

To produce ds_out:

id_1	a
1	true
2	false
3	true

This is not valid while we keep this expr/exprComponent distinction.

However, it seems to me that defining this script makes sense.
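
To make the intended semantics concrete, here is a small sketch that evaluates the calc row by row, resolving each list item either as a component of the current row or as a constant. The tagged-tuple representation and the helper names are assumptions for illustration, not how any engine implements it.

```python
ds_in = [
    {"id_1": 1, "me_1": "foo", "me_2": "foo"},
    {"id_1": 2, "me_1": "foo", "me_2": "bar"},
    {"id_1": 3, "me_1": "default", "me_2": "baz"},
]


def resolve(item, row):
    """Resolve a ("component", name) or ("constant", value) item for one row."""
    kind, value = item
    return row[value] if kind == "component" else value


def calc_in(rows, target, left, items, drop):
    """Add `target` := left in { items } to each row, then drop components."""
    out = []
    for row in rows:
        new_row = dict(row)
        candidates = [resolve(item, row) for item in items]
        new_row[target] = resolve(left, row) in candidates
        for name in drop:
            new_row.pop(name, None)
        out.append(new_row)
    return out


# ds_out := ds_in [calc a := me_1 in { me_2, "default" }][drop me_1, me_2];
ds_out = calc_in(
    ds_in,
    target="a",
    left=("component", "me_1"),
    items=[("component", "me_2"), ("constant", "default")],
    drop=["me_1", "me_2"],
)
for row in ds_out:
    print(row)
# {'id_1': 1, 'a': True}
# {'id_1': 2, 'a': False}
# {'id_1': 3, 'a': True}
```

The list mixes a component and a constant, and a single resolve step handles both, which is the case the separate expr and exprComponent branches make hard to express.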

vpinna80 · 2025-01-31T09:43:34Z

Why is it not valid? I tested it with the change and it parses correctly.

expr:
    | left=expr op=(IN|NOT_IN)(lists|valueDomainID)                         # inNotInExpr

exprComponent:
    | left=exprComponent op=(IN|NOT_IN)(listsComponent|valueDomainID)          # inNotInExprComp

lists:
    GLPAREN  expr (COMMA expr)*  GRPAREN

listsComponent:
    GLPAREN  exprComponent (COMMA exprComponent)*  GRPAREN

NicoLaval · 2025-01-31T10:13:46Z

> Why is it not valid? I tested it with the change and it parses correctly.
>
> expr:
>     | left=expr op=(IN|NOT_IN)(lists|valueDomainID)                         # inNotInExpr
>
> exprComponent:
>     | left=exprComponent op=(IN|NOT_IN)(listsComponent|valueDomainID)          # inNotInExprComp
>
> lists:
>     GLPAREN  expr (COMMA expr)*  GRPAREN
>
> listsComponent:
>     GLPAREN  exprComponent (COMMA exprComponent)*  GRPAREN

Syntactic validation passes, but if your engine resolves expr and exprComponent differently, it will fail.

And if it doesn't fail, that is because you infer the types at runtime; in that case, why keep the two branches?

NicoLaval · 2025-04-19T14:18:17Z

@linardian, why was this closed and the branch deleted?

linardian · 2025-04-21T12:34:49Z

I fixed the error in 2.1. Since the Levenshtein operator was not defined according to the agreed syntax, I dropped all the changes to avoid confusion. If you do not agree, please let me know how to proceed. Best, Angelo.

NicoLaval · 2025-04-21T17:50:21Z

> I fixed the error in 2.1. Since the Levenshtein operator was not defined according to the agreed syntax, I dropped all the changes to avoid confusion. If you do not agree, please let me know how to proceed. Best, Angelo.

Hi Angelo,

Simply restore the branch and let the discussion and code adjustments take place.

Thanks

hadrienk · 2025-04-21T18:09:34Z

> I fixed the error in 2.1. Since the Levenshtein operator was not defined according to the agreed syntax, I dropped all the changes to avoid confusion. If you do not agree, please let me know how to proceed.

Hi Angelo. Until a branch is merged, it has no impact on the current version, so there is no need to close or delete it.

linardian · 2025-04-28T09:24:04Z

Yes, you are right. But since we have just finished building the baseline for 2.2, I preferred to avoid any possible error.

Enable expressions in lists

6430096

NicoLaval requested review from vpinna80 and javihern98 January 26, 2025 21:43

NicoLaval mentioned this pull request Jan 26, 2025

Collections #506

Open

NicoLaval had a problem deploying to github-pages January 26, 2025 21:46 — with GitHub Actions Failure

NicoLaval added 2 commits January 30, 2025 14:51

Merge branch 'master' into Fix/improve-lists

9dc126f

Update list in G4

49610de

Merge branch 'master' into Fix/improve-lists

2a91ef9

linardian closed this Mar 27, 2025

linardian deleted the Fix/improve-lists branch March 27, 2025 13:00
