in reply to [OT] Re^3: Examples or tutorials for Perl grammars?
in thread Examples or tutorials for Perl grammars?

Excuse my ignorance, but is there a standard for queries searching in JSON like nested structures?

Cheers Rolf
(addicted to the Perl Programming Language :)
Wikisyntax for the Monastery

  • Comment on Re: [OT] Re^3: Examples or tutorials for Perl grammars?

Replies are listed 'Best First'.
Re^2: [OT] Re^3: Examples or tutorials for Perl grammars?
by erix (Prior) on Jul 09, 2021 at 17:20 UTC

    The SQL Standard, in the part called SQL/JSON, describes syntax, yes. Some DBMSen have it implemented already (for quite some time). PostgreSQL has the functionality but not yet the required syntax.

    (In fact the implementation of that syntax for PostgreSQL is waiting to be committed, so I suppose it will be in version 15, next year. It works quite well, and I was able to use the ORACLE and DB2 documentation for SQL/JSON when I was trying it out and looking for examples of that standard-compliant syntax)

    PS
    There is a link to a SQL/JSON pdf (zipped) in the Bibliography part of the PostgreSQL docs:

    PostgreSQL Bibliography

    update: fixed typo's!

      Can I express xpath like queries, like find all elements which are descendants of another element but the direct parent is matching a special criterion?

      How would that look like in SQL?

      Edit

      I'm aware that xpath is for XML and not JSON. Just ignore the missing axes...

      Cheers Rolf
      (addicted to the Perl Programming Language :)
      Wikisyntax for the Monastery

        The SQL-standard's SQL/JSON description uses the term JSON PATH but does not refer to XPath. The postgres docs however say that jsonpath is inspired by XPath.

        "SQL/JSON path expressions specify the items to be retrieved from the JSON data, similar to XPath expressions used for SQL access to XML"

        Here is an example. The JSON table that I use here consists of millions of rows, each row with a so-called JSON document, not very large, and not very deeply nested. It seems to me that this is/will be typical usage. I retrieved test-JSON data from https://extendsclass.com/json-generator.html (h/t choroba).

        One example row with its JSON content:

        { "age": 63, "bool": true, "city": "New York City", "date": "1986-04-03", "enum": "json", "array": [ "Lila", "Vanessa", "Daune", "Brandise", "Jsandye" ], "regEx": "helloooooooooooooooo" "random": 78, "country": "Iceland", "friends": [ { "country": "CA", "lastname": "Yusuk", "firstname": "Deirdre" }, { "country": "AZ", "lastname": "Suanne", "firstname": "Elfreda" }, { "country": "FM", "lastname": "Magdalen", "firstname": "Britte" } ], "lastname": "Hailee", "firstname": "Jaime", "countryCode": "IE", "random float": 6.529, "array of objects": [ { "index": 0, "index start at 5": 5 }, { "index": 1, "index start at 5": 6 }, { "index": 2, "index start at 5": 7 } ], "email from expression": "Jaime@x.com" "email uses current data": "Jaime@x.com" }

        Here is a query example. The @@ is a postgresql operator, but the part in the single quotes is a jsonpath expression which is SQL/JSON-compliant.

        select id, js->'friends' as friends from jsontable where js @@ '$.friends[*].country == "AZ" && $.friends[*].lastname starts with "Y" ' ; id | friends ---------+------------------------------------- 110877 | [ | { | "country": "CA", | "lastname": "Yusuk", | "firstname": "Deirdre" | }, | { | "country": "AZ", | "lastname": "Suanne", | "firstname": "Elfreda" | }, | { | "country": "FM", | "lastname": "Magdalen", | "firstname": "Britte" | } | ] 116322 | [ | { | "country": "AZ", | "lastname": "Ietta", | "firstname": "Wendi" | }, | { | "country": "NG", | "lastname": "Yuille", | "firstname": "Giustina" | }, | { | "country": "MS", | "lastname": "Tengdin", | "firstname": "Adelle" | } | ] 117778 | [ | { | "country": "CK", | "lastname": "Fancie", | "firstname": "Fanny" | }, | { | "country": "JM", | "lastname": "Jerald", | "firstname": "Mureil" | }, | { | "country": "AZ", | "lastname": "Yam", | "firstname": "Wileen" | } | ] [[...]]

        This query returns ~1000 rows (above only 3 are shown) from a 3 GB table with 3 M rows. Thanks to an index on the JSON column (js), the query returns in 120 ms.

        YMMV. It is probably more interesting for you to try it out on a real instance. I tried a sqlfiddle but it runs an old (near-EOL) version of postgres (9.6) that understands JSON data but cannot query in the same way as I did here locally (@@ and @? not yet implemented). I did a different search there ( json1 @> json2, i.e., json2-in-json1 containment).

        sqlfiddle 1

        where js @> '{"country": "Thailand"}'

        sqlfiddle 2

          and js @> '{"friends": [{"country": "AE"}]}'

        db-fiddle has a newer postgres (13)

        https://www.postgresql.org/docs/current/datatype-json.html#DATATYPE-JSONPATH