[xquery-talk] Sparksoniq 0.9.1 Spruce: first alpha release
gfourny at inf.ethz.ch
Wed Jan 24 05:51:56 PST 2018
I am happy to announce the first alpha release of Sparksoniq: 0.9.1 Spruce, under an Apache 2.0 license.
You can try it out with its shell and its documentation on
In a nutshell, this is a JSONiq engine [see below what it has to do with XQuery and XML] that runs seamlessly on top of Spark. This engine was developed at ETH Zurich in collaboration with Stefan Irimescu, who wrote his Master's thesis with Gustavo Alonso and me last year and did an amazing job.
The core idea is that FLWOR expressions (identical to XQuery’s) very naturally map to Spark transformations, which allows a declarative and functional encapsulation, hiding Spark, Java and Scala from the user. This is consistent with Edgar Codd’s data independence requirement. This is also to put in context with several other ongoing works on the XML side such as Apache VXQuery (on Hyrack), PAXQuery (on Flink), etc.
Keep it mind that this is an early version with not yet the full language, and we will do best effort to address all the bugs that will certainly be found, while keeping a specific focus on querying large-scale JSON datasets. We look forward to constructive comments, bug reports, feature wish lists, missing aspects in the documentation, etc.
With many thanks and kind regards,
I am adding a small note on JSONiq to give some context:
JSONiq, which some of you already know, is XQuery 3.0’s little brother and a cousin of XQuery 3.1. JSONiq’s DNA is 95% XQuery and it has all the expression machinery of XQuery, but adapted to specifically query JSON documents in a document-store-like setting, so as to be appealing to those in the JSON community who feel a bit uneasy about angle brackets, QNames, URIs and processing instructions.
More information about the talk