Fork me on GitHub

Oak XPath Query Grammar - Oak Documentation


Query

/jcr:root filter
 
...
 
( column
 
| ...
)
 
order by ordering
 
, ...
 
queryOptions

The “/jcr:root” means the root node. It is recommended that all XPath queries start with this term.

All queries should have a path restriction (even if it's just, for example, “/content”), as this allows to shrink indexes.

“order by” may use an index. If there is no index for the given sort order, then the result is fully read in memory and sorted before returning the first row.

The column list is usually not needed, as all properties of the nodes are returned in any case. It is only needed if non-standard (computed) columns such as “rep:excerpt” or “rep:spellcheck” are needed.

Examples:

Get all nodes of node type ‘sling:Folder’ with the property ‘sling:resourceType’ set to ‘x’:

/jcr:root/content//element(*, sling:Folder)[@sling:resourceType='x']

Get all index definition nodes of type ‘lucene’, sorted by reindex count, descending:

/jcr:root/oak:index/element(*, oak:QueryIndexDefinition)[@type='lucene'] order by @reindexCount descending

Get all nodes below /etc that have the property type set to ‘report’; fail the query if there is no index that can be used:

/jcr:root/etc//*[@type='report'] option(traversal fail)

Filter

/ directChildNodeFilter
/ / descendantNodeFilter
*
element (
 
nodeName
 
, nodeType
)
text()
[ constraint ]
( filter | unionFilter
 
...
)

A single slash means filtering on a specific child node, while two slashes means filtering on a descendant node.

“*” means any node name, and any node type.

“text()” is a shortcut for “jcr:xmltext”. It is supported only for compatibility.

Queries using the construct `(filter1 | filter2)' are converted to union, that is, one query is generated for each filter, and the result of both queries is combined. Note that for fulltext queries, it is problematic to use union, because scoring is done for each subquery individually. The score is not useful to compare results of different subqueries, so that the union of multiple fulltext queries won't be ordered by score as one might expect.

Examples:

Only direct child nodes of /content/dam:

/jcr:root/content/dam/*

All descendant nodes of /content/oak:index:

/jcr:root/content/oak:index//*

All nodes named “rep:policy” that have a parent node which is the direct child node of /home/users:

/jcr:root/home/users/*/rep:policy

All nodes named “profile” that have a parent node which is a descendant of /home/users:

/jcr:root/home/users//*/profile

All descendant nodes of /content that are of type oak:QueryIndexDefinition:

/jcr:root/content//element(*, oak:QueryIndexDefinition)

All descendant nodes of /content that are of type oak:QueryIndexDefinition and are named “lucene”:

/jcr:root/content//element(lucene, oak:QueryIndexDefinition)

All nodes named “indexRules” that have a parent node with the property “type” set to “lucene”:

/jcr:root/oak:index/*[@type = 'lucene']/indexRules

The paths ‘/libs’ and ‘/etc’ should be searched:

/jcr:root/(libs | etc)//*[@jcr:uuid and @jcr:mimeType = 'text/css']

Column

rep:excerpt (
 
 
relativePath /
@propertyName
)
rep:spellcheck ( )
rep:suggest (
 
.
)
rep:facet (
 
relativePath /
@propertyName )

“rep:excerpt”: include the excerpt in the result. Since Oak version 1.8.1, optionally a property name can be specified. See Excerpts and Highlighting.

“rep:spellcheck”: Include the spellcheck in the result. See Spellchecking.

“rep:suggest”: include suggestions in the result. See Suggestions.

“rep:facet”: include facets in the result. See Facets.

Examples:

/jcr:root/content//*[jcr:contains(., 'test')]/(rep:excerpt())

/jcr:root/content//*[jcr:contains(., 'test')]/(rep:excerpt(@jcr:title) | rep:excerpt())

/jcr:root/content//*[rep:suggest('in ')]/(rep:suggest())

/jcr:root/content//*[jcr:contains(@jcr:title, 'oak')]/(rep:facet(@tags))

Constraint

andCondition
 
or andCondition
 
...

“or” conditions of the form “@x = 1 or @x = 2” can use the same index.

“or” conditions of the form “@x = 1 or @y = 2” are more complicated. Oak will convert them to a “union” query (one query with @x = 1, and a second query with @y = 2).

Examples:

Include the nodes that have the property ‘hidden’ set to ‘hidden-folder’, or don't have the property set:

/jcr:root/content/dam/*[@hidden='hidden-folder' or not(@hidden)]

And Condition

condition
 
and condition
 
...

A special case (not found in relational databases) is “and” conditions of the form “@x = 1 and @x = 2”. They will match nodes with multi-valued properties, where the property value contains both 1 and 2.

Examples:

/jcr:root/home//element(*, rep:Authorizable)[@rep:principalName != 'Joe' and @rep:principalName != 'Steve']

Condition

comparison inComparison
 
not
( constraint )
( constraint )
jcr:contains (
 
propertyName
.
,
fulltextSearchExpression )
jcr:like ( dynamicOperand , staticOperand )
rep:similar ( propertyName , staticOperand )
rep:native ( languageName , staticOperand )
rep:spellcheck ( staticOperand )
rep:suggest ( staticOperand )

“fn:not”: this is used for both “is not null”, and for nested conditions. For example “fn:not(@status)” means nodes where this property is not set. In this case, and index can be used, but it is relatively expensive to index all nodes that don't have a certain property. It is recommended to only index this if there are relatively view nodes with this nodetype. See also nullCheckEnabled. Nested conditions, for example “fn:not(@status = ‘x’ and @color = ‘red’)”, can not typically use an index.

“jcr:contains”: see Full-Text Queries.

“jcr:like”: the wildcards characters are _ (any one character) and % (any characters). An index is used, except if the operand starts with a wildcard. To search for the characters % and _, the characters need to be escaped using \ (backslash).

“rep:similar”: see Similarity Queries.

“rep:native”: see Native Queries.

“rep:spellcheck”: see Spellchecking.

“rep:suggest”: see Suggestions.

Examples:

/jcr:root/var/eventing//element(*, slingevent:Job)[@event.job.topic = 'abc' and not(@slingevent:finishedState)]

/jcr:root/content//element(*, cq:Page)[@offTime > xs:dateTime('2020-12-01T20:00:00.000') or @onTime > xs:dateTime('2020-12-01T20:00:00.000')]

Comparison

dynamicOperand
=
< >
! =
<
< =
>
> =
staticOperand

Comparison using <, >, >=, and <= can use an index if the property in the index is ordered.

Examples:

@jcr:primaryType != 'nt:base'
@offTime > xs:dateTime('2020-12-01T20:00:00.000')

Static Operand

textLiteral
$ bindVariableName
 
-
numberLiteral
true
 
( )
false
 
( )
xs:dateTime ( literal )

A string (text) literal starts and ends with a single quote. Two single quotes can be used to create a single quote inside a string.

Examples:

'John''s car'
true()
false
1000
-30.3
xs:dateTime('2020-12-01T20:00:00.000')

Ordering

dynamicOperand
 
descending

Ordering by an indexed property will use that index if possible. If there is no index that can be used for the given sort order, then the result is fully read in memory and sorted there. If an index for ordering is used, then only entries are listed where the given property (or function) is not null.

As a special case, sorting by “jcr:score” in descending order is ignored (removed from the list), as this is what the fulltext index does anyway (and if no fulltext index is used, then the score doesn't apply). If for some reason you want to enforce sorting by “jcr:score”, then you can use the workaround to order by “fn:lowercase(@jcr:score) descending”. Note that for fulltext queries, it is problematic to use union, because scoring is done for each subquery individually. The score is not useful to compare results of different subqueries, so that the union of multiple fulltext queries won't be ordered by score as one might expect.

Examples:

order by @jcr:created descending

Dynamic Operand

 
relativePath /
@propertyName
*
fn:string-length ( dynamicOperand )
fn:name
fn:local-name
fn:path
(
 
.
)
jcr:score ( )
fn:lower-case
fn:upper-case
jcr:first
( dynamicOperand )
fn:coalesce ( firstDynamicOperand , secondDynamicOperand )

The “*” stands for any property. Using it in a condition requires a relative path. For example: [./* = 'test'] means where any property matches the word ‘test’. Relative path fragments can also contain * to represent ‘any’ node at that point. // is not supported as part of relative path. So, a/*/@test, */a/@test, a/*/*/@test etc are valid while a//@test, a/*/b//@test, etc are not.

“jcr:score()” is the score returned by the index.

“fn:coalesce”: This returns the first operand if it is not null, and the second operand otherwise. @since Oak 1.8

“jcr:first”, “fn:path”: Supported @since Oak 1.42, see OAK-9625.

Examples:

@type
./@jcr:primaryType
items/type/@metaType
indexRules/nt:base/*
fn:string-length(@title)
jcr:first(@alias)
fn:name()
fn:path()
jcr:score ()
fn:lower-case(@lastName)
fn:coalesce(@lastName, @name)

Options

option (
traversal
ok
warn
fail
default
index tag tagName
offset number
limit number
 
, ...
)

“traversal”: by default, queries without index will log a warning, except if the configuration option QueryEngineSettings.failTraversal is changed The traversal option can be used to change the behavior of the given query: “ok” to not log a warning, “warn” to log a warning, “fail” to fail the query, and “default” to use the default setting.

“index tag”: by default, queries will use the index with the lowest expected cost (as in relational databases). To only consider some of the indexes, add tags (a multi-valued String property) to the index(es) of choice, and specify this tag in the query. See Query Option Index Tag.

“offset” / “limit”: sets the offset / limit at the time of parsing the query See Query Option Offset / Limit.

Examples:

option(traversal fail)

Explain

explain
 
measure
query

Does not run the query, but only computes and returns the query plan. With “explain measure”, the expected cost is calculated as well. In both cases, the query result will only have one column called ‘plan’, and one row that contains the plan.

Examples:

exlplain measure /jcr:root//*[@jcr:uuid = 'x']

Result:

plan = [nt:base] as [nt:base] 
/* property uuid = 1 where [nt:base].[jcr:uuid] = 1 */  
cost: { "nt:base": 2.0 } 

This means the property index named “uuid” is used for this query. The expected cost (roughly the number of uncached I/O operations) is 2.


Measure

measure query

Runs the query, but instead of returning the result, returns the number of rows traversed. The query result has two columns, one called ‘selector’ and one called ‘scanCount’. The result has at least two rows, one that represents the total (selector set to ‘query’), and one per selector used in the query.

Examples:

measure /jcr:root//*[@jcr:uuid = 'x']

Result:

selector = query
scanCount = 0
selector = nt:base
scanCount = 0

In this case, the scanCount is zero because the query did not find any nodes.