[xquery-talk] Retrieving Sub Nodes

Mon Feb 18 12:51:39 PST 2008

Hi, XQueriers:

   I hope the subject line explains what I want to do. My source document is as follows:

    <my>
    <myHeader>
        <fileDesc>
            <titleStmt>
                <title>My Collection</title>
            </titleStmt>
            <sourceDesc>
                <biblFull>
                    <seriesStmt>
                        <title level="journal">My Journal</title>
                        <idno type="volume"> 1 </idno>
                        <idno type="issue"> 1 </idno>
                    </seriesStmt>
                </biblFull>
            </sourceDesc>
        </fileDesc>
    </myHeader>
    <text>
        <body>
            <div>
                <ad>
                    <head>Be Taller</head>
                    <p>If you wish to know the facts about HEIGHT INCREASE, send 35c for details to
                        TALL-UP.
                        <address>
                            <addressLine>TALL-UP,</addressLine>
                            <addressLine>232 W. Front St.</addressLine>
                            <addressLine>Napoleon, Ohio 42345</addressLine>
                        </address>
                    </p>
                    <note>Dept. NA</note>
                </ad>
                <ad>
                    <head type="main">FREE POWERFUL MUSCLES FAST</head>
                    <head type="sub">FREE</head>
                    <p>Fantastic mew discoveries in the science of body-building. Our method will
                        add inches of powerful muscles to arms, chest, shoulders &amp; legs.
                        Learn secrets on trimming the waist with ultra-modern methods—fast! Results
                        guaranteed! Send for free brochure. Send dime for postage and handling.
                        <address>
                            <addressLine>Universal Bodybuilding</addressLine>
                            <addressLine>Box 485</addressLine>
                            <addressLine>Dearborn, Michigan 48121</addressLine>
                        </address>
                    </p>
                    <note>Dept. P</note>
                </ad>
            </div>
        </body>
    </text>
    </my>

My current XQuery:

let $ad := fn:collection("xmldb:exist://db/cbml")
let $journal := $ad//seriesStmt/title[@level='journal'][1]
let $ad2 := $ad//ad
let $ad3 := $ad2/p/address[contains(upper-case(.), 'MUSCLE')]
let $sorted_result:=
for $doc in distinct-values($ad3)
order by $doc
return $doc
for $r at $count in $sorted_result
let $nodes := $ad2[. = $r][1]
let $seriesStmt:=$nodes/ancestor::cbml//seriesStmt
let $pb := $nodes/ancestor::cbml//pb[1]
return
<ad>
<statistics>
Showing Result: {$count} / {count($sorted_result)}
</statistics>
<source>
<journal>{data($journal)[1]}</journal>
<publication_information>Volume {data($seriesStmt/idno[@type='volume'][1])},
Issue {data($seriesStmt/idno[@type='issue'][1])}</publication_information>
</source>
{$nodes}
<information>Source from page {$pb}</information>
</ad>

This returns the correct result count, but the output contains none of the content from the matching <ad> nodes.
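
A possible explanation, offered as a guess rather than a confirmed diagnosis: distinct-values($ad3) returns atomic string values, not nodes, so each $r is only the text of an <address>. The predicate $ad2[. = $r] then compares the string value of a whole <ad> (head, p and note together) with that address text, which would rarely if ever be equal, so $nodes stays empty while count($sorted_result) is still correct. The sketch below iterates over the <ad> elements themselves. It assumes the search term should be matched against <p>, that sorting by the first <head> is acceptable, and that root($a) reaches the right header; it also drops the de-duplication that distinct-values provided. The names $hits and $a are only illustrative.

(: Sketch only: keep the matching <ad> nodes instead of their string values :)
let $hits :=
    for $a in fn:collection("xmldb:exist://db/cbml")//ad[p[contains(upper-case(.), 'MUSCLE')]]
    order by string($a/head[1])   (: assumed sort key :)
    return $a
for $a at $count in $hits
(: assumption: the series statement lives in the same document as the ad :)
let $seriesStmt := root($a)//seriesStmt
return
<ad>
<statistics>Showing Result: {$count} / {count($hits)}</statistics>
<source>
<journal>{data($seriesStmt/title[@level='journal'][1])}</journal>
<publication_information>Volume {data($seriesStmt/idno[@type='volume'][1])},
Issue {data($seriesStmt/idno[@type='issue'][1])}</publication_information>
</source>
{$a}
</ad>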

If I change the definition of $sorted_result to

let $sorted_result:=
for $doc in $ad2
where $ad2/p/address[contains(upper-case(.), 'MAX SEELEY')]
order by $doc
return $doc

the query returns nothing because it times out during processing.
Have I missed something needed to produce the following?

The desired output:

<ad>
<statistics> Showing Result: 1 / 1</statistics>
<source>
<journal>My Collection</journal>
<publication_information>Volume 1 , Issue 1 </publication_information>
</source>
                <ad>
                    <head type="main">FREE POWERFUL MUSCLES FAST</head>
                    <head type="sub">FREE</head>
                    <p>Fantastic mew discoveries in the science of body-building. Our method will
                        add inches of powerful muscles to arms, chest, shoulders &amp; legs.
                        Learn secrets on trimming the waist with ultra-modern methods—fast! Results
                        guaranteed! Send for free brochure. Send dime for postage and handling.
                        <address>
                            <addressLine>Universal Bodybuilding</addressLine>
                            <addressLine>Box 485</addressLine>
                            <addressLine>Dearborn, Michigan 48121</addressLine>
                        </address>
                    </p>
                    <note>Dept. P</note>
                </ad>
</ad>

Any hint is appreciated.
======================================================
Alice Wei
MIS 2008
School of Library and Information Science
Indiana University Bloomington
ajwei at indiana.edu