[xquery-talk] Union and Intersect Operators

=?GB2312?B?1dTA1w==?= zhaolei at gmail.com
Sun Mar 16 23:29:26 PST 2008


Hi,
Got your meaning. You can use the following function to removeduplicate nodes in the search result. In this function, a duplicatenode means it is deep-equal() with another node.
declare function local:distinct-nodes ($arg as node()*) as node()*{   for $a at $apos in $arg   let $before_a := fn:subsequence($arg, 1, $apos - 1)   where every $ba in $before_a satisfies not(fn:deep-equal($a, $ba))   return $a};
for example:
local:searchresult(local:distinct-nodes(collection("xmldb:exist://db/cbml")//ad/(head|note|p)[contains(upper-case(.),$search)]))
The function is based onhttp://www.w3.org/TR/xpath-functions/#func-distinct-nodes-stable.
Hope this helps.
Lei
On 3/16/08, Wei, Alice J. <ajwei at indiana.edu> wrote:> Hi,>>    I hope this is a more vivid example.>>  <ad>>  <statistics>>  Showing Result: 3 / 13</statistics>>  <head>Be Taller</head>>  <p>If you wish to know the facts about HEIGHT INCREASE, send 35c for details to TALL-UP.>         <address>>  <addressLine>TALL-UP,</addressLine>>  <addressLine>Dept. NA</addressLine>>  <addressLine>232 W. Front St.</addressLine>>  <addressLine>Napoleon, Ohio 42345</addressLine>>  </address>>  </p>>  <information>Source from page <pb n="4" xml:id="etn010001-006"/></information>>  </ad>>  ad>>  <statistics>>  Showing Result: 6 / 13</statistics>>  <head>Be Taller</head>>  <p>If you wish to know the facts about HEIGHT INCREASE, send 35c for details to TALL-UP.>   <address>>  <addressLine>TALL-UP,</addressLine>>  <addressLine>232 W. Front St.</addressLine>>  <addressLine>Napoleon, Ohio 42345</addressLine>>  </address>>  </p>>  <information>Source from page <pb n="4" xml:id="etn010001-006"/></information>>  </ad>>>  The query I had used for the union and intersect is>>     {local:searchresult(collection("xmldb:exist://db/cbml")//ad/(head|note|p)[contains(upper-case(.),$search)])},>>  and the term I used is TALL. Is there something I had done wrong in my union and intersect expression here that makes the result return even though it is the already extracted data snippet itself because it has both head and p tags with the same word?>>>  Thanks for your help.>>  Alice>  ======================================================>  Alice Wei>  MIS 2008>  School of Library and Information Science>  Indiana University Bloomington>  ajwei at indiana.edu>  ________________________________________>  From: ÕÔÀ× [zhaolei at gmail.com]>> Sent: Sunday, March 16, 2008 12:50 AM>> To: Wei, Alice J.>  Cc: talk at x-query.com>  Subject: Re: [xquery-talk] Union and Intersect Operators>>  It depends on how you define "duplicates". Nodes that are fn:deep-equal()?>  Or nodes that have the same string() value?>>  According to your description, it seems that another page does not>  contain <head> nodes. So I guess you may want to eliminate nodes that>  contain the same content with <head> (and <p>) nodes which are already>  found. Could you please paste a more complete example to describe your>  requirment? Thanks.>>  Lei>>  On 3/15/08, Wei, Alice J. <ajwei at indiana.edu> wrote:>  > Hi,>  >>  >   Sorry, that was a typo.>  >   As for the issue with the union operator, I only want anything that matches that, but I don't want duplicates. For example:>  >>  >  <ad>>  >  <statistics>>  >  Showing Result: 2 / 20</statistics>>  >         <head type="main"> The Key¡ªto your future! </head>>  >         <head type="sub">>  >  "Already master keypad my Elka Lodge and 38>  >                         apartments¡ªcourse is practically paid for!">  >  <emph rend="italics">Adelindo Orsi, Jr., Azusa, Calif.</emph>>  >  </head>>  >         <description>>  >  <p rend="bold">Be a LOCKSMITH Like Me/</p>>  >         <p>>  >  Add to your income with easy spare time earnings¡ªcash in on the>  >                         nationwide shortage of skilled locksmiths. Quickly step into a highly paid>  >                         position or a business of your own.>  >  </p>>  >                 </description>>  >         <information>Source from page <pb n="25"/>>  >  </information>>  >  </ad>>  >>  >   I specified that I want anything that is case insensitive with "KEY" in there. However, since I have another group with similar node structure in another page as indicated in <information> node, the same information is presented again. I don't have this problem when I only run the search asking for only <head> containing certain text, therefore I am guessing it must not be the distinct-values() issue.>  >>  >  Or, is it?>  >>  >  Thanks for your help.>  >>  >>  >  Alice>  >  ======================================================>  >  Alice Wei>  >  MIS 2008>  >  School of Library and Information Science>  >  Indiana University Bloomington>  >  ajwei at indiana.edu>  >  ________________________________________>  >>  > From: ÕÔÀ× [zhaolei at gmail.com]>  >  Sent: Friday, March 14, 2008 10:12 PM>  >  To: Wei, Alice J.>  >  Cc: talk at x-query.com>  >  Subject: Re: [xquery-talk] Union and Intersect Operators>  >>  >>  >  In your second query, "intercept" should be "intersect"?>  >>  >  Union is functioned on nodes. And two nodes nodeA and nodeB are>  >  duplicates only if they satisfy "nodeA is nodeB". In the xpath>  >  expression, tags <head>, <p>, <note> and <address> locate different>  >  nodes, which are not duplicates. Thus "union" gives all of them, and>  >  "intersect" gives none.>  >>  >  Regards,>  >  Lei>  >>  >  On 3/15/08, Wei, Alice J. <ajwei at indiana.edu> wrote:>  >  > Hi,>  >  >>  >  >  I have a strange output here from the web application using xquery. I intend to use the union operator here to combine all the results of the possible tags I intend to run search against, but it turned out that the $count variable below, which the similar XPath has been used to call the function as well for the query, does not exclude duplicates. What it appears to do is giving me all the tags that contain the the phrase $search to my query>  >  >>  >  > let $search := upper-case(request:get-parameter("search", ""))>  >  > let $count := count(collection("xmldb:exist://db/cbml")//ad/((head union p) union (note union //address))[contains(upper-case(.),$search)])>  >  >>  >  > When I used>  >  >>  >  > let $count := count(collection("xmldb:exist://db/cbml")//ad/((head union p) intercept (note union //address))[contains(upper-case(.),$search)])>  >  >>  >  > it gives me no output.>  >  >>  >  > However, the first line I had, it gives me back as many times the phrase $search has been used in the four parameters.>  >  > If anyone needs to see more of my query, I would love to provide it, but the query itself is very long.>  >  >>  >  > Can anyone please give me tips on what might have gone wrong here?>  >  >>  >  > Thanks to those who can help.>  >  >>  >  > Alice>  >  > ======================================================>  >  > Alice Wei>  >  > MIS 2008>  >  > School of Library and Information Science>  >  > Indiana University Bloomington>  >  > ajwei at indiana.edu>  >  >>  >  > _______________________________________________>  >  > talk at x-query.com>  >  > http://x-query.com/mailman/listinfo/talk>  >  >>  >>


More information about the talk mailing list