Re: [MarkLogic Dev General] transformation of document formats

2018-04-05 Thread Mary Holstege
ow...@developer.marklogic.com> When replying, please edit your Subject line so it is more specific than "Re: Contents of General digest..." Today's Topics: 1. transformation of document formats (Rajesh Kumar) 2. Re:

Re: [MarkLogic Dev General] Near Query with min-occurs

2018-04-04 Thread Mary Holstege
The semantics of this is a document in which prop* occurs at least 4 times, where one of those times is within distance 1 of tax. Here's the thing: index resolution of min-occurs requires positions of the appropriate type, and there are various cases where we can't tell in the indexes and

Re: [MarkLogic Dev General] transformation of document formats

2018-04-03 Thread Mary Holstege
On Tue, 03 Apr 2018 04:43:36 -0700, Rajesh Kumar wrote: > Hi Team, > > > Is there a way that word can be saved in MarkLogic and can be converted > to other > formats like PDF or ePUB within MarkLogic instead of calling external > API of a > different technology like

Re: [MarkLogic Dev General] Good Way to Automatical

2018-01-22 Thread Mary Holstege
There isn't a single API that orchestrates all the pieces, but there are APIs do do all the necessary parts in the pipelines and domains modules. These should be executed against your triggers database. If you share a triggers database, you don't need to do it all over again. p:insert to

Re: [MarkLogic Dev General] DB modifications and try/catch

2017-12-11 Thread Mary Holstege
On Mon, 11 Dec 2017 10:09:47 -0800, Florent Georges wrote: > Hi Mary, > > Thank you for this clear explanation. This is excellent news (news to me > at least), I always assumed try/catch were trickier than that WRT errors. > > Just out of curiosity, how is that possible

Re: [MarkLogic Dev General] DB modifications and try/catch

2017-12-11 Thread Mary Holstege
Updates within the body of the try will be abandoned; updates before or after will be kept. That is, as long as they are catchable exceptions. If you get an exception in the commit of the whole transaction, all transactions will be lost. That's as far as updates go. Some side-effects are

Re: [MarkLogic Dev General] xsl:output, character maps and document-insert

2017-11-08 Thread Mary Holstege
Character maps apply to serializations. What gets stored in the database isn't serialized: it is a data model instance, that isn't serialized unless it is (a) rendered as output from a query (b) saved to disk or (c) explicitly serialized as a string via xdmp:quote. There is a special

Re: [MarkLogic Dev General] encoding problem of utf-8 in marklogic

2017-10-16 Thread Mary Holstege
I think the issue is more likely to be about the encoding for serialization, not the encoding for input. Everything in MarkLogic is stored in UTF-8. The entity escapes you see are not what is stored: we store the actual characters. You can force the issue with output settings on the

Re: [MarkLogic Dev General] Or-Query: Synonym Scoring

2017-08-17 Thread Mary Holstege
Two things: (1) A lot depends on what your index settings are. You should always check the plan: xdmp:plan(cts:search(...)) You should see something like: 10071940103343762009 element(name1,word("tom")) 10071940126384762201

Re: [MarkLogic Dev General] Count of cts:element-values() not equal to number of element instances--what's going on?

2017-08-14 Thread Mary Holstege
That is overkill. The results you get out of cts:element-values have a frequency (accessible via cts:frequency). The cts: aggregates (e.g. cts:count, cts:sum) take the frequency into account. //Mary On Mon, 14 Aug 2017 11:42:07 -0700, Oleksii Segeda wrote: >

Re: [MarkLogic Dev General] Possible to Create Multi-Tagname Range Indexes Using admin Functions?

2017-07-31 Thread Mary Holstege
If you manually create an instance of db:range-element-index instead of using the constructor function, where in the localname element you have a space-separated list of the localnames you can do it. (If they have separate namespace URIs too, you need to separate them). You should make sure

Re: [MarkLogic Dev General] Connecting to MarkLogic 9ea on CentOS via ODBC

2017-03-24 Thread Mary Holstege
The error is misleading: the real problem is that the ODBC port does not communicate via the HTTP protocol and it reads HTTP messages as garbage. It expects to be communicating with the Postgres backend protocol. //Mary On Fri, 24 Mar 2017 08:04:46 -0700, Nick Heidke

Re: [MarkLogic Dev General] Unfiltered, exact searches

2017-03-23 Thread Mary Holstege
Not so much a bug, as a consequence of how indexing works. Value queries, even exact values queries are word searches with a spanning constraint. If you really want an equality search, set up a range index and do a range query. What is going on is that the index key for "new" (value exact) is

Re: [MarkLogic Dev General] Regular Expressions

2017-03-22 Thread Mary Holstege
On Wed, 22 Mar 2017 12:55:27 -0700, Oleksii Segeda wrote: > Hi everyone, > > Quick questions regarding regex in ML: > > > 1. What's ML alternative to word boundaries \b? Seems that > fn:analyze-string doesn't support this special character. > > 2. Does

Re: [MarkLogic Dev General] hyphens and cts:element-value-query

2017-02-28 Thread Mary Holstege
Just to summarize the ins and outs here in one place, because I'm seeing a certain amount of confusion: * xdmp:plan is your friend: it will show you the questions we ask the indexes. If you see some information from your query is not reflected in the plan, that will be a case where you might

Re: [MarkLogic Dev General] Is xdml:unquote appropriate for handling accent characters?

2017-02-09 Thread Mary Holstege
There are output serialization options for when you return results from MarkLogic that will cause certain sets of characters to be emitted using named entities, such as You can set the on the appserver level. The setting you want to adjust is output-sgml-character-entities; set it to

Re: [MarkLogic Dev General] How to fix the SQLite error : Too many attached databases

2017-01-25 Thread Mary Holstege
On Wed, 25 Jan 2017 05:20:42 -0800, Vijay Marella wrote: > Hi All, > > I encountered an issue with the below error message when I tried to > query a view > from Mark Logic Query console in SQL mode. Please see screen shot for > details > and suggest how to fix the

Re: [MarkLogic Dev General] cts;and-not-query working as expected

2016-12-15 Thread Mary Holstege
On 12/15/2016 08:26 AM, Tim Meagher wrote: Hi folks, The following query is looking for a record that contains a first but not last name. The last name may or may not have an element associated with it. While I believe I am using the proper syntax, I think the problem is due to the

Re: [MarkLogic Dev General] Relevance Info

2016-12-15 Thread Mary Holstege
Yes, stemmed searches use language-specific keys in the index, so the IDFs of these are calculated independently. //Mary On 12/15/2016 08:34 AM, Andreas Hubmer wrote: Hi, It seems that the weight is - at least partly - language dependent. After changing the xml:lang at the root element of a

Re: [MarkLogic Dev General] Relevance Info

2016-12-15 Thread Mary Holstege
The weight comes from the term weight in the query (by default 1), scaled by the IDF. So the fewer the fraction of documents the term appears in, the higher the weight will be (assuming default logtfidf scoring). //Mary On 12/15/2016 07:36 AM, Andreas Hubmer wrote: Hi, I'm tuning the result

Re: [MarkLogic Dev General] stem of messaging

2016-11-10 Thread Mary Holstege
Probably because 'message' as a verb was until recently a relative obscure usage and therefore it isn't in the dictionary/rule-sets of our stemming vendor. You can add it to your custom dictionary if you want to improve the recall here. //Mary On 11/10/2016 07:08 AM, Steiner, David J.

Re: [MarkLogic Dev General] Inserting documents with XSLT

2016-10-07 Thread Mary Holstege
On Fri, 07 Oct 2016 08:52:18 -0700, Hans Hübner wrote: > Hi folks, > > is there a way to make xsl:result-document insert the document into the > database rather than returning it to the XQuery that invoked the XSLT > transformation? We know that we can use

Re: [MarkLogic Dev General] Validate XML documents while ingesting and maintain logs of failed documents #CGO#

2016-09-28 Thread Mary Holstege
You need to throw an error to abort the whole transaction. Call fn:error from the catch block. //Mary On Sep 27, 2016 10:57 PM, "Jain, Abhishek" wrote: Hi Greet, I tried to implement pre-commit solution. In my action module I am trying check if input xml

Re: [MarkLogic Dev General] Speeding up xquery returning aggregates

2016-09-23 Thread Mary Holstege
On Fri, 23 Sep 2016 09:36:16 -0700, Mark Shanks wrote: ... > > I'm still unclear of what is going on under the hood in Marklogic. The > following link (https://docs.marklogic.com/guide/search-dev/lexicon) > talks about value co-occurrrence lexicons. If this is

Re: [MarkLogic Dev General] Sorting results of lexicon calls

2016-09-21 Thread Mary Holstege
On Wed, 21 Sep 2016 07:51:57 -0700, Andreas Hubmer wrote: > Thanks, Mary. I've been consistently wondering what sample/truncate are > doing. Now I got an idea. > > Do you know of any way to get the URIs sorted by relevance? > Using sample/truncate I can get the 5

Re: [MarkLogic Dev General] Sorting results of lexicon calls

2016-09-21 Thread Mary Holstege
On Wed, 21 Sep 2016 06:34:15 -0700, Andreas Hubmer wrote: > Hi Erik, > > Thanks, cts:value-tuples is a good hint. > Do you have an idea of how one could get the URIs, sorted by relevance > (of > some word query for instance)? > > I know what the option

Re: [MarkLogic Dev General] XQuery file extensions

2016-09-20 Thread Mary Holstege
On Tue, 20 Sep 2016 08:20:17 -0700, Joe Wicentowski <joe...@gmail.com> wrote: > Hi Mary, > > On Tue, Sep 20, 2016 at 10:55 AM, Mary Holstege > <mary.holst...@marklogic.com> wrote: >> The file extension 'xqm' is not pre-configured to be mapped to anything >

[MarkLogic Dev General] u

2016-09-20 Thread Mary Holstege
On Tue, 20 Sep 2016 05:06:54 -0700, Max-Gerd Retzlaff wrote: > Hi, > > We stumbled upon occasional "hiccups" when updating modules in module > databases, and have the suspicion that this might be related to the > module caching, and the use of the file

Re: [MarkLogic Dev General] CPF - No triggering.

2016-09-13 Thread Mary Holstege
On Tue, 13 Sep 2016 06:43:13 -0700, Kanugovi, Santhosh wrote: > Hello, > > We have configured CPF for alerting. Enabled CPF diagnostics. And when a > matching document is ingested we don't see any CPF events logs in the > error log. What is that we are

Re: [MarkLogic Dev General] Regarding cts:element value query

2016-09-02 Thread Mary Holstege
On Thu, 01 Sep 2016 23:57:01 -0700, Mani, Sivasubramani (ELS) wrote: > Hi Team, > > How do I search SPACE,QUESTION mark is the only value for the particular > element > using cts:element-value-query but it results wrongly. > > My query was > > Let $estiamte := >

Re: [MarkLogic Dev General] Wildcard search in marklogic

2016-08-24 Thread Mary Holstege
Three things: (1) The trailing wildcard index contains exactly entries for words that begin with certain 3+ character sequences, so "murd*" would be a single term lookup that will match murd, murdy, murds, etc. (2) The three character wildcard index is actually capable of distinguishing a 3

Re: [MarkLogic Dev General] root collation vs unicode collation in terms of performance

2016-08-23 Thread Mary Holstege
On Tue, 23 Aug 2016 08:46:40 -0700, Tim Meagher wrote: > Just wondering why MarkLogic does not make codepoint the default > collation > if it results in a 10% performance improvement. > > > Tim Let's not confuse the default appserver collation with the collation you might

Re: [MarkLogic Dev General] Secondary Sort for documents with same score

2016-08-09 Thread Mary Holstege
essage- From: general-boun...@developer.marklogic.com [mailto:general-boun...@developer.marklogic.com] On Behalf Of Mary Holstege Sent: Monday, August 08, 2016 9:10 PM To: general@developer.marklogic.com Subject: Re: [MarkLogic Dev General] Secondary Sort for documents with same score On Mon,

Re: [MarkLogic Dev General] Secondary Sort for documents with same score

2016-08-08 Thread Mary Holstege
On Mon, 08 Aug 2016 07:59:10 -0700, Prakashkumar Patel wrote: > Hi, > > We are using score-simple method for score calculation. We are getting > multiple documents with same score in result. Do we know what is the > secondary sorting algorithm that ML uses in

Re: [MarkLogic Dev General] word-query including punctuation characters

2016-06-29 Thread Mary Holstege
On Wed, 29 Jun 2016 08:06:35 -0700, Wissam Asfahani (TSO GB) wrote: > Good afternoon, > > We are having some issues estimating the number of documents when > performing word queries containing punctuation characters. > > I have attached 4 sample documents. When

Re: [MarkLogic Dev General] French stemmed word is not getting recognized and highlighted

2016-04-14 Thread Mary Holstege
cts:stem will show the alternative stems, but basic stemming will only use the first stem given. Stemmed search matching depends on matching stem to stem. In basic stemming, that means matching on the first stem; in advanced stemming that means matching on any of the stems. So, consider your

Re: [MarkLogic Dev General] French stemmed word is not getting recognized and highlighted

2016-04-12 Thread Mary Holstege
rn stemming off entirely; principally useful where you searching over non-linguistic content //Mary > > Thanks, > Praveen. > > On Thu, Mar 31, 2016 at 12:58 PM, Mary Holstege > <mary.holst...@marklogic.com >> wrote: > >> >> Do you have advanced stemming enab

Re: [MarkLogic Dev General] French stemmed word is not getting recognized and highlighted

2016-03-31 Thread Mary Holstege
Do you have advanced stemming enabled? With basic stemming only the first stem returned from cts:stem indexed and used for matching in search. //Mary On 03/31/2016 03:00 AM, Debin, Infant Jerald (LNG-CON) wrote: Hi Team, For the term French term “disparu” corresponding French stemmed word

Re: [MarkLogic Dev General] Marklogic Collation to not ignore spaces while ignoring punctuation and symbols

2016-03-30 Thread Mary Holstege
On Wed, 30 Mar 2016 05:42:02 -0700, Blessing N wrote: ... > 1.Alphabetize letter by letter from A to Z. > 2.Ignore the capitalization of letters > 3.Ignore mathematical symbols and any special characters that do not > include a Latin letter > 4.Ignore punctuation > 5.

Re: [MarkLogic Dev General] French Ligature (Æ and æ) is not getting recognized in Marklogic

2016-03-22 Thread Mary Holstege
On Tue, 22 Mar 2016 08:38:31 -0700, Debin, Infant Jerald (LNG-CON) wrote: > Hi Team, > > When we give the French ligature (Æ and æ) in our word query, it is not > getting recognized as ligature and no results are returned. > > But when we use ligature (Œ

Re: [MarkLogic Dev General] XPath 3.0 operators in XSLT

2016-03-07 Thread Mary Holstege
XPath in an XSLT 2.0 context in MarkLogic uses strict 2.0 conformance mode (same as if you declare the XQuery version as 1.0). There is a hook to use different dialects instead and these have the same impact on XPath in XSLT as they do on XPath in XQuery. Add the attribute xdmp:dialect

Re: [MarkLogic Dev General] Wildcard Searches | MultiLingual Data | Cts Queries

2016-01-12 Thread Mary Holstege
There are two things going on here: (1) Language only applies to stemmed searches, and wildcarded searches are not stemmed. So your lang=zh is irrelevant. If this were a non-wildcarded search, your lang=zh would still not work as you expect in this case, because: (1) MarkLogic performs some

Re: [MarkLogic Dev General] Thesaurus

2015-12-03 Thread Mary Holstege
On 12/03/2015 07:07 AM, Kari Cowan wrote: I am starting to look at the Thesaurus API. Is there any difference between ML 7 and 8.x in these regards? https://docs.marklogic.com/7.0/thsr Only that there is now a JSON format for some of the functions as well for convenience when you use the

Re: [MarkLogic Dev General] CPF and limited-privilege users

2015-12-02 Thread Mary Holstege
On Wed, 02 Dec 2015 11:31:58 -0800, Katherine Ford wrote: > I have a CPF domain that includes some documents inserted via XCC. XCC > uses > credentials for a limited user, and the XCC request calls an amped > function to > insert the documents. When the documents are

Re: [MarkLogic Dev General] Issue with special / foreign language characters in ML rest search

2015-11-18 Thread Mary Holstege
MarkLogic doesn't index punctuation characters (Unicode class P) except for "exact" value queries. Therefore a word query or a value query that does not have the "exact" option cannot be resolved precisely by the index, only by the filter. So the index returns false positives and if you

Re: [MarkLogic Dev General] Range queries with weights below 0.5 ignored

2015-11-10 Thread Mary Holstege
On Tue, 10 Nov 2015 02:20:38 -0800, Chris Rimmer wrote: > Does anyone have an idea about this issue? It seems like a bug to me. I would look at the relevance trace information. There is a lot of bucketing and caps involved in score calculations, and it could just

Re: [MarkLogic Dev General] Skipping some documents from range indexing during ingestion

2015-11-05 Thread Mary Holstege
On 11/05/2015 06:50 AM, Tyagi, Devesh wrote: Hi, I have created range indexes on some elements in my database. Now I want to ingest some skeleton documents without any values in the concerned elements. Marklogic reports error, mentioning that the values have invalid lexical values. Is there

Re: [MarkLogic Dev General] Slope factor granularity not as expected

2015-10-30 Thread Mary Holstege
There are some interactions with range query caching (true by default) and some internal limits, and some fixes we made that made the information in the tables incorrect. There are some further fixes/changes coming in 8.0-4 in this area., too. In the meantime you can adjust your slope

Re: [MarkLogic Dev General] Format Date in SQL query on Marklogic 7

2015-10-28 Thread Mary Holstege
On Wed, 28 Oct 2015 07:16:00 -0700, shruti kapoor wrote: > Hi all > > I have an xml like > > 20120101 > ABC > > > > I am using ODBC server. So I have created an element range index on > admissionDate and added in my SQL view. > > I need to display the date in the

Re: [MarkLogic Dev General] Word Query - Excluded element Question

2015-08-26 Thread Mary Holstege
On Wed, 26 Aug 2015 00:49:17 -0700, David Ennis david.en...@hinttech.com wrote: I was hoping someone would have a better answer before I replied, but here is my response. Hopefully others will clarify / build on it. I do not think this will make a difference. The reason being that I

Re: [MarkLogic Dev General] How to add CDATA to xml content - reg.,

2015-08-24 Thread Mary Holstege
On Tue, May 5, 2015 at 9:40 AM, Mary Holstege mary.holst...@marklogic.com wrote: If you really want CDATA escaping for certain elements, you can use the output options to control this. See https://docs.marklogic.com/guide/app-dev/appserver-control#id_86940 for how to set output options

Re: [MarkLogic Dev General] CPF- Not working

2015-08-20 Thread Mary Holstege
Enable the CPF trace events: Go to the Diagnostics tab under Groups/Default. Enable trace events, and type CPF into the box. Then try again. You should see some trace in the log about what CPF sees and what it is doing. Also, check the properties of your file to see if there is an error

Re: [MarkLogic Dev General] path range index

2015-08-18 Thread Mary Holstege
On Tue, 18 Aug 2015 10:27:44 -0700, Paul M pjm...@yahoo.com wrote: Suppose I want all the values for a specific path range index.Normally, I would use cts:element-values and cts:frequencey on a range index. How would I do this on a path range index, if possible. Thank You. Yes, use

Re: [MarkLogic Dev General] Custom dictionary for stemming

2015-07-22 Thread Mary Holstege
It may be a tokenization thing -- the apostrophe is causing a word break so your custom stem is never matched. What does this give you: cts:tokenize(cts:stem(Int'l))? Do things work as you expect for a custom stem that doesn't have a punctuation character in it? A workaround for that is to

Re: [MarkLogic Dev General] cts:element-values is not returning result in input order

2015-07-16 Thread Mary Holstege
cts:element-values can either return the full values based on the ordering of those values (the full values) or on the frequency of the occurrences of those values. Since A and AXYZ aren't values of the element given to element-values, you can't return things based on the ordering of those. It

Re: [MarkLogic Dev General] Is a cts:query looking at specific attributes only within the same element possible?

2015-07-15 Thread Mary Holstege
There are a couple things going on here: (1) Queries do matching per fragment, so if you do an and query of two value queries or range queries, there is no constraint that the relationship elements be the same instance in the fragment. (2) Wrapping an element-query on relationship around the

Re: [MarkLogic Dev General] Schema Validation Pipeline

2015-06-23 Thread Mary Holstege
On Tue, 23 Jun 2015 08:24:31 -0700, Aluri, Venkata Seshu Prabhakar venkata.al...@optum.com wrote: I am trying to use the Schema Validation pipeline to validate the XML documents on ingestion. I did not see any option to set the validation -mode option to strict. Can someone point me to

Re: [MarkLogic Dev General] CPF Pipeline Issue

2015-06-18 Thread Mary Holstege
It looks to me like you have a syntax error. It should be xi:include={$cpf:document-uri}/ //Mary On 06/18/2015 11:37 AM, Aluri, Venkata Seshu Prabhakar wrote: I have created a CPF pipeline on my database and I am trying to create a new document in my action module on “initial” state. This new

Re: [MarkLogic Dev General] TaskServer: state is already successor so not updating for /alerts/email/15835948930718685389.xml

2015-06-04 Thread Mary Holstege
On 06/04/2015 07:20 AM, Christopher Hamlin wrote: Hi, This happens on success, when the next state is the same as the old state. In your pipeline I see: state-transition statehttp://marklogic.com/states/sent/state on-successhttp://marklogic.com/states/sent/on-success

Re: [MarkLogic Dev General] Phrase query mismatch Marklogic 7

2015-05-27 Thread Mary Holstege
On Wed, 27 May 2015 09:06:20 -0700, Will Thompson wthomp...@jonesmcclure.com wrote: It sounds like you want stemming set to off instead of basic -Will Or explicitly add the option unstemmed to your query. //Mary On May 27, 2015, at 10:38 AM, Shashidhar Rao raoshashidhar...@gmail.com

Re: [MarkLogic Dev General] Calculating Node size

2015-05-06 Thread Mary Holstege
Use fn:string instead of xdmp:quote On May 6, 2015 5:32 AM, Jonna Marry nett77v...@gmail.com wrote: Hi, We need to calculate the character length of a Node. By applying xpath we are getting the sequence of nodes. We need to calculate the length of each node along with nodename and text values.

Re: [MarkLogic Dev General] How to add CDATA to xml content - reg.,

2015-05-05 Thread Mary Holstege
If you really want CDATA escaping for certain elements, you can use the output options to control this. See https://docs.marklogic.com/guide/app-dev/appserver-control#id_86940 for how to set output options at the appserver level and

Re: [MarkLogic Dev General] Document not visible in WebDAV listing

2015-05-02 Thread Mary Holstege
Is the URI RCC06032011.xml or /RCC06032011.xml? WebDAV needs rooted paths to work. //Mary On 05/02/2015 05:42 AM, Danny Sinang wrote: We have a database that ML reports as having 31,000+ documents which appear to be mostly in the root directory of that database. When I try to open the said

Re: [MarkLogic Dev General] no schema validation error in MarkLogic but in 'Eclipse' and 'XMLStarlet'

2015-04-08 Thread Mary Holstege
It sounds like bug. Please report it. //Mary On Apr 8, 2015 5:19 AM, Vanhoutte Michiel michiel.vanhou...@wolterskluwer.be wrote: Hi, In attachment the following documents: mnrs6108899.xml xhtml5-ml-wkb.xsd semantics.xsd If I validate this document ‘mnrs6108899.xml’ outside MarkLogic

Re: [MarkLogic Dev General] Title extraction

2015-04-07 Thread Mary Holstege
On Tue, 07 Apr 2015 13:03:27 -0700, Robert De Vivo rdev...@virtify.com wrote: I have a requirement to extract study titles from clinical documents in PDF and MS Word formats. There is no reliable pattern to the text or the formatting of the titles, so my options for direct querying are

Re: [MarkLogic Dev General] Fwd: question about xdmp:encoding-language-detect

2015-03-27 Thread Mary Holstege
On Fri, 27 Mar 2015 08:44:35 -0700, Jakob Fix jakob@gmail.com wrote: Thanks Mary for your quick reply. It's an explanation that I understand, but this doesn't resolve my initial problem. Any idea how to solve this in the short term and whether there are improvements in the pipeline? Or

Re: [MarkLogic Dev General] Fwd: question about xdmp:encoding-language-detect

2015-03-27 Thread Mary Holstege
On Fri, 27 Mar 2015 08:23:19 -0700, Jakob Fix jakob@gmail.com wrote: Hello, I think this message got lost when the mailing list was down in February (or nobody has an answer ...) Thanks, Jakob. The xdmp:encoding-language-detect uses the ICU libraries to do the detection. Serbian and

Re: [MarkLogic Dev General] Polygon intersection over large document collections

2015-03-13 Thread Mary Holstege
So if I'm interested in finding documents whose location (Point, Polygon or LineString) intersects a given box I can run this type of query: cts:search(doc(), cts:path-geospatial-query(location/coordinates, cts:box(-10, -10, 10, 10), type=long-lat-point))[1 to 10] This works when

Re: [MarkLogic Dev General] bug or feature, random sequence order

2015-02-18 Thread Mary Holstege
The / says to put things in document order, but these are all in separate documents so the relative order is undefined. You can either wrap them in a single root element so they are part of the same document with a defined relative order, or use the ! operator instead. //Mary On Feb 18, 2015

Re: [MarkLogic Dev General] NOT_IN - phrase search

2015-02-18 Thread Mary Holstege
Make sure you have positions enabled. Not in requires them to work unfiltered. In this case that would be word positions. //Mary On Feb 18, 2015 6:14 AM, Dave Cassel dave.cas...@marklogic.com wrote: Dinesh, I just ran a test with the REST API and got the expected results. Could you give more

Re: [MarkLogic Dev General] Content-Processing for docx Documents

2015-02-05 Thread Mary Holstege
There are a couple of very chatty trace events that tell you exactly what CPF is seeing while it processes your documents: CPF Condition Invoke CPF Condition Result CPF Action Invoke CPF Action Complete The condition invoke/result pairs will tell you which conditions in the pipelines are being

Re: [MarkLogic Dev General] EBCDIC file to ASCII conversion

2015-01-28 Thread Mary Holstege
It turns out the encoding isn't applied to a document loaded as a binary file from the filesystem: that is just loaded with the bits as handed to the call. So load this as a text document (put formattext/format in the options). Since you expect to get ASCII out, that seems the reasonable

Re: [MarkLogic Dev General] Unfiltered search for wildcarded patterns.

2015-01-14 Thread Mary Holstege
On Wed, 14 Jan 2015 09:43:14 -0800, Rahul Gupta rahul.gu...@nagarro.com wrote: I need to run a cts:element-word-query on documents through wildcard search. I already have wildcard searched on for 3 character as well as created an index for the element. When I am running unfiltered search

Re: [MarkLogic Dev General] Running reverse-query over collection-query

2015-01-12 Thread Mary Holstege
On Mon, 12 Jan 2015 04:23:14 -0800, Ashish Bhardwaj napster.ash...@gmail.com wrote: ... let $query := query{cts:collection-query(test)}/query let $x := testaa/test (: any modification is allowed to run collection query over it :) return cts:contains($query, cts:reverse-query($x)) You

Re: [MarkLogic Dev General] Combining trigger databases

2014-12-19 Thread Mary Holstege
On Fri, 19 Dec 2014 06:11:27 -0800, Danny Sinang d.sin...@gmail.com wrote: We have 5 trigger databases (one for each Business Unit content database with its own CPF domain and pipelines). The less than 30 triggers stored in each of them appear to have been created solely by CPF. For

Re: [MarkLogic Dev General] Stemming/diacritics

2014-12-16 Thread Mary Holstege
On Tue, 16 Dec 2014 00:13:29 -0800, vi...@tilaton.fi wrote: Forgot to send this to the list, sorry about that. In addition to what is below, one more question: can we do multilanguage searches? We have content in multiple languages, and the end user is searching with keywords or

Re: [MarkLogic Dev General] Stemming/diacritics

2014-12-15 Thread Mary Holstege
On Mon, 15 Dec 2014 02:31:51 -0800, vi...@tilaton.fi vi...@tilaton.fi wrote: Hi we're trying to build a search that would find all words müller, muller and mueller using any of the three words. We've got müller and muller working as expected, but can't get mueller to play nice. (Or

Re: [MarkLogic Dev General] Schema validation incorrect for no-namespace document

2014-12-09 Thread Mary Holstege
On Tue, 09 Dec 2014 12:42:29 -0800, Will Thompson wthomp...@jonesmcclure.com wrote: I recently ran into some issues validating a no-namespace document. The schema was updated, which should have caused the document to fail validation, but it didn't. I have been using

Re: [MarkLogic Dev General] Geo-spatial API - Roadside Distance

2014-11-13 Thread Mary Holstege
Just to augment that: If you are interested in as-the-crow-flies distance, we do have cts:distance and cts:shortest-distance available. //Mary On 11/12/2014 10:46 PM, Wayne Feick wrote: No, that would require base maps to have knowledge of the road system, which is not part of the product.

Re: [MarkLogic Dev General] XQuery Vs XSLT - Any Performance benefit

2014-11-11 Thread Mary Holstege
I would expect performance to be similar or favor XQuery. A lot of the underlying implementation is shared, and where it is not, the XQuery has a lot more person years of optimization work behind it. //Mary On Nov 11, 2014 4:39 AM, abhishek.srivas...@cognizant.com wrote: Hi All, We are using

Re: [MarkLogic Dev General] XQuery Vs XSLT - Any Performance benefit

2014-11-11 Thread Mary Holstege
On Tue, 11 Nov 2014 12:42:28 -0800, Betty Harvey har...@eccnet.com wrote: My experience is that the XSLT and straight XQuery are comparable as far as performance. One exception is if you are moving up and down the tree or have a need to traverse the entire document several times, XQuery

Re: [MarkLogic Dev General] xdmp:output serialization options not working?

2014-10-14 Thread Mary Holstege
On Tue, 14 Oct 2014 12:28:52 -0700, David Sewell dsew...@virginia.edu wrote: Given this code: xquery version 1.0-ml; declare option xdmp:output indent-untyped=yes; declare option xdmp:output omit-xml-declaration=yes; xdmp:document-insert( /test.xml, doc lineline 1/line

Re: [MarkLogic Dev General] xsl:analyze-string grouping bug?

2014-09-08 Thread Mary Holstege
fn:analyze-string also works. It looks like the code that is processing the matches is getting confused in setting up the group context for regex-group for some reason. //Mary On Mon, 08 Sep 2014 06:45:20 -0700, David Lee david@marklogic.com wrote: A quick test with XQuery , fn:replace

Re: [MarkLogic Dev General] Attribute indexing

2014-08-19 Thread Mary Holstege
You can create a path index that ends in an attribute. //Mary On Aug 19, 2014 5:47 AM, irisDeveloper developerati...@gmail.com wrote: Hi all, I didn't find anything in database configuration , for attribute indexing. Does MarkLogic support xml attribute indexing? Thanks Samby

Re: [MarkLogic Dev General] two questions about using marklogic

2014-08-12 Thread Mary Holstege
On Aug 12, 2014 2:36 AM, qinggangwa...@gmail.com wrote: Dear All, I am researching for a task that need a good performance with OLAP and OLTP under the framework of hadoop. I have installed marklogic and hadoop succefully. There are two places that I do not understand well. Does

Re: [MarkLogic Dev General] Problems with Processing Instructions

2014-07-30 Thread Mary Holstege
On Wed, 30 Jul 2014 01:51:12 -0700, neil bradley n...@bradley.co.uk wrote: But it seems I cannot have a PI before a root element. This does not work when I enter it into QC, and I get “unexpected token syntax error”: ?hi there? Root?hi there?/Root The thing to know is that this may

Re: [MarkLogic Dev General] XSLT check if result document has been created

2014-07-28 Thread Mary Holstege
I think you may be running afoul of URI resolution. Since the URI you are giving to doc-available is a relative URI, it will be resolved relative to the static base URI, which per XSLT is the URI of the stylesheet itself. //Mary On Mon, 28 Jul 2014 07:58:04 -0700, Erik Zander

Re: [MarkLogic Dev General] Capturing cpf error details in ErrorLog.txt

2014-07-24 Thread Mary Holstege
On Thu, 24 Jul 2014 06:18:21 -0700, John Zhong j...@yuxipacific.com wrote: According to the default 'Status Change Handling' pipeline, it uses the default error handler /MarkLogic/cpf/actions/failure-action.xqy, which calls the cpf:failure api to handle the errors. If you look at this

Re: [MarkLogic Dev General] Strict validation error encountered in ML7 but not in ML6

2014-07-16 Thread Mary Holstege
On Wed, 16 Jul 2014 07:29:18 -0700, Damon Feldman damon.feld...@marklogic.com wrote: I can only think of wanting to clear the Expanded Tree Cache when you are doing performance testing and want to compare cold to warm caches. MarkLogic automatically invalidates documents in the cache

Re: [MarkLogic Dev General] Strict validation error encountered in ML7 but not in ML6

2014-07-14 Thread Mary Holstege
On Mon, 14 Jul 2014 10:43:40 -0700, Tim t...@aaom.net wrote: How do I clear the expanded tree cache? xdmp:expanded-tree-cache-clear() You need to be admin to run it. Tim -Original Message- From: Mary Holstege [mailto:mary.holst...@marklogic.com] Sent: Saturday, July 12, 2014 9

Re: [MarkLogic Dev General] Strict validation error encountered in ML7 but not in ML6

2014-07-12 Thread Mary Holstege
On Sat, 12 Jul 2014 11:20:08 -0700, Tim t...@aaom.net wrote: Hi Folks, When I run strict validation of a document against a schema using MarkLogic 6.0-2 (which has been running consistently and without a problem for years), it passes validation, but when I run strict validation using the

Re: [MarkLogic Dev General] Strict validation error encountered in ML7 but not in ML6

2014-07-12 Thread Mary Holstege
On Sat, 12 Jul 2014 12:45:47 -0700, Tim t...@aaom.net wrote: After a little more investigating, it looks like there is a problem updating a schema. In this case I'm simply adding some optional element attributes to the schema and updating it in the schemas database, but strict validation of

Re: [MarkLogic Dev General] Marklogic Collation equivalent to Chinese_PRC_CI_AS

2014-06-25 Thread Mary Holstege
On Wed, 25 Jun 2014 01:07:29 -0700, Ashish Bhardwaj napster.ash...@gmail.com wrote: Chinese_PRC_CI_AS. AFAICT CI = case insensitive AS = accent sensitive PRC = People's Republic of China So I think you want: http://marklogic.com/collation/zh/S2 zh = Chinese using PRC tailorings S2 = Case

Re: [MarkLogic Dev General] Multilanguage sorting

2014-06-13 Thread Mary Holstege
On Fri, 13 Jun 2014 00:59:20 -0700, Ashish Bhardwaj napster.ash...@gmail.com wrote: Hello Mike, Thanks a lot for your reply. I am storing data in Marklogic using the xml I mentioned before. Here is an simplified version of xml that I am storing in ML: language value

Re: [MarkLogic Dev General] Fw: Search results differ for words ending in 's

2014-06-12 Thread Mary Holstege
On Thu, 12 Jun 2014 03:32:51 -0700, Sikandar send2i...@yahoo.co.in wrote: We are using search:search and also enabled stem search in DB though search results differ for words ending in 's. Example when searching for teachers the results returned are for teacher and teachers, but when

Re: [MarkLogic Dev General] Slash and Dot is treated as word separator for search

2014-06-12 Thread Mary Holstege
On Thu, 12 Jun 2014 05:44:37 -0700, Amit Kumar Dev amit_...@infosys.com wrote: Hi, I am using one character search with term D* in class-number node of the xml. I am getting docs which has class-number starts with D In response I am also getting few documents which contains D after /

Re: [MarkLogic Dev General] Chasing - Bug in validating an element embedded in another element?

2014-06-03 Thread Mary Holstege
On Tue, 03 Jun 2014 08:46:00 -0700, Retter, Adam (RBI-UK) adam.ret...@rbi.co.uk wrote: Hi, I saw an email from Florent Georges to this mailing list in October 2012 that Mary Holstege initially replied to, but there was no further follow up. The original message is available here

Re: [MarkLogic Dev General] Path Range Query Not working as expected

2014-05-21 Thread Mary Holstege
On Wed, 21 May 2014 08:04:10 -0700, gnanaprakash.bodire...@cognizant.com wrote: Hi I tried to perform query trace on all these 3 queries. Interestingly all are selecting one fragment which is expected but the 2nd query is not giving me count and also when I remove count I am not

Re: [MarkLogic Dev General] One character wild card search without using filter search

2014-03-31 Thread Mary Holstege
need to enable single character index or is there any other alternative to achieve the same. Thanks and regards Amit Dev -Original Message- From: Mary Holstege [mailto:mary.holst...@marklogic.com] Sent: Friday, March 14, 2014 3:56 AM To: general@developer.marklogic.com; Amit Kumar

Re: [MarkLogic Dev General] One character wild card search without using filter search

2014-03-13 Thread Mary Holstege
On Tue, 11 Mar 2014 20:53:52 -0700, Amit Kumar Dev amit_...@infosys.com wrote: Hi All, * I am performing one character wild card search on title element. i.e. cts:element-query cts:elementTitle/cts:element cts:word-query weight=2

  1   2   3   >