Split a document using MarkLogic mlcp

I need to split this document:

<?xml version="1.0"?>
<!DOCTYPE docs SYSTEM "../roempp11.dtd">
<docs>
  <stwtext id="RD-10-00258" update="03.2011" seq="RQ-10-00001">
    <head>
      <ti> <i>j</i> </ti>
      <ff-list> <ff id="0103" /> </ff-list>
    </head>
    <p> Symbol für die <vw idref="RD-19-04447">Stromdichte</vw> . </p>
  </stwtext>
  <stwtext id="RD-10-00209" update="12.2007" seq="RQ-10-00223">
    <head>
      <ti>JZ</ti>
      <ff-list> <ff id="0932" /> </ff-list>
    </head>
    <p> Abkürzung für Jod-Zahl, siehe <vw idref="RD-06-00645">Fettkennzahlen</vw> . </p>
  </stwtext>
</docs>

I do it with this command:

~> bin/mlcp.sh IMPORT -mode local -host localhost -port 15000 \
     -username admin -password admin \
     -input_file_path /media/sf_vm.shared/thieme/roemp-training/v10.new-ML.XML \
     -output_uri_replace "/media/sf_vm.shared/thieme/roemp-training/keywords,'roempp-data'" \
     -output_collections roemp-data \
     -input_file_type aggregates -aggregate_record_element stwtext \
     -aggregate_uri_id @id

The command runs without errors, but the resulting documents in MarkLogic are not named after the id attribute of each stwtext element. Instead they take the id of the last element inside the record that carries an id attribute. For example, for my document I expect to see

RD-10-00258 RD-10-00260

but actually it looks like this:

0103 0932

Is this a bug, or did I do something wrong? Thanks.

-------------Problems Reply------------

It's a bug. If you'd like to, you can download the source code for MLCP and change it. Take a look at AggregateXMLReader.java's processStartElement().
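
Until the reader is patched, one possible workaround is to split the aggregate file yourself before loading it. The following is only a rough sketch in Python, not MLCP code: the function name split_records and its parameters are made up for illustration, and the sample is the document from the question without its DOCTYPE line. It shows the behaviour the question expects, where the id is read from the record element itself and never from a nested element such as ff.

```python
import xml.etree.ElementTree as ET

# Sample taken from the question (DOCTYPE omitted for brevity).
SAMPLE = """<docs>
  <stwtext id="RD-10-00258" update="03.2011" seq="RQ-10-00001">
    <head><ti><i>j</i></ti><ff-list><ff id="0103" /></ff-list></head>
    <p>Symbol für die <vw idref="RD-19-04447">Stromdichte</vw>.</p>
  </stwtext>
  <stwtext id="RD-10-00209" update="12.2007" seq="RQ-10-00223">
    <head><ti>JZ</ti><ff-list><ff id="0932" /></ff-list></head>
    <p>Abkürzung für Jod-Zahl, siehe <vw idref="RD-06-00645">Fettkennzahlen</vw>.</p>
  </stwtext>
</docs>"""


def split_records(xml_text, record_element="stwtext", id_attribute="id"):
    """Hypothetical helper: map each record's own id to its serialized XML."""
    root = ET.fromstring(xml_text)
    records = {}
    for record in root.iter(record_element):
        # Read the attribute from the record element only, so ids on
        # nested elements (for example <ff id="0103"/>) are ignored.
        record_id = record.get(id_attribute)
        records[record_id] = ET.tostring(record, encoding="unicode")
    return records


records = split_records(SAMPLE)
print(list(records))
```

Each value in the returned dictionary is a standalone stwtext fragment that could be written to its own file and loaded as a regular document, with the key used to build the URI.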

Category:marklogic Views:0 Time:2019-03-14
Tags: marklogic
