<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The Bruised Edge &#187; XML</title>
	<atom:link href="http://weblog.kevinclarke.info/category/xml/feed/" rel="self" type="application/rss+xml" />
	<link>http://weblog.kevinclarke.info</link>
	<description>Digital Libraries, Repositories, Programming, Technology, Librarianship, etc.</description>
	<lastBuildDate>Wed, 28 Jul 2010 03:19:41 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>Do You Trust Your Data Modelers?</title>
		<link>http://weblog.kevinclarke.info/2006/12/18/do-you-trust-your-data-modelers/</link>
		<comments>http://weblog.kevinclarke.info/2006/12/18/do-you-trust-your-data-modelers/#comments</comments>
		<pubDate>Mon, 18 Dec 2006 19:47:15 +0000</pubDate>
		<dc:creator>ksclarke</dc:creator>
				<category><![CDATA[Databases]]></category>
		<category><![CDATA[XML]]></category>

		<guid isPermaLink="false">http://kevinclarke.info/weblog/?p=252</guid>
		<description><![CDATA[In the #code4lib IRC channel today Ed Summers asked me some good questions about storing metadata in a native XML database. The gist of his questions was that he wasn&#8217;t sure he saw any advantages that a native XML database might have over a relational database (yes, I&#8217;m simplifying a bit I&#8217;m sure). As we [...]]]></description>
			<content:encoded><![CDATA[<p>In the <a href="http://www.code4lib.org/irc/" title="#code4lib">#code4lib IRC</a> channel today <a href="http://www.inkdroid.org/journal/" title="Inkdroid">Ed Summers</a> asked me some good questions about storing metadata in a native XML database.  The gist of his questions was that he wasn&#8217;t sure he saw any advantages that a native XML database might have over a relational database (yes, I&#8217;m simplifying a bit I&#8217;m sure).  As we were winding down he said, &#8220;just preppin you for my questions at <a href="http://www.code4lib.org/2007" title="Code4Lib 2007">code4libcon</a>.&#8221;</p>
<p>My first thought after digesting the conversation was, &#8220;Hey, wait, I&#8217;m not even talking about native XML databases at code4libcon!&#8221;  My proposal is about using XQuery in the digital library realm.  True, we are using a native XML database here, but just because one uses XQuery doesn&#8217;t mean s/he is using a native XML database.  You can use XQuery just as easily with <a href="http://publib.boulder.ibm.com/infocenter/db2luw/v9/index.jsp?topic=/com.ibm.db2.xquery.doc/xqrbasics.html" title="DB2 XQuery">DB2</a> or <a href="http://www.oracle.com/technology/tech/xml/xquery/index.html" title="Oracle XQuery">Oracle&#8217;s database</a> (or <a href="http://www.gnu.org/software/qexo/" title="Qexo">files on the file system</a>).</p>
<p>The one thing that native XML databases and XQuery do have in common, though, is that they let you interact with your data directly &#8212; it doesn&#8217;t have to be deconstructed into another structure and then reconstructed when you want the whole thing back out again (in the case of XQuery being used over a relational database, that (de|re)construction takes place invisibly in the database layer).</p>
<p>But, is this a good thing?  Ed kept saying he didn&#8217;t see any data modeling going on with native XML databases.</p>
<p>There is data modeling going on with native XML databases, I&#8217;d suggest, but it happens on the metadata side of things.  Andrew Nagy made this observation recently on the code4lib mailing list when he noted how poorly just putting MARCXML into a native XML database performs.</p>
<p>This is because putting MARCXML into a native XML database makes MARCXML the data model.  MARC was intended for concise transfer, not for working with the data&#8230; it was assumed by the architects of MARC (I believe and hope) that MARC would be reconstructed into something else before anyone tried to do anything meaningful with it (for what it is worth, this only partially happens in the library world).</p>
<p>XQuery allows the developer to work more nimbly with the data models s/he is given (instead of mapping them into other data models that match the database s/he has chosen to use).  So, what are these data models?  They are the XML metadata standards being created by the different knowledge communities.  Unsurprisingly, to use these data models, the developer needs to know them (i.e., having programmers in our libraries is a better idea than contracting out to people not in the profession).</p>
<p>Are the people creating these (meta)data models working to make accessing the data easier?  That could be one critique leveled at native XML databases (from the perspective of the developers)&#8230; if you aren&#8217;t doing the data modeling, can you trust the people who are?</p>
<p>It&#8217;s not that bad though (put down that gun you cynical library developers); keep in mind that XQuery isn&#8217;t a fulltext query language.  Think of it more as a database query language (even though there doesn&#8217;t have to be a database).</p>
<p>To use XQuery in the digital library world, in my opinion, you still need to use a fulltext indexer (like <a href="http://lucene.apache.org/java/docs/index.html" title="Lucene">Lucene</a> or the type built into many XML-enabled databases). Indices may be used through proprietary extensions to the XQuery language (indicated by a different namespace) or through separate processes which feed the XQuery engine (as in a pre-processing stage).</p>
<p>For what it is worth, there is a <a href="http://www.w3.org/TR/xquery-full-text/" title="XQuery Fulltext">fulltext</a> extension to the XQuery spec that is being written to take advantage of these external indices, but it is not really out there in the world yet. In the meantime, even if our metadata models (e.g., MARCXML) aren&#8217;t the best, we can still create and use indices that provide an intelligent view of the data.</p>
<p>One nice thing about working directly with the data/data models you receive is that there are less &#8220;moving parts&#8221; to fix when things change (in other words, less things to get in the way) &#8212; because we all know digital library standards don&#8217;t change, right?</p>
<p>Rather than go through the process of re-mapping to the database&#8217;s structure, you only need to modify the parts of your code that deal with the parts of the metadata that have changed. You&#8217;d have to do this with the other option too&#8230; just because you have a standard way of normalizing data doesn&#8217;t mean that all data is structured in the same way (in terms of how you get at the pieces you want).</p>
<p>I could mention, I guess, some reasons why I like native XML databases in my presentation, but I&#8217;m not sure this is a good idea.  I think it may distract from the beauty that is XQuery.  I&#8217;m also hoping Andrew Nagy will cover this territory in his presentation comparing different native XML databases. One of XQuery&#8217;s strengths is that it is database agnostic; I shouldn&#8217;t stray from that.</p>
<p>For the record, the Ed Summers that appears in this post is not the real Ed (despite the conversation actually happening), but one of my own conception for rhetorical purposes only.  <img src='http://weblog.kevinclarke.info/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
]]></content:encoded>
			<wfw:commentRss>http://weblog.kevinclarke.info/2006/12/18/do-you-trust-your-data-modelers/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Renesis of SVG</title>
		<link>http://weblog.kevinclarke.info/2005/08/16/the-renesis-of-svg/</link>
		<comments>http://weblog.kevinclarke.info/2005/08/16/the-renesis-of-svg/#comments</comments>
		<pubDate>Tue, 16 Aug 2005 23:33:17 +0000</pubDate>
		<dc:creator>ksclarke</dc:creator>
				<category><![CDATA[XML]]></category>

		<guid isPermaLink="false">http://kevinclarke.info/weblog/?p=217</guid>
		<description><![CDATA[Just heard a very interesting presentation and (amusing) question and answer period on Renesis, the soon to be released SVG viewer and development platform from EvolGrafix. It was one of the parts of the conference I was most looking forward to (for practical reasons) and I was not disappointed (though it would have been nicer [...]]]></description>
			<content:encoded><![CDATA[<p>Just heard a very interesting presentation and (amusing) question and answer period on <a href="http://www.gosvg.net/?page_id=2">Renesis</a>, the soon to be released SVG viewer and development platform from <a href="http://www.evolgrafix.com/">EvolGrafix</a>. It was one of the parts of the conference I was most looking forward to (for practical reasons) and I was not disappointed (though it would have been nicer if they had surprised us all and made another beta version available — there was one at one point, apparently, that reached the 1.5 million download mark in only two weeks).</p>
<p>The talk started out with what they see as their business model for developing their suite of tools (which will contain a freely available (as in beer) cross-browser plugin that fully supports the upcoming SVG 1.2 standard). In short, they see “software as a service” as an effective business plan — one that will enable them to release the plugin for free (the plugin should be okay with any valid SVG so won’t have to be used with their non-free products). The presentation then moved into the description of the product (which sounds really great). It ended with a long question and answer period from the SVG experts in audience.</p>
<p>I think it was the longest and most amusing question and answer period I have ever heard at a conference. Unlike many of the presentations, this was really a vendor presentation — not so much of a presentation on a particular project. As a result, there were many claims about what their product will do and, since it will not be released until the end of the year, the experts in the audience were (probably rightfully) a bit skeptical (after all the product claims to have done what many in the XML community have been wanting for awhile now). The questions from the SVG experts ranged from the obvious to some really interesting clarifications of EvolGrafix’s claims.</p>
<p>It was also amusing because it seems that EvolGrafix has not been involved in the W3C and has been vocal about what could be done if the standard was moved along a little quicker than it has been. There was, of course, an invitation from one of the SVG working group members for EvolGrafix to join the group and help move the standard along more quickly (so that it can better compete with Flash — which would be good for any and all implementors). The whole question and answer period had the audience tittering. Quite amusing…</p>
<p>Another interesting snippet from the presentation was that EvolGrafix does not plan to implement any form of XForms (interesting to us in particular because towards XForms is the direction in which we’ve been wanting to move). They see sXBL as the future direction for forms and data binding. I don’t know much about sXBL. From what I gather, instead of having a compound document that might contain an XForm and an SVG image (along with other XHTML), sXBL would allow the binding of data to exist in the SVG component.</p>
<p>Also interesting… Renesis will support interacting with the DOM of the XHTML of the page; it will not, though, if I understand correctly, allow interaction between the SVG and JavaScript in the HTML. This is because Renesis does all its ECMAScript and C# work in its own interpretter (one for each of those languages). Yes, they have implemented their own C# interpretter and JavaScript interpretters (the whole SVG app could be compiled to native code if one wanted to do that — and that is one of the options they see as a possibility further down the line).</p>
<p>This sounds like a huge project and I cannot wait til the end of the year to see if a product is released. If it is, and it really supports the 1.2 spec (and all they have promised) without requiring any proprietary dependencies, this could really benefit SVG’s growth (especially since Adobe will probably be moving more towards Flash with their purchase of Macromedia). One has to take a wait and see approach I think. If it happens, though, this could help push us towards SVG as a viable way to add more interactivity into our web applications.</p>
]]></content:encoded>
			<wfw:commentRss>http://weblog.kevinclarke.info/2005/08/16/the-renesis-of-svg/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>SVG Open 2005</title>
		<link>http://weblog.kevinclarke.info/2005/06/01/svg-open-2005/</link>
		<comments>http://weblog.kevinclarke.info/2005/06/01/svg-open-2005/#comments</comments>
		<pubDate>Wed, 01 Jun 2005 22:13:08 +0000</pubDate>
		<dc:creator>ksclarke</dc:creator>
				<category><![CDATA[XML]]></category>

		<guid isPermaLink="false">http://kevinclarke.info/weblog/?p=216</guid>
		<description><![CDATA[Hooray! I’ve been approved to attend SVG Open this year. It looks like a very interesting conference. We are not currently doing anything with SVG (I’d be interested to hear from libraries who are), but are very interested by many of the possibilities that it offers. We currently have an image navigator/page-turning site, but there [...]]]></description>
			<content:encoded><![CDATA[<p>Hooray! I’ve been approved to attend <a href="http://www.svgopen.org/2005/">SVG Open</a> this year. It looks like a very interesting conference. We are not currently doing anything with SVG (I’d be interested to hear from libraries who are), but are very interested by many of the possibilities that it offers. We currently have an <a href="http://diglib1.princeton.edu/">image navigator/page-turning site</a>, but there isn’t a great deal of interactivity that can be accomplished without using a plugin of some sort (though I wonder about AJAX now that Google is doing so much with it).</p>
<p>We are thinking about adding these additional features as an optional bonus… users could still use the existing, more accessible, site or could choose the more interactive one (that would require a plugin — at least for the time being). We looked at the possibility of a Java applet, a Flash script, or something with SVG. SVG is the most appealing because it is XML and would fit with our current architecture (and because it is an open standard and not owned by any one company (as a side note: it will be interesting to see what happens with Adobe now that they own Flash and the leading SVG plugin)).</p>
<p>SVG, not currently supported natively in browsers (though Opera and Firefox/Mozilla have committed to building in native support), has seen a lot of growth lately in the cell phone market (and in bleeding edge desktops, like Gnome, which support SVG for images/icons). There are also <a href="http://www.gosvg.net/?page_id=2">new SVG plugins</a> waiting in the wings to be announced (supposedly at the SVG Open conference). Still, SVG is far from a widely implemented technology.</p>
<p>What we would like from it is some sort of traditional page turning animation where turning the pages results in a page curl. That is not the key feature though. Being able to select regions of interest dynamically (instead of relying on predefined tiles) and being able to zoom in an out more fluidly would be nice (we can do this with our own interface now, but not so interactively as we would like). I think going to the conference will be a fact finding mission. Does SVG do what we think it can? Who is doing interesting work with SVG (from which we might be able to benefit).</p>
<p>There are some digital libraries out there doing interesting work with Flash and Java applets, but having everything in XML is just too convenient (and future proof(?)) What we really need, though, is for a tool to support loading JPEG2000 images into an SVG document (Batik currently doesn’t do this). We’re thinking about getting rid of our archival TIFF images (and just keeping a backed-up JP2 image); it would also be nice to not have to go from JP2 to JPEG to SVG (going, instead, just from JP2 to SVG). I don’t know… I’m surfing the Web trying to learn more about this. I expect the conference will also be of great benefit.</p>
]]></content:encoded>
			<wfw:commentRss>http://weblog.kevinclarke.info/2005/06/01/svg-open-2005/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
