<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>it’s all semantics</title>
	<atom:link href="http://blog.silverchair.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.silverchair.com</link>
	<description>Semantic Strategy Insights for Publishers</description>
	<lastBuildDate>Mon, 17 May 2010 14:51:23 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='blog.silverchair.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://www.gravatar.com/blavatar/c65cb61b7068f8507109857024ad8976?s=96&#038;d=http://s2.wp.com/i/buttonw-com.png</url>
		<title>it’s all semantics</title>
		<link>http://blog.silverchair.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://blog.silverchair.com/osd.xml" title="it’s all semantics" />
	<atom:link rel='hub' href='http://blog.silverchair.com/?pushpress=hub'/>
		<item>
		<title>Semantic Web Elevator Speech</title>
		<link>http://blog.silverchair.com/2010/05/17/semantic-web-elevator-speech/</link>
		<comments>http://blog.silverchair.com/2010/05/17/semantic-web-elevator-speech/#comments</comments>
		<pubDate>Mon, 17 May 2010 13:27:45 +0000</pubDate>
		<dc:creator>Pam Harley</dc:creator>
				<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Web 3.0]]></category>
		<category><![CDATA[taxonomy]]></category>
		<category><![CDATA[Cortex taxonomy]]></category>
		<category><![CDATA[UMLS]]></category>

		<guid isPermaLink="false">http://blog.silverchair.com/?p=429</guid>
		<description><![CDATA[On today’s HTML-based web, Jabin’s shopping list is tagged so that it can be displayed as a list, but not much else. The future’s semantic shopping list defines the list as relating to a trip to the grocery store, and adds a category for each list item, allowing a computer to do much more with it than simply rendering it in list format. <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.silverchair.com&blog=8554914&post=429&subd=semedica&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<p>Many thanks to Jabin White, Director of Strategic Content for Wolters Kluwer Health Professional &amp; Education division, for my new “elevator speech” for explaining the Semantic Web (<a href="http://www.scientificamerican.com/article.cfm?id=the-semantic-web" target="_blank">originally conceived by Tim Berners-Lee</a>). My “aha” moment came during Jabin’s presentation at a recent Association of American Publishers Professional and Scholarly (AAP PSP) division <a href="http://www.pspcentral.org/SemanticPublishingFlyer.cfm" target="_blank">seminar on semantic publishing</a>. Here’s the relevant slide: </p>
<p style="text-align:center;">
<div id="attachment_432" class="wp-caption aligncenter" style="width: 478px"><a href="http://semedica.files.wordpress.com/2010/05/jabinwhite_semanticshopping1.gif" target="_blank"><img class="size-full wp-image-432 " title="Jabin White's Semantic Web Shopping List" src="http://semedica.files.wordpress.com/2010/05/jabinwhite_semanticshopping1.gif?w=468&#038;h=254" alt="Jabin White's Semantic Web Shopping List" width="468" height="254" /></a><p class="wp-caption-text">Semantic Web shopping list (courtesy of Jabin White)</p></div>
<p style="text-align:left;">The Semantic Web will both <em>require</em> and <em>consume</em> metadata of the type shown in the right-hand, XML-labeled column. On today’s HTML-based web, Jabin’s shopping list is tagged so that it can be displayed as a list, but not much else. The future’s <em>semantic</em> shopping list defines the list as relating to a trip to the grocery store, and adds a category for each list item, allowing a computer to do much more with it than simply rendering it in list format. With the right “app,” Jabin’s semantic list could be sorted to match the aisle-by-aisle layout as he walks into a grocery store. Or it could warn him that his ratio of grain to veggie will blow his diet. </p>
<p style="text-align:left;">Of course, still to come to enable a full realization of the Semantic Web is a controlled vocabulary (or taxonomy, or ontology) to serve as the authority file—the “single source of truth”—for these semantic categories and concepts, so that systems can understand that “veggie” is the same thing as “vegetable,” and that “grain” in this context doesn’t refer to sand or bullet weights. You can imagine the challenges of language variation and disambiguation as we move from grocery shopping to more complex domains such as in the scientific, technical, and medical world. </p>
<p style="text-align:left;">In the medical domain, the National Library  of Medicine’s UMLS metathesaurus serves as just such an authority file. UMLS maps—to a single unique concept ID—the many different terminologies used in health care (to define insurance reimbursement and billing codes, disease diagnoses and treatments, drug information, medical literature concepts, and so on; UMLS maps 100+ terminologies). Silverchair’s <a href="http://www.semedica.com/cortex.aspx" target="_blank">Cortex biomedical taxonomy</a> is connected to UMLS through this unique ID, but our team of taxonomy and subject matter experts work continually to make sure that Cortex and its companion Equivalents Server include the new concepts and evolving language that we encounter daily in the medical literature and through user interactions with our applications. We add new concepts and equivalents every working day, but they often don’t make it into the UMLS mapping for months or even years—an eternity in a fast-changing field like medicine. </p>
<p style="text-align:left;">But even with its drawbacks, UMLS gives medicine a significant leg up in terms of semantic infrastructure. At Silverchair we are betting that the promise of the Semantic Web will be realized for medicine well before most other domains.</p>
<br />Filed under: <a href='http://blog.silverchair.com/category/semantic-web/'>Semantic Web</a>, <a href='http://blog.silverchair.com/category/taxonomy/'>taxonomy</a>, <a href='http://blog.silverchair.com/category/web-3-0/'>Web 3.0</a> Tagged: <a href='http://blog.silverchair.com/tag/cortex-taxonomy/'>Cortex taxonomy</a>, <a href='http://blog.silverchair.com/tag/semantic-web/'>Semantic Web</a>, <a href='http://blog.silverchair.com/tag/taxonomy/'>taxonomy</a>, <a href='http://blog.silverchair.com/tag/umls/'>UMLS</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/semedica.wordpress.com/429/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/semedica.wordpress.com/429/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/semedica.wordpress.com/429/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/semedica.wordpress.com/429/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/semedica.wordpress.com/429/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/semedica.wordpress.com/429/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/semedica.wordpress.com/429/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/semedica.wordpress.com/429/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/semedica.wordpress.com/429/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/semedica.wordpress.com/429/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.silverchair.com&blog=8554914&post=429&subd=semedica&ref=&feed=1" />]]></content:encoded>
			<wfw:commentRss>http://blog.silverchair.com/2010/05/17/semantic-web-elevator-speech/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/d4feb71f4be6fa21b7184ec2bfbc1b83?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">Pam Harley</media:title>
		</media:content>

		<media:content url="http://semedica.files.wordpress.com/2010/05/jabinwhite_semanticshopping1.gif" medium="image">
			<media:title type="html">Jabin White's Semantic Web Shopping List</media:title>
		</media:content>
	</item>
		<item>
		<title>Where Does It Hurt?</title>
		<link>http://blog.silverchair.com/2010/05/03/where-does-it-hurt/</link>
		<comments>http://blog.silverchair.com/2010/05/03/where-does-it-hurt/#comments</comments>
		<pubDate>Mon, 03 May 2010 20:20:55 +0000</pubDate>
		<dc:creator>Elizabeth Willingham</dc:creator>
				<category><![CDATA[business trends]]></category>

		<guid isPermaLink="false">http://blog.silverchair.com/?p=422</guid>
		<description><![CDATA[We&#8217;re looking forward to the report that comes from this survey posted by the Gilbane Group, designed &#8220;to identify a number of &#8216;pain points&#8217; or barriers encountered by book publishers when it comes to developing or expanding digital publishing programs&#8230;&#8221; and encourage you to complete it if you&#8217;re on the front lines of digital transformation. [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.silverchair.com&blog=8554914&post=422&subd=semedica&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<p>We&#8217;re looking forward to the report that comes from <a href="http://gilbane.com/publishing/2010/04/the-gilbane-group-survey-for-book-publishing-professionals.html" target="_blank">this survey</a> posted by the Gilbane Group, designed &#8220;to identify a number of &#8216;pain points&#8217; or barriers encountered by book publishers when it comes to developing or expanding digital publishing programs&#8230;&#8221; and encourage you to complete it if you&#8217;re on the front lines of digital transformation. We have some ways to help with those obstacles and will comment on the report after it&#8217;s published.</p>
<br />Filed under: <a href='http://blog.silverchair.com/category/business-trends/'>business trends</a>  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/semedica.wordpress.com/422/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/semedica.wordpress.com/422/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/semedica.wordpress.com/422/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/semedica.wordpress.com/422/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/semedica.wordpress.com/422/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/semedica.wordpress.com/422/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/semedica.wordpress.com/422/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/semedica.wordpress.com/422/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/semedica.wordpress.com/422/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/semedica.wordpress.com/422/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.silverchair.com&blog=8554914&post=422&subd=semedica&ref=&feed=1" />]]></content:encoded>
			<wfw:commentRss>http://blog.silverchair.com/2010/05/03/where-does-it-hurt/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/e71ef4b47e6ba5c898bdeabbb8e47d6e?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">Elizabeth Willingham</media:title>
		</media:content>
	</item>
		<item>
		<title>The Real World &gt; Silverchair</title>
		<link>http://blog.silverchair.com/2010/04/02/the-real-world-silverchair/</link>
		<comments>http://blog.silverchair.com/2010/04/02/the-real-world-silverchair/#comments</comments>
		<pubDate>Fri, 02 Apr 2010 21:38:37 +0000</pubDate>
		<dc:creator>Elizabeth Willingham</dc:creator>
				<category><![CDATA[classification/tagging]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[taxonomy]]></category>
		<category><![CDATA[medical terminology]]></category>
		<category><![CDATA[semantic tagging]]></category>
		<category><![CDATA[thesaurus]]></category>

		<guid isPermaLink="false">http://blog.silverchair.com/?p=405</guid>
		<description><![CDATA[The thesaurus supporting our Cortex medical taxonomy is distinguished from other standards by its inclusion of “real-world” equivalents. We generally call these “equivalents” rather than synonyms because we include things that arguably aren’t purely synonyms—jargon or shorthand versions of medical terminology that we run across in the medical literature we tag. More often, though, we [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.silverchair.com&blog=8554914&post=405&subd=semedica&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-full wp-image-407" title="The Real World logo" src="http://semedica.files.wordpress.com/2010/04/the_real_world_logo_svg.png?w=250&#038;h=187" alt="The Real World logo" width="250" height="187" />The thesaurus supporting our Cortex medical taxonomy is distinguished from other standards by its inclusion of “real-world” equivalents. We generally call these “equivalents” rather than synonyms because we include things that arguably aren’t purely synonyms—jargon or shorthand versions of medical terminology that we run across in the medical literature we tag. More often, though, we learn about these equivalents (and common misspellings, which we also add to our thesaurus) by reviewing search queries submitted by real users to the sites we’ve built. Some examples: “C diff” for “<em>Clostridium difficile</em>,” “FB in foot” for “foreign body in foot,” “P4P” for “pay for performance,” “echo” for “echocardiography.” </p>
<p>Unlike some taxonomies that have a more “academic” (read: stodgy) approach to what is considered a synonym, we put real-world equivalents in our thesaurus because we want it to work for <em>real-world users.</em> Many users of our health care information sites are pressed for time and are looking for an answer to a specific question. They shouldn’t have to think very hard about how to structure a query so that a search engine can understand it. It’s <em>our</em> job to be knowledgeable about both their language <em>and</em> their lingo. At Silverchair, we believe the searcher is never wrong (our version of “the customer is always right”).</p>
<p>Bob Wachter, with whom we’re privileged to work on two sites sponsored by the Agency for Healthcare Research and Quality (<a href="http://www.psnet.ahrq.gov/" target="_blank">PSNet</a> and <a href="http://webmm.ahrq.gov/" target="_blank">WebM&amp;M</a>), recently wrote a <a href="http://community.the-hospitalist.org/blogs/wachters_world/archive/2010/03/04/verb-alizing.aspx" target="_blank">humorous blog post</a> about the way his hospital colleagues at UCSF (and other hospitals) commonly turn the nouns of their everyday work life into verbs as a shorthand way of communicating. For example, a resident might report that she “heparinized” her patient, or that a patient ready for discharge has been “housed and spoused,” meaning it had been determined that the patient had somewhere to go and someone to care for him. In addition, he reports the creation of new terms based on healthcare IT functionality, based, for example, on the way buttons are named in an EHR (“I done-ed it”).</p>
<p>That <a href="http://community.the-hospitalist.org/blogs/wachters_world/archive/2010/03/04/verb-alizing.aspx" target="_blank">post</a> is a fun reminder of the many ways medical lingo—and language—evolve, and the importance of attentive, systematic approaches to managing and supporting the information needs of those who invent the common parlance of their specialty in the course of doing their work (we hope, while using the sites we develop for them).</p>
<br />Filed under: <a href='http://blog.silverchair.com/category/classificationtagging/'>classification/tagging</a>, <a href='http://blog.silverchair.com/category/search/'>search</a>, <a href='http://blog.silverchair.com/category/taxonomy/'>taxonomy</a> Tagged: <a href='http://blog.silverchair.com/tag/medical-terminology/'>medical terminology</a>, <a href='http://blog.silverchair.com/tag/search/'>search</a>, <a href='http://blog.silverchair.com/tag/semantic-tagging/'>semantic tagging</a>, <a href='http://blog.silverchair.com/tag/taxonomy/'>taxonomy</a>, <a href='http://blog.silverchair.com/tag/thesaurus/'>thesaurus</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/semedica.wordpress.com/405/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/semedica.wordpress.com/405/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/semedica.wordpress.com/405/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/semedica.wordpress.com/405/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/semedica.wordpress.com/405/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/semedica.wordpress.com/405/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/semedica.wordpress.com/405/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/semedica.wordpress.com/405/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/semedica.wordpress.com/405/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/semedica.wordpress.com/405/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.silverchair.com&blog=8554914&post=405&subd=semedica&ref=&feed=1" />]]></content:encoded>
			<wfw:commentRss>http://blog.silverchair.com/2010/04/02/the-real-world-silverchair/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/e71ef4b47e6ba5c898bdeabbb8e47d6e?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">Elizabeth Willingham</media:title>
		</media:content>

		<media:content url="http://semedica.files.wordpress.com/2010/04/the_real_world_logo_svg.png" medium="image">
			<media:title type="html">The Real World logo</media:title>
		</media:content>
	</item>
		<item>
		<title>What Is an “eBook?”</title>
		<link>http://blog.silverchair.com/2010/03/12/what-is-an-%e2%80%9cebook%e2%80%9d/</link>
		<comments>http://blog.silverchair.com/2010/03/12/what-is-an-%e2%80%9cebook%e2%80%9d/#comments</comments>
		<pubDate>Fri, 12 Mar 2010 23:01:20 +0000</pubDate>
		<dc:creator>Thane Kerner</dc:creator>
				<category><![CDATA[business trends]]></category>

		<guid isPermaLink="false">http://blog.silverchair.com/?p=378</guid>
		<description><![CDATA[The term "eBook" is being used to describe a range of digital information products with widely varying functionality and purposes. This oversimplification is interfering with decision-making by both content producers and consumers.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.silverchair.com&blog=8554914&post=378&subd=semedica&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<p>Two recently released eBook surveys—the <a href="http://highwire.stanford.edu/PR/HighWireEBookSurvey2010.pdf">HighWire Press<em> 2009 Librarian eBook Survey</em></a> and <a href="http://www.aptaracorp.com/index.php?/eBook-survey1.html">Aptara’s <em>eBooks: </em><em>Uncovering Their Impact on the Publishing Market</em></a>—serve as a catalyst to consolidate a set of inchoate notions that have multiplied in this season of bustling enthusiasm for everything “eBook.” Namely, what exactly <em>is</em> an eBook?</p>
<p>I propose that there exists broad confusion about what constitutes an eBook, and that this definitional problem is interfering with decision making by both content producers and consumers.</p>
<p>Any major product introduction by Apple commands intense awareness, and the <a href="http://www.apple.com/ipad/">iPad </a>was no exception. After the Kindle surge over the recent holidays—<a href="http://atlanticwire.theatlantic.com/opinions/view/opinion/Amazon-Announces-Kindle-Books-Outsell-Real-Books-2034"><em>Atlantic Monthly</em> reported</a> that on Christmas Day 2009, Amazon’s “e-book” [quotation marks mine] sales exceeded hard-copy sales for the first time in Amazon’s history—the lay public probably thinks it has a working definition. Likewise, the term is often invoked in the publishing world as if it is definitive (the recent <a href="http://www.toccon.com/toc2010/public/schedule/grid">O’Reilly Tools of Change conference</a> in New York included no fewer than­­ 11 sessions with “eBook” in the <em>title</em> (which list excludes many more that were substantively about the topic despite not meeting this particular criterion).</p>
<p>But in practical usage, a shared definition remains elusive. I have seen everything from a Kindle download to relational content databases referred to as an eBook. <a href="http://en.wikipedia.org/wiki/Ebook">Wikipedia </a>says an eBook is “an <a title="E-text" href="http://en.wikipedia.org/wiki/E-text">e-text</a> that forms the <a title="Digital media" href="http://en.wikipedia.org/wiki/Digital_media">digital media</a> equivalent of a conventional printed <a title="Book" href="http://en.wikipedia.org/wiki/Book">book</a>,” while the <a href="http://www.oed.com/">OED </a>defines an eBook as “an electronic version of a printed book which can be read on a personal computer or hand-held device designed specifically for this purpose.” However, <a href="http://www.planetebook.com/mainpage.asp?webpageid=70">Planet eBook says</a> it “can be anything from the digital version of a paper book, to more interactive content that includes hyperlinks and multimedia. It can even be the electronic reading device such as a Rocket eBook or Pocket PC.”</p>
<p>For purposes of this discussion, let’s oversimplify and identify two possible definitions: a <strong>Simple eBook</strong> (one title, unconnected, Wikipedia/OED-style) and a <strong>Rich eBook</strong> (interactive, interconnected, Planet eBook-style).</p>
<p><a href="http://semedica.files.wordpress.com/2010/03/ebookspectrum.jpg"><img class="alignleft size-full wp-image-380" title="EbookSpectrum" src="http://semedica.files.wordpress.com/2010/03/ebookspectrum.jpg?w=612&#038;h=286" alt="" width="612" height="286" /></a></p>
<p>The HighWire survey, aimed at Librarians, contains significant evidence of confusion between the two poles, both in its questions and in the user responses and comments. For example, responses to the final question, “<em>Are there any additional important factors not covered above that publishers should consider when publishing ebooks?” </em>include:</p>
<ul>
<li>&#8220;eBooks are most useful in medicine, computer science, and other areas where <strong>frequent</strong> <strong>updates</strong> are important.&#8221;</li>
<li>&#8220;Users use eBooks to <strong>answer a particular question, not to find a particular book</strong>. Therefore, a critical mass of material within one collection is important.&#8221;</li>
<li>&#8220;Promote title‐level and <strong>page‐level</strong> <strong>linking via social software </strong>and course management software.&#8221;</li>
<li>&#8220;eJournals have shown that it is possible for electronic publishing to <strong>transform the delivery and use of information</strong>, and the same can happen with eBooks.&#8221;</li>
</ul>
<p>These perspectives strongly imply a Rich eBook model. But in response to the same query, others said:</p>
<ul>
<li>&#8220;…linking <strong>index to actual page numbers</strong> (is an) important feature&#8221;</li>
<li>&#8220;Content should be available for <strong>course reserve&#8221;</strong></li>
<li>&#8220;A <strong>common user interface</strong> for ebooks is needed. This could be an open‐source product&#8221;</li>
</ul>
<p>…responses that suggest an understanding of “eBook” much more like our Simple model.</p>
<p>The Aptara survey, which collects responses from publishers, exclusively (if not explicitly) implies the Simple eBook model. One can quickly discern this focus in <a href="http://www.aptaracorp.com/index.php?/q4-devicesupport.html">the question about device support</a>. Note that the percentages sum to 100, even though certain of these platforms overlap (Kindle has a iPhone app, for instance, and many web-based content applications offer alternative interfaces optimized for smartphone access). (Rich eBooks will generally not function in dedicated hardware environments.)</p>
<p>As I encounter people trying to plan for their future enterprises (be they authors, publishers, service providers, librarians and other licensees, or individual content consumers), I find many are conflating information about this very disparate continuum of information products. If the terms of the research are ambiguous or incomplete, then it risks obfuscating market realities. For instance, HighWire’s survey reports that 81% of respondents have a budget for electronic resources above $100,000, yet 91% report that “eBooks” represent less than 10% of their overall collection budget. Not only do these two questions confuse the numeric scale ($ v %), but they fail to compare the same resources, first citing “electronic resources” then switching to “eBooks”. What comprises the former category? Is it electronic journals+eBooks? If so, what is included in and excluded from the latter? Where do electronic databases fit? What about what we’re temporarily herein calling “Rich eBooks?”</p>
<p>In professional publishing, the simple concept of a downloadable unit, the content of which is the precise equivalent of the print, has broken down. Sure, a reader can buy professional eBooks that meet this definition. But more important, prevalent, and useful (despite the current eBook hype) are content systems that are addressing the <em>fundamental</em> purposes of a professional book but radically rethinking the means by which we achieve them. This rethink expresses itself in both the creation and the consumption modes of the Resource Formerly Known As The Professional Book.</p>
<p>So, what are the purposes of a professional book? Education, broadly defined: we read a professional book to learn, either in a formal educational context, or in a maintenance/current-awareness mode. The primary audience reads to edify itself, at whatever level of abstraction and practical applicability, about subjects related to its professional activities. This is distinct from (if overlapping at the margins with) reading at leisure/for pleasure, which is the primary purpose of trade books. Authoring a professional book is an exercise in consolidating and reporting the state of the art, proposing new perspectives and ideas that will be added to a foundation of shared understanding, and/or integrating concepts from disparate disciplines. An author (or editor) gathers and organizes preexisting information, synthesizes it, and sometimes posits new or alternative conclusions, attempting to expand on the shared foundation of the domain.</p>
<p>Given the technological tools available, the simple eBook as defined above is insufficient to support the central function of professional books. It confines a reader to a quarantined jurisdiction, administered by an individual author or editor, fixed at the moment of its publication, unconnected to related information resources that are often crucial to the context of the shared domain.</p>
<p>As a result, I suspect the prospects for the Simple eBook model are much more limited in the professional domain than they are in the world of trade, where it is far more closely aligned with the fundamental purpose of reading (and helpfully solves certain problems like portability and access). Instead, the Resource Formerly Known As The Professional Book will, in the service of its fundamental educational purpose, morph into something much more interactive, dynamic, and temporal than a Simple eBook. In fact, resources like this have already developed, resources that make the professional (Simple) eBook obsolete even as the format gains ascendancy in the popular imagination and the trade market. If you think about it, you’re likely already familiar with rudimentary examples, and you probably use some of the tools that will converge to make this professional post-book a reality.</p>
<p>For instance, think of aggregated content applications—in medicine, products like <a href="http://www.mdconsult.com">MDConsult</a> (from Elsevier) and <a href="http://www.accessmedicine.com/">AccessMedicine </a>(McGraw-Hill). These systems allow a user to focus on addressing the educational (again, in the broadest sense) objective with multiple sources of information conveniently organized into a single interface. Even this now well-understood innovation renders the education process far more efficient than the use of individual eBooks. How about Wikipedia or YouTube, continuously evolving reference databases that are good places to start many investigations (and could never be a book)? Social networks, from the 800-pound gorilla Facebook to the exclusive and targeted <a href="http://www.cardioexchange.org/login">CardioExchange</a>, are increasingly being used to annotate and aggregate content around professional domains. And Macmillan has taken a next step (as have a number of large educational publishers, in various ways) with its launch of <a href="http://dynamicbooks.vitalbook.com/">Dynamic Books,</a> a platform that will allow wide-scale annotation, reordering, and other modifications of its textbooks.</p>
<p>Speaking of which, consider the content creation/generation side of the equation. Professionals benefit from, and increasingly rely upon, information that is vastly more current than that provided by books (paper <em>or</em> E). Networking technology has made the exchange of what we will broadly call “updates” massively more accessible. Updates—my placeholder term encompassing all sorts of timely (and often time-sensitive) content—can include blog posts, wiki edits, dynamic news links, peer-reviewed syntheses, and so forth.  Furthermore, connectivity to and between dynamic data sources (in medicine, for example, clinical trials registries, guideline clearinghouses, drug databases) can likewise enrich and propel vastly more integrated, comprehensive, and effective educational experiences.</p>
<p>Now imagine a future in which relevant information around a theme of user interest is dynamically aggregated across previous boundaries (publisher, format, repository, etc.). Wide deployment of digital publishing infrastructure (syntactic and now semantic XML, ontological standards, proliferation of web services standards, registries like DOI, and so on) is in place to begin delivering user experiences of ever-increasing richness, connectivity, efficiency, and dynamism. This is <em>not</em> your father’s Kindle.</p>
<p>(By the way, I am not a wide-eyed arcadian with respect to content sources. Nothing I’ve said should be taken to suggest that standards, pedigree, quality control, peer review, and other editorial values are either passé or automatic. Rigor in organizing and curating the Resource Formerly Known As The Professional Book will continue to be essential, and these qualities will be required to establish a new value proposition on top (not in lieu) of authors&#8217; and publishers’ domain expertise. As seems to always be the case in publishing, it’s more work on top of the old work.)</p>
<p>So, how does a Simple eBook compete with this vision? Not favorably. I’m hard pressed to envision more than a peripheral role for it in the professional information sphere of the future. Now, if we redefine eBook as something more comprehensive… or maybe we should just give it a new name? What are your suggestions?</p>
<table border="1" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td width="399" valign="top">
<h3><span style="color:#333399;">[<span style="text-decoration:underline;">your term here</span>]  (Rich eBook)</span></h3>
</td>
<td width="399" valign="top">
<h3><span style="color:#333399;">Simple eBook</span></h3>
</td>
</tr>
<tr style="padding-left:60px;">
<td width="399" valign="top">Dynamic Content</td>
<td width="399" valign="top">Static Content</td>
</tr>
<tr>
<td width="399" valign="top">Interactive Content</td>
<td width="399" valign="top">Unconnected Content</td>
</tr>
<tr>
<td width="399" valign="top">Multimedia</td>
<td width="399" valign="top">Text, Limited Graphics</td>
</tr>
<tr>
<td width="399" valign="top">Social-media Enabled</td>
<td width="399" valign="top">Isolated to Individual Utility</td>
</tr>
<tr>
<td width="399" valign="top">Continuity (Subscription) Revenue Model</td>
<td width="399" valign="top">Single-sale Revenue Model</td>
</tr>
<tr>
<td width="399" valign="top">Non-proprietary Device &amp; Platform</td>
<td width="399" valign="top">Proprietary Single-use Device &amp; Platform</td>
</tr>
<tr>
<td width="399" valign="top">Customizable</td>
<td width="399" valign="top">Fixed</td>
</tr>
<tr>
<td width="399" valign="top">Addresses Fundamental Educational Process</td>
<td width="399" valign="top">Replaces a Single Book 1-for-1</td>
</tr>
<tr>
<td width="399" valign="top">Lives in the Cloud</td>
<td width="399" valign="top">Stored on Your Device</td>
</tr>
<tr>
<td width="399" valign="top">Transforms Pedagogy</td>
<td width="399" valign="top">Reduces the Weight of Your Backpack</td>
</tr>
</tbody>
</table>
<br />Filed under: <a href='http://blog.silverchair.com/category/business-trends/'>business trends</a>  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/semedica.wordpress.com/378/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/semedica.wordpress.com/378/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/semedica.wordpress.com/378/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/semedica.wordpress.com/378/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/semedica.wordpress.com/378/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/semedica.wordpress.com/378/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/semedica.wordpress.com/378/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/semedica.wordpress.com/378/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/semedica.wordpress.com/378/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/semedica.wordpress.com/378/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.silverchair.com&blog=8554914&post=378&subd=semedica&ref=&feed=1" />]]></content:encoded>
			<wfw:commentRss>http://blog.silverchair.com/2010/03/12/what-is-an-%e2%80%9cebook%e2%80%9d/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/ff6595d89c4313b222ac18f2c7ea0eb7?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">thanek</media:title>
		</media:content>

		<media:content url="http://semedica.files.wordpress.com/2010/03/ebookspectrum.jpg" medium="image">
			<media:title type="html">EbookSpectrum</media:title>
		</media:content>
	</item>
		<item>
		<title>Evaluation of Automated Tagging Solutions</title>
		<link>http://blog.silverchair.com/2010/02/04/evaluation-of-automated-tagging-solutions/</link>
		<comments>http://blog.silverchair.com/2010/02/04/evaluation-of-automated-tagging-solutions/#comments</comments>
		<pubDate>Thu, 04 Feb 2010 14:43:29 +0000</pubDate>
		<dc:creator>Jake Zarnegar</dc:creator>
				<category><![CDATA[classification/tagging]]></category>
		<category><![CDATA[semantic enrichment]]></category>
		<category><![CDATA[automated tagging]]></category>
		<category><![CDATA[Cortex]]></category>
		<category><![CDATA[semantic tagging]]></category>
		<category><![CDATA[Tagmaster]]></category>

		<guid isPermaLink="false">http://blog.silverchair.com/?p=360</guid>
		<description><![CDATA[  As we at Silverchair and Semedica see more and more interest in automated tagging solutions (such as our Tagmaster system), we are more frequently encountering questions about how to evaluate their results. Here are a few ideas on the subject: Evaluation: Humans Required! It is hard to get around the fact that you will need human editors (or professional [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.silverchair.com&blog=8554914&post=360&subd=semedica&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<p style="text-align:center;"> </p>
<p>As we at Silverchair and Semedica see more and more interest in automated tagging solutions (such as our <a href="http://www.semedica.com/tagmaster.aspx" target="_blank">Tagmaster</a> system), we are more frequently encountering questions about how to evaluate their results. Here are a few ideas on the subject:</p>
<h1>Evaluation: Humans Required!</h1>
<p>It is hard to get around the fact that you will need human editors (or professional indexers) and your human technology team (who will use the tags to create interesting new features) to verify that an automated system is working correctly and that the tagging is accurate and useful. </p>
<p>Recently, someone asked our CEO Thane Kerner if we had an automated system to verify the accuracy of our automated tagging. Thane replied (rather cheekily, I must say): “If we had an automated review system that could measure tagging accuracy more precisely than the current tagging system, we wouldn’t use it to verify tags, we’d use it to tag the content to begin with!” The lesson: Once you’ve deployed your best automated system to do the tagging, humans are the next logical reviewers. </p>
<p>Here are four factors your humans should consider in their review:</p>
<div id="attachment_372" class="wp-caption alignright" style="width: 310px"><a href="http://semedica.files.wordpress.com/2010/02/tagmaster_content_page1.gif"><img class="size-medium wp-image-372" title="View inside Semedica's Tagmaster" src="http://semedica.files.wordpress.com/2010/02/tagmaster_content_page1.gif?w=300&#038;h=192" alt="View inside Semedica's Tagmaster" width="300" height="192" /></a><p class="wp-caption-text">View inside Semedica&#39;s Tagmaster, showing tags automatically inserted at the paragraph level</p></div>
<h2>1.  Expert/Editorial Accuracy Confidence</h2>
<p>One key target for evaluation is to assess how much confidence your key stakeholders (journal boards, editors, etc.) express in the output of the system. But confidence is not a linear equation. I posit the following values:</p>
<ul>
<li>Impeccable tag placement: +1</li>
<li>Debatable tag placement: −1</li>
<li>Debatable tag omission: −1</li>
<li>Obvious tag omission: −10</li>
<li>Obvious irrelevant tag placement: −50</li>
</ul>
<p>The first thing you’ll notice is the weight of positive to negative. In high-stakes fields (including science and medicine), humans are naturally biased to more heavily favor negative experiences.  (Of course, this has aided us well in survival: “Don’t eat that type of berry again, it made you sick last time!”) What that means in terms of confidence is that stakeholders will need a<em> disproportionate amount</em> of positive reassurance to get over negative outcomes. And the impact of a particularly egregious negative outcome (resulting from a particularly poorly placed tag) can be devastating to your stakeholder’s impression of a tagging system. (This is why Silverchair’s system defaults to using conservative methods with very little “guessing” to avoid obvious irrelevant tag placement.) </p>
<h2>2.  Usefulness!</h2>
<p>The next key target for evaluation for both editorial and technical stakeholders to assess is <em>usefulness</em> of the tagging applied. Tags should be highly relevant in a domain-specific context and they should drive better discoverability and linking. Primary care, genetics, surgery, and emergency care all take very different approaches to the same topics, and their tagging should reflect their uses. </p>
<p>The tagging system you are evaluating may have added tagged concepts that are tangential or irrelevant to the use model of the content, and such tags would not be capable of driving innovative site features (in many cases, tangential tagging actually <em>inhibits</em> the ability for new systems to work effectively). For example, it is a nice-to-have if your tagging system can recognize place names and person names, but if it misses or miscategorizes important topics like clinical trial names it doesn’t matter how many people or places it can tag. (Clinical trial acronyms can be particularly tricky to tag―<a href="http://blog.silverchair.com/2010/01/26/searches-for-clinical-trials-we-can-do-better/" target="_blank">see our post</a> about them.)</p>
<h2>3.  Granularity</h2>
<p>Does the system still work with “documents” or can it identify topics down to the section/paragraph/figure/table/equation level? At Silverchair we work with many dense medical chapters that may cover more than 200 distinct topics, so we see it as a necessity for our tagging system to break those documents down into smaller parts in order to deliver precise packets of highly relevant information to our users.</p>
<h2>4.  Control and Ongoing Improvement</h2>
<p>Any system selected is not going to be extremely accurate “out-of-the-box.” (I write that as a realist, not as a pessimist!) So during evaluation you must ask, “How easy is it to make impactful positive changes to the system?” This can take a variety of methods—some systems suggest manually selecting training documents for each topic or category (which can get onerous when you have 20,000 topics), some systems allow your software developers to go in and tinker with the code (you have data classification expert software developers, right?!?), and some systems allow you to load and use a taxonomy or thesaurus to aid in topic identification and tagging (assumes a taxonomy/thesaurus exists or can be created for your domain).</p>
<p>At Silverchair, we work primarily in medicine, which is a taxonomy-rich domain with an ever-growing list of topics. For that reason, we’ve chosen the last method as our control and improvement strategy. Our editors update our <a href="http://semedica.com/cortex.aspx" target="_blank">Cortex</a> medical taxonomy and its related thesaurus every day to keep pace with the topics being written about and searched for. </p>
<h1>Summary</h1>
<p>If you choose a system that 1) is accurate enough to instill confidence in your editorial team, 2) is useful enough to drive meaningful new features and improvements, 3) classifies your data at a granular level, and 4) is flexible enough to allow explicit control and ongoing improvements―you’ve made a wise purchase!</p>
<br />Filed under: <a href='http://blog.silverchair.com/category/classificationtagging/'>classification/tagging</a>, <a href='http://blog.silverchair.com/category/semantic-enrichment/'>semantic enrichment</a> Tagged: <a href='http://blog.silverchair.com/tag/automated-tagging/'>automated tagging</a>, <a href='http://blog.silverchair.com/tag/classificationtagging/'>classification/tagging</a>, <a href='http://blog.silverchair.com/tag/cortex/'>Cortex</a>, <a href='http://blog.silverchair.com/tag/semantic-tagging/'>semantic tagging</a>, <a href='http://blog.silverchair.com/tag/tagmaster/'>Tagmaster</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/semedica.wordpress.com/360/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/semedica.wordpress.com/360/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/semedica.wordpress.com/360/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/semedica.wordpress.com/360/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/semedica.wordpress.com/360/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/semedica.wordpress.com/360/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/semedica.wordpress.com/360/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/semedica.wordpress.com/360/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/semedica.wordpress.com/360/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/semedica.wordpress.com/360/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.silverchair.com&blog=8554914&post=360&subd=semedica&ref=&feed=1" />]]></content:encoded>
			<wfw:commentRss>http://blog.silverchair.com/2010/02/04/evaluation-of-automated-tagging-solutions/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/f98c3087939c2c744ccaa4a42b38d3e9?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">Jake Zarnegar</media:title>
		</media:content>

		<media:content url="http://semedica.files.wordpress.com/2010/02/tagmaster_content_page1.gif?w=300" medium="image">
			<media:title type="html">View inside Semedica's Tagmaster</media:title>
		</media:content>
	</item>
		<item>
		<title>Searches for Clinical Trials: We Can Do Better!</title>
		<link>http://blog.silverchair.com/2010/01/26/searches-for-clinical-trials-we-can-do-better/</link>
		<comments>http://blog.silverchair.com/2010/01/26/searches-for-clinical-trials-we-can-do-better/#comments</comments>
		<pubDate>Tue, 26 Jan 2010 04:55:03 +0000</pubDate>
		<dc:creator>Elizabeth Willingham</dc:creator>
				<category><![CDATA[classification/tagging]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[semantic enrichment]]></category>
		<category><![CDATA[taxonomy]]></category>
		<category><![CDATA[Clinical trial]]></category>
		<category><![CDATA[medical terminology]]></category>
		<category><![CDATA[semantic tagging]]></category>

		<guid isPermaLink="false">http://blog.silverchair.com/?p=344</guid>
		<description><![CDATA[Clinical trials are popular targets of searches in medical journals. To deliver accurate search and browse results for them, semantic tagging and a semantic search engine are essential. The names of clinical trials are often long and unwieldy, as they try to describe the focus and mission of the trial in their name—for example, a [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.silverchair.com&blog=8554914&post=344&subd=semedica&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<p>Clinical trials are popular targets of searches in medical journals. To deliver accurate search and browse results for them, semantic tagging and a semantic search engine are essential.</p>
<div class="zemanta-img zemanta-action-dragged" style="display:block;margin:1em;">
<div class="wp-caption alignright" style="width: 250px"><a href="http://commons.wikipedia.org/wiki/Image:Map_of_Florida_highlighting_Jupiter.svg"><img class=" " title="Location of Jupiter in Palm Beach County, Florida" src="http://upload.wikimedia.org/wikipedia/commons/thumb/0/05/Map_of_Florida_highlighting_Jupiter.svg/300px-Map_of_Florida_highlighting_Jupiter.svg.png" alt="Location of Jupiter in Palm Beach County, Florida" width="240" height="190" /></a><p class="wp-caption-text">Location of Jupiter in Palm Beach County, Florida (image via Wikipedia)</p></div>
</div>
<p>The names of clinical trials are often long and unwieldy, as they try to describe the focus and mission of the trial in their name—for example, a clinical trial studying drug treatment of high cholesterol is “Arterial Biology for the Investigation of the Treatment Effects of Reducing Cholesterol 6–HDL and LDL Treatment Strategies.” Because of these long names, trials are more commonly known by their acronyms—in this case, “ARBITER 6–HALTS” trial—and no doubt their full names are being crafted to result in a catchy or apropos—or hopeful—acronym. For example, the acronym for the trial studying the effect of the drug Vytorin on cholesterol levels is “IMPROVE-IT.” (See this <a href="http://" target="_blank">blogpost</a> for some humorous trial names and acronyms.)</p>
<p>One of my pet peeves is the incorrect use of the word “acronym” to mean any abbreviation for a term. Actually an abbreviation is also an acronym <em>only</em> when the abbreviation spells a word or is a combination of letters that people can pronounce as a word. So yes—abbreviations of clinical trials are acronyms, and ah, there’s the rub for commonly used full-text <em>nonsemantic</em> search engines. A full-text search engine treats them like any other word.</p>
<p>So yikes—a PubMed search for “JUPITER” (the acronym for the trial “Justification for the Use of Statins in Prevention: an Intervention Trial Evaluating Rosuvastatin”) delivers the first two results correctly, but the third result appears because the name of the institution that issued the paper is in Jupiter, Florida! OK so yes—the PubMed search box tries to help you by suggesting “Jupiter trial” (98 results) … but it also suggests “Jupiter study” (257 results). People—the JUPITER trial and the JUPITER study are exactly the same thing to any searcher wanting to know about JUPITER. The number of results should be the same for both searches. And nobody searching PubMed for JUPITER wants to know more about Jupiter, Florida. Trust me.</p>
<p>We can do better. At Silverchair, our Cortex taxonomy contains a list of clinical trials and the accompanying thesaurus includes their acronyms, so when our tagging and retrieval systems encounter those concepts, we’re able to separate them from their normal English language counterparts and tag them correctly.  Yet another benefit of an automated tagging system supported by a robust and up-to-date medical thesaurus. It understands medical information and the health care professionals who depend on it so that we can give them <em>results</em>, not guesses.</p>
<div class="zemanta-pixie" style="margin-top:10px;height:15px;"><a class="zemanta-pixie-a" title="Reblog this post [with Zemanta]" href="http://reblog.zemanta.com/zemified/26c0649c-4bea-4e59-9ef3-3b5cdd916f81/"><img class="zemanta-pixie-img" style="border:medium none;float:right;" src="http://img.zemanta.com/reblog_e.png?x-id=26c0649c-4bea-4e59-9ef3-3b5cdd916f81" alt="Reblog this post [with Zemanta]" /></a></div>
<br />Posted in classification/tagging, search, semantic enrichment, taxonomy Tagged: classification/tagging, Clinical trial, medical terminology, search, semantic tagging, taxonomy <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/semedica.wordpress.com/344/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/semedica.wordpress.com/344/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/semedica.wordpress.com/344/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/semedica.wordpress.com/344/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/semedica.wordpress.com/344/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/semedica.wordpress.com/344/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/semedica.wordpress.com/344/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/semedica.wordpress.com/344/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/semedica.wordpress.com/344/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/semedica.wordpress.com/344/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.silverchair.com&blog=8554914&post=344&subd=semedica&ref=&feed=1" />]]></content:encoded>
			<wfw:commentRss>http://blog.silverchair.com/2010/01/26/searches-for-clinical-trials-we-can-do-better/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/e71ef4b47e6ba5c898bdeabbb8e47d6e?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">Elizabeth Willingham</media:title>
		</media:content>

		<media:content url="http://upload.wikimedia.org/wikipedia/commons/thumb/0/05/Map_of_Florida_highlighting_Jupiter.svg/300px-Map_of_Florida_highlighting_Jupiter.svg.png" medium="image">
			<media:title type="html">Location of Jupiter in Palm Beach County, Florida</media:title>
		</media:content>

		<media:content url="http://img.zemanta.com/reblog_e.png?x-id=26c0649c-4bea-4e59-9ef3-3b5cdd916f81" medium="image">
			<media:title type="html">Reblog this post [with Zemanta]</media:title>
		</media:content>
	</item>
		<item>
		<title>Internal Memory vs. External Memory</title>
		<link>http://blog.silverchair.com/2009/12/07/internal-memory-vs-external-memory/</link>
		<comments>http://blog.silverchair.com/2009/12/07/internal-memory-vs-external-memory/#comments</comments>
		<pubDate>Mon, 07 Dec 2009 19:39:09 +0000</pubDate>
		<dc:creator>Jake Zarnegar</dc:creator>
				<category><![CDATA[classification/tagging]]></category>
		<category><![CDATA[semantic enrichment]]></category>
		<category><![CDATA[computer memory]]></category>
		<category><![CDATA[memory]]></category>

		<guid isPermaLink="false">http://blog.silverchair.com/?p=330</guid>
		<description><![CDATA[As we were setting up a new external SAN (storage area network) on the Silverchair production web farm recently, the network engineer said something that caught my attention: “The web servers will be able to use the external SAN drives faster than their own internal memory.” At first that defied my expectations of “internal vs. [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.silverchair.com&blog=8554914&post=330&subd=semedica&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<p>As we were setting up a new external SAN (storage area network) on the Silverchair production web farm recently, the network engineer said something that caught my attention: “The web servers will be able to use the external SAN drives <strong><em>faster</em></strong> than their own internal memory.” At first that defied my expectations of “internal vs. external,” but when I thought about more, it made perfect sense.</p>
<p>The web servers are designed to execute application logic, store session tracking data, handle user interaction input, and synthesize, parse, and display data from a variety of sources—they are logic processing engines that handle data storage only when necessary. On the other hand, the SAN has one purpose—to store a large amount of data and enable a super-efficient data delivery channel that rapidly responds to content requests from the web servers.</p>
<p>The more I thought about it, the more I realized it was a fitting metaphor for how humans work. We are fantastic logic processing engines. We parse, synthesize, analyze, and use data input from a variety of sources to perform creative problem solving. And most importantly to this metaphor, we only store data internally when absolutely necessary. In the present day, the comprehensiveness and ubiquity of the Internet have allowed us to store an unprecedented amount of collective memory in external sources and access it from wherever we may be.</p>
<p>To be clear, human use of external memory did not arrive with the Internet—it has been around since the beginning of civilization. We are used to storing memory in external sources and freeing up our internal resources. Papyrus eliminated the need to memorize long epic poems. Abaci eliminated the need to memorize multiplication tables. (<em>NB</em>: Don’t try telling that to a 2nd grade teacher.) In modern medicine, drug handbooks store dosage and safety information that is too complex for doctors to memorize <em>in</em> <em>toto</em>. Phone numbers stored in our mobile phones eliminate the need to memorize the phone numbers of friends. We even store memories in our friends and family—I recently asked my wife, “What was the name of that hotel we liked in Chicago?” She knew, and voila, I had accessed my external memory successfully.</p>
<p>Alas, my comparison of human activity to Silverchair’s web farm breaks down at a key point. In many cases, accessing our external memory is <em>not</em> fast and efficient. Currently the external memory sources of humans are not deployed as efficiently as a SAN. Internet content sources can be hard to access, store content in highly variable forms, require a special vocabulary or technique to query, and return data in a way that does not suit our purpose.</p>
<p>This is the fundamental problem that Silverchair’s Semedica division addresses with semantic enrichment of data sources. We’re organizing a specific external memory category (in our case, online medical and health care information) in a way that allows it to be accessed more quickly and to return data in the right form for efficient use by clinicians and researchers. The less data that health care workers need to store internally, the more of their “processing time” can be used toward envisioning creative solutions for preventing and curing diseases. That is something that the Internet cannot do. (Yet.)</p>
<div class="zemanta-pixie" style="margin-top:10px;height:15px;"><a class="zemanta-pixie-a" title="Reblog this post [with Zemanta]" href="http://reblog.zemanta.com/zemified/f9513dac-6816-4e9e-8fdf-f32ea02d43aa/"><img class="zemanta-pixie-img" style="border:medium none;float:right;" src="http://img.zemanta.com/reblog_e.png?x-id=f9513dac-6816-4e9e-8fdf-f32ea02d43aa" alt="Reblog this post [with Zemanta]" /></a></div>
<br />Posted in classification/tagging, semantic enrichment Tagged: classification/tagging, computer memory, memory <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/semedica.wordpress.com/330/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/semedica.wordpress.com/330/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/semedica.wordpress.com/330/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/semedica.wordpress.com/330/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/semedica.wordpress.com/330/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/semedica.wordpress.com/330/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/semedica.wordpress.com/330/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/semedica.wordpress.com/330/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/semedica.wordpress.com/330/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/semedica.wordpress.com/330/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.silverchair.com&blog=8554914&post=330&subd=semedica&ref=&feed=1" />]]></content:encoded>
			<wfw:commentRss>http://blog.silverchair.com/2009/12/07/internal-memory-vs-external-memory/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/f98c3087939c2c744ccaa4a42b38d3e9?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">Jake Zarnegar</media:title>
		</media:content>

		<media:content url="http://img.zemanta.com/reblog_e.png?x-id=f9513dac-6816-4e9e-8fdf-f32ea02d43aa" medium="image">
			<media:title type="html">Reblog this post [with Zemanta]</media:title>
		</media:content>
	</item>
		<item>
		<title>Usefulness Trumps Fun for Search Autosuggest</title>
		<link>http://blog.silverchair.com/2009/11/20/usefulness-trumps-fun-for-search-autosuggest/</link>
		<comments>http://blog.silverchair.com/2009/11/20/usefulness-trumps-fun-for-search-autosuggest/#comments</comments>
		<pubDate>Fri, 20 Nov 2009 16:59:06 +0000</pubDate>
		<dc:creator>Pam Harley</dc:creator>
				<category><![CDATA[search]]></category>
		<category><![CDATA[semantic enrichment]]></category>
		<category><![CDATA[taxonomy]]></category>
		<category><![CDATA[autosuggest]]></category>
		<category><![CDATA[Google Suggest]]></category>
		<category><![CDATA[semantic tagging]]></category>
		<category><![CDATA[Slate]]></category>

		<guid isPermaLink="false">http://blog.silverchair.com/?p=277</guid>
		<description><![CDATA[Many thanks to colleague Jake Zarnegar for pointing me toward Slate columnist Michael Agger’s Google Suggest contest. I’m sure you’ve experienced Google Suggest in action: as you type into the search box, Google offers suggestions that change dynamically as you type each letter of your query. The suggestions are sometimes spookily on target but many [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.silverchair.com&blog=8554914&post=277&subd=semedica&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<p>Many thanks to colleague <a href="http://blog.silverchair.com/author/jakezarnegar/" target="_blank">Jake Zarnegar</a> for pointing me toward <em>Slate</em> columnist Michael Agger’s <a href="http://www.slate.com/id/2234019/" target="_blank">Google Suggest contest</a>.<em> </em></p>
<p><em> </em></p>
<p>I’m sure you’ve experienced Google Suggest in action: as you type into the search box, Google offers suggestions that change dynamically as you type each letter of your query. The suggestions are sometimes spookily on target but many times flat-out inappropriate.</p>
<p>You’ll find many examples of Google Suggest inappropriateness documented online. But the <em>Slate</em> contest took a different angle, challenging readers to explore the different suggestions made in response to a “less intelligent” Google query versus a “more intelligent” one.” <a href="http://www.slate.com/id/2234738/" target="_blank">The winners are in</a>:</p>
<blockquote><p>The winning entry … follows Google Suggest into the realm of moral inquiry. It doesn’t neatly divide into “less intelligent” and “more intelligent,” but it’s the best example I received of how one word can make all the difference. [Is it <em>wrong</em> to…] involves love affairs, God, and younger men. [Is it <em>ethical</em> to…] puts us on the plane of animal research, privacy concerns, and cooking the books.</p></blockquote>
<p>Putting aside the entertainment and cultural value of Google Suggest, how does it work? Like most things Google, <a href="http://labs.google.com/intl/en/suggestfaq.html" target="_blank">those details</a> are vague:</p>
<blockquote><p>Our algorithms use a wide range of information to predict the queries users are most likely to want to see. For example, Google Suggest uses data about the overall popularity of various searches to help rank the refinements it offers.</p></blockquote>
<p>On the Silverchair SCM web content management platform, we also use autosuggest to aid searchers. But there’s no mystery about how it works. Once three characters have been typed into the search box, our search engine starts matching the query against the index of semantic tags that have been applied to that specific content set from our Cortex biomedical taxonomy. Suggestions become more precise with each query character typed, and because we are matching against only those semantic tags applied to the content, the search results set is always targeted and relevant. Our search engine also checks each query against a database of taxonomy equivalents—synonyms, abbreviations, jargon—to normalize the search query and expand it to cover all possible matches.</p>
<p style="text-align:center;">
<div id="attachment_320" class="wp-caption aligncenter" style="width: 478px"><img class="size-full wp-image-320" title="Silverchair search autosuggest" src="http://semedica.files.wordpress.com/2009/11/autosuggest6.gif?w=468&#038;h=262" alt="Silverchair search autosuggest" width="468" height="262" /><p class="wp-caption-text">Silverchair search autosuggest</p></div>
<p>Because the content in the products Silverchair builds is tagged so granularly, we can often suggest a more precise term than many searchers start with. Our goals for autosuggest are to save time for users, speed them to the most relevant possible query, and return the most precise answer to their question. Try autosuggest for yourself on the <a href="http://www.accesssurgery.com/index.aspx" target="_blank">AccessSurgery</a> site we built for McGraw-Hill. Search autosuggest is just one of the many ways a robust taxonomy can promote content discovery.</p>
<p>(I wonder how many of you are running off to play with Google Suggest—a perfect Friday afternoon time-waster…)</p>
<p style="text-align:center;">
<div id="attachment_324" class="wp-caption aligncenter" style="width: 478px"><img class="size-full wp-image-324" title="Google Suggest" src="http://semedica.files.wordpress.com/2009/11/googlesuggest8.gif?w=468&#038;h=341" alt="Google Suggest" width="468" height="341" /><p class="wp-caption-text">Google Suggest</p></div>
<p><a class="zemanta-pixie-a" title="Reblog this post [with Zemanta]" href="http://reblog.zemanta.com/zemified/f076297e-8c1b-400f-9c4f-3aa54558c3d6/"><img class="zemanta-pixie-img" style="border:medium none;float:right;" src="http://img.zemanta.com/reblog_e.png?x-id=f076297e-8c1b-400f-9c4f-3aa54558c3d6" alt="Reblog this post [with Zemanta]" /></a></p>
<br />Posted in search, semantic enrichment, taxonomy Tagged: autosuggest, Google Suggest, search, semantic tagging, Slate <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/semedica.wordpress.com/277/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/semedica.wordpress.com/277/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/semedica.wordpress.com/277/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/semedica.wordpress.com/277/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/semedica.wordpress.com/277/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/semedica.wordpress.com/277/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/semedica.wordpress.com/277/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/semedica.wordpress.com/277/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/semedica.wordpress.com/277/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/semedica.wordpress.com/277/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.silverchair.com&blog=8554914&post=277&subd=semedica&ref=&feed=1" />]]></content:encoded>
			<wfw:commentRss>http://blog.silverchair.com/2009/11/20/usefulness-trumps-fun-for-search-autosuggest/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/d4feb71f4be6fa21b7184ec2bfbc1b83?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">Pam Harley</media:title>
		</media:content>

		<media:content url="http://semedica.files.wordpress.com/2009/11/autosuggest6.gif" medium="image">
			<media:title type="html">Silverchair search autosuggest</media:title>
		</media:content>

		<media:content url="http://semedica.files.wordpress.com/2009/11/googlesuggest8.gif" medium="image">
			<media:title type="html">Google Suggest</media:title>
		</media:content>

		<media:content url="http://img.zemanta.com/reblog_e.png?x-id=f076297e-8c1b-400f-9c4f-3aa54558c3d6" medium="image">
			<media:title type="html">Reblog this post [with Zemanta]</media:title>
		</media:content>
	</item>
		<item>
		<title>NIH Makes Big Strides Toward Funding Clarity, But Still Could Be Better!</title>
		<link>http://blog.silverchair.com/2009/11/06/nih-makes-big-strides-toward-funding-clarity-but-still-could-be-better/</link>
		<comments>http://blog.silverchair.com/2009/11/06/nih-makes-big-strides-toward-funding-clarity-but-still-could-be-better/#comments</comments>
		<pubDate>Fri, 06 Nov 2009 15:33:18 +0000</pubDate>
		<dc:creator>Jake Zarnegar</dc:creator>
				<category><![CDATA[classification/tagging]]></category>
		<category><![CDATA[semantic enrichment]]></category>
		<category><![CDATA[taxonomy]]></category>
		<category><![CDATA[Agency for Healthcare Research and Quality (AHRQ)]]></category>
		<category><![CDATA[Grant funding]]></category>
		<category><![CDATA[National Institutes of Health (NIH)]]></category>
		<category><![CDATA[RePORT (Research Portfolio Online Reporting Tool)]]></category>

		<guid isPermaLink="false">http://blog.silverchair.com/?p=253</guid>
		<description><![CDATA[The NIH has rolled out their new RePORT (Research Portfolio Online Reporting Tool) web site for information on funding, grants, and NIH research. As someone who works on government grants and contracts, I’m happy with this new level of transparency and clarity as to what topics (and who!) is being funded. It is a big [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.silverchair.com&blog=8554914&post=253&subd=semedica&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div id="attachment_257" class="wp-caption alignright" style="width: 190px"><a href="http://en.wikipedia.org/wiki/Apples_and_oranges"><img class="size-full wp-image-257" title="Apples_to_Oranges" src="http://semedica.files.wordpress.com/2009/11/apples_to_oranges.jpg?w=180&#038;h=124" alt="Apples to oranges comparison" width="180" height="124" /></a><p class="wp-caption-text">Image via Wikipedia</p></div>
<p>The NIH has rolled out their new <a href="http://report.nih.gov/" target="_blank">RePORT (Research Portfolio Online Reporting Tool) web site</a> for information on funding, grants, and NIH research. As someone who works on government grants and contracts, I’m happy with this new level of transparency and clarity as to what topics (and who!) is being funded. It is a big upgrade from the incumbent system, which was hard to navigate and understand.</p>
<p>The most useful area of the site to me is the <a href="http://report.nih.gov/rcdc/categories/" target="_blank">categorical spending section</a>. It really gives you an idea of NIH’s funding priorities—it offers over 200 categories of funding.</p>
<p>However, it still has ample room for improvement. Currently it is an alphabetical list that contains items that are hard to compare. Here are some example categories that are not equivalent in scope:</p>
<ul>
<li>Allergic Rhinitis (Hay Fever)</li>
<li>American Indians / Alaska Natives</li>
<li>Burden of Illness</li>
<li>Cancer</li>
<li>Cardiovascular</li>
<li>Clinical Trials</li>
<li>Conditions Affecting Unborn Children</li>
<li>Gene Therapy</li>
<li>Gene Therapy Clinical Trials</li>
<li>Genetic Testing</li>
<li>Genetics</li>
</ul>
<p>Some are very specific (hay fever), some are broad (cancer), some are ambiguous (cardiovascular), some take a completely different approach than the dominant disease/condition approach (American Indians/Alaska Natives), and some seem to be repetitive.</p>
<p>With a bit of work, this information could be turned from its current flat list expression into a multilevel taxonomy that allows users to slice it up in the ways that appeal to them (conditions or target populations, for example). Silverchair does this for the Agency for Healthcare Research and Quality on their <a href="http://psnet.ahrq.gov/" target="_blank">PSNet</a> patient safety clearinghouse. A small amount of classification work can go a long way in creating valuable new features—NIH has proven that with their RePORT upgrade, but I’d like to see them go farther.</p>
<p>I’d be happy to help out with the NIH site, but I’m not sure what category that would be funded under…</p>
<div class="zemanta-pixie" style="margin-top:10px;height:15px;"><a class="zemanta-pixie-a" title="Reblog this post [with Zemanta]" href="http://reblog.zemanta.com/zemified/11b89d62-1cff-4672-917b-e96703a67171/"><img class="zemanta-pixie-img" style="border:medium none;float:right;" src="http://img.zemanta.com/reblog_e.png?x-id=11b89d62-1cff-4672-917b-e96703a67171" alt="Reblog this post [with Zemanta]" /></a></div>
<br />Posted in classification/tagging, semantic enrichment, taxonomy Tagged: Agency for Healthcare Research and Quality (AHRQ), classification/tagging, Grant funding, National Institutes of Health (NIH), RePORT (Research Portfolio Online Reporting Tool), taxonomy <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/semedica.wordpress.com/253/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/semedica.wordpress.com/253/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/semedica.wordpress.com/253/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/semedica.wordpress.com/253/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/semedica.wordpress.com/253/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/semedica.wordpress.com/253/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/semedica.wordpress.com/253/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/semedica.wordpress.com/253/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/semedica.wordpress.com/253/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/semedica.wordpress.com/253/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.silverchair.com&blog=8554914&post=253&subd=semedica&ref=&feed=1" />]]></content:encoded>
			<wfw:commentRss>http://blog.silverchair.com/2009/11/06/nih-makes-big-strides-toward-funding-clarity-but-still-could-be-better/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/f98c3087939c2c744ccaa4a42b38d3e9?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">Jake Zarnegar</media:title>
		</media:content>

		<media:content url="http://semedica.files.wordpress.com/2009/11/apples_to_oranges.jpg" medium="image">
			<media:title type="html">Apples_to_Oranges</media:title>
		</media:content>

		<media:content url="http://img.zemanta.com/reblog_e.png?x-id=11b89d62-1cff-4672-917b-e96703a67171" medium="image">
			<media:title type="html">Reblog this post [with Zemanta]</media:title>
		</media:content>
	</item>
		<item>
		<title>Quantifying the Obvious</title>
		<link>http://blog.silverchair.com/2009/10/16/quantifying-the-obvious/</link>
		<comments>http://blog.silverchair.com/2009/10/16/quantifying-the-obvious/#comments</comments>
		<pubDate>Fri, 16 Oct 2009 16:35:30 +0000</pubDate>
		<dc:creator>Thane Kerner</dc:creator>
				<category><![CDATA[business trends]]></category>
		<category><![CDATA[Association of College and Research Libraries (ACRL)]]></category>
		<category><![CDATA[Association of Research Libraries (ARL)]]></category>
		<category><![CDATA[electronic resources spending]]></category>
		<category><![CDATA[library spending]]></category>
		<category><![CDATA[STM publishing industry]]></category>

		<guid isPermaLink="false">http://blog.silverchair.com/?p=239</guid>
		<description><![CDATA[I still encounter, with alarming frequency, STM publishing executives who have not yet grasped the implications of the digital revolution. There’s a sensibility that pigeonholes the web as yet another ancillary delivery tool—kinda like CD-ROMs. I am amazed to hear people in positions of serious authority (for now) declare their intentions to move incrementally to [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.silverchair.com&blog=8554914&post=239&subd=semedica&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<p>I still encounter, with alarming frequency, STM publishing executives who have not yet grasped the implications of the digital revolution. There’s a sensibility that pigeonholes the web as yet another ancillary delivery tool—kinda like CD-ROMs. I am amazed to hear people in positions of serious authority (for now) declare their intentions to move incrementally to develop online businesses. “Let’s take it step by step and see what evolves,” I was recently told by a senior figure at a major scientific society.</p>
<p>Sorry, but it already <em>has</em> evolved, at least from a macro revenue perspective. If you are not persuaded merely by walking into a library at an institution of higher learning that education and research have gone digital, allow me to finish the job by providing some definitive evidence. As these data show, this is not in any way a peripheral phenomenon. It is a fundamental transformation of our industry. These data were culled from statistical reports created by the <a href="http://www.arl.org/stats/annualsurveys/arlstats/" target="_blank">Association of Research Libraries (ARL)</a> and the <a href="http://www.ala.org/ala/mgrps/divs/acrl/publications/trends/index.cfm" target="_blank">Association of College and Research Libraries (ACRL)</a>, and they vividly illustrate the trends that are sweeping the institutional customer base.</p>
<p>The first three tables show the trajectory of expenditures made for print materials and electronic materials in these libraries. (Obviously, in categories of this breadth, these proportions have an inverse relationship.)</p>
<p><img class="alignnone size-full wp-image-240" title="LibraryStats1" src="http://semedica.files.wordpress.com/2009/10/librarystats1.jpg?w=468&#038;h=349" alt="LibraryStats1" width="468" height="349" /></p>
<p>Among all libraries surveyed, the proportions equalized in 2008; almost certainly, spending for electronic resources will outstrip print in 2009.</p>
<p>When the list is narrowed to institutions that offer doctoral programs, there is a significant shift toward electronic resources.</p>
<p><img class="alignnone size-full wp-image-242" title="LibraryStats2" src="http://semedica.files.wordpress.com/2009/10/librarystats2.jpg?w=468&#038;h=335" alt="LibraryStats2" width="468" height="335" /></p>
<p>And finally, when we narrow it further, to focus exclusively on health science libraries, the contrast is even more stark: by 2007 (the most recent figures available), collection spending was already 2/3 electronic. (What must it be as I write, 2 years on?)</p>
<p><img class="alignnone size-full wp-image-243" title="LibraryStats3" src="http://semedica.files.wordpress.com/2009/10/librarystats3.jpg?w=468&#038;h=335" alt="LibraryStats3" width="468" height="335" /></p>
<p>It is common knowledge that serials have been trending electronic since the turn of the last decade. But now, materials of ALL kinds are moving to electronic delivery. Take a look, for example, at the trends specifically for electronic books.</p>
<p><img class="alignnone size-full wp-image-241" title="LibraryStats4" src="http://semedica.files.wordpress.com/2009/10/librarystats4.jpg?w=468&#038;h=349" alt="LibraryStats4" width="468" height="349" /></p>
<p>For many of us, this is not news. But the appropriate response is not to treat electronic platforms as peripheral or ancillary. This is obviously the future of the STM business.</p>
<br />Posted in business trends Tagged: Association of College and Research Libraries (ACRL), Association of Research Libraries (ARL), electronic resources spending, library spending, STM publishing industry <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/semedica.wordpress.com/239/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/semedica.wordpress.com/239/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/semedica.wordpress.com/239/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/semedica.wordpress.com/239/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/semedica.wordpress.com/239/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/semedica.wordpress.com/239/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/semedica.wordpress.com/239/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/semedica.wordpress.com/239/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/semedica.wordpress.com/239/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/semedica.wordpress.com/239/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.silverchair.com&blog=8554914&post=239&subd=semedica&ref=&feed=1" />]]></content:encoded>
			<wfw:commentRss>http://blog.silverchair.com/2009/10/16/quantifying-the-obvious/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/ff6595d89c4313b222ac18f2c7ea0eb7?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">thanek</media:title>
		</media:content>

		<media:content url="http://semedica.files.wordpress.com/2009/10/librarystats1.jpg" medium="image">
			<media:title type="html">LibraryStats1</media:title>
		</media:content>

		<media:content url="http://semedica.files.wordpress.com/2009/10/librarystats2.jpg" medium="image">
			<media:title type="html">LibraryStats2</media:title>
		</media:content>

		<media:content url="http://semedica.files.wordpress.com/2009/10/librarystats3.jpg" medium="image">
			<media:title type="html">LibraryStats3</media:title>
		</media:content>

		<media:content url="http://semedica.files.wordpress.com/2009/10/librarystats4.jpg" medium="image">
			<media:title type="html">LibraryStats4</media:title>
		</media:content>
	</item>
	</channel>
</rss>