<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>MarBlog</title>
	<atom:link href="http://sci.overchord.net/feed/" rel="self" type="application/rss+xml" />
	<link>http://sci.overchord.net</link>
	<description>Marine Science and Data Management</description>
	<lastBuildDate>Sun, 06 Nov 2011 20:32:56 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>MEDIN &#8211; A way forward for UK Marine Science</title>
		<link>http://sci.overchord.net/2011/11/medin-a-way-forward-for-uk-marine-science/</link>
		<comments>http://sci.overchord.net/2011/11/medin-a-way-forward-for-uk-marine-science/#comments</comments>
		<pubDate>Sun, 06 Nov 2011 20:32:56 +0000</pubDate>
		<dc:creator>Jens</dc:creator>
				<category><![CDATA[Data Management]]></category>
		<category><![CDATA[DAC]]></category>
		<category><![CDATA[Data Archive Centres]]></category>
		<category><![CDATA[Data portal]]></category>
		<category><![CDATA[discovery services]]></category>
		<category><![CDATA[download services]]></category>
		<category><![CDATA[INSPIRE]]></category>
		<category><![CDATA[MEDIN]]></category>
		<category><![CDATA[Publishing data]]></category>
		<category><![CDATA[view services]]></category>

		<guid isPermaLink="false">http://sci.overchord.net/?p=116</guid>
		<description><![CDATA[The Marine Environmental Data &#38; Information Network has been on the scene for quite a few years now. I have been involved through my work, and attended the recent open partnership meeting, and it struck me that the recent move into an &#8220;operational phase&#8221; seems to have kickstarted a great leap forward. There were many [...]]]></description>
			<content:encoded><![CDATA[<p>The <a href="http://http://www.oceannet.org">Marine Environmental Data &amp; Information Network</a> has been on the scene for quite a few years now.</p>
<p>I have been involved through my work, and attended the recent open partnership meeting, and it struck me that the recent move into an &#8220;operational phase&#8221; seems to have kickstarted a great leap forward. There were many more organisations present at the partnership meeting than I think has previously been there. Crucially, there is a mix of public bodies (including NDPB&#8217;s) and private companies now, and this is important as MEDIN is aiming to embed itself early into contractual agreements for the collection of Marine Data.</p>
<p>By embedding itself, I mean that MEDIN supplied a number of functions and standards that are useful for partners to include in data collection contracts. By including these, the contractor takes on an obligation to include metadata in relevant format, and to stick to standards for certain types of data.</p>
<p>The process for including and utilising MEDIN is that there are several Data Archive Centres (DAC&#8217;s) which store data long term and make it available, initially through MEDIN&#8217;s <a href="http://http://portal.oceannet.org/search/full">discovery portal</a>, but also with view and download services planned for many areas. While there may be costs associated with committing large data sets to an archive centre, there are actually a number of great benefits, which I think are not fully realised by partners or data suppliers:</p>
<ul>
<li>Data are stored securely and handled by experts.</li>
<li>Data are made public (if so agreed)</li>
<li>The partner does not incur the costs of storing, backing up and maintaining Master Data Management aspects of these data sets</li>
<li>Partners bandwidth costs are not escalated by starting to publish data.</li>
<li>Data becomes part of a UK wide network of reporting data</li>
</ul>
<div>Each of the Data Archive Centres are accredited, and as such are secure places to put your data in terms of continued funding, secure handling and storage (it&#8217;s what they do for a living in most cases).</div>
<div>At the partnership meeting in London, there was a good debate about both partners who had great success with committing data to MEDIN, but also an open and frank discussion about a general reluctance towards submitting data sets. Some of the reasons discussed was that adding the MEDIN criteria to contracts would push up cost or limit the amount of data that could be collected at same cost. Further, submitting already held data was generally something people wnated to do, but found to have very little time towards, and little or no chance for spending money on getting the data curated with an external body (in this case a Data Archive Centre).</div>
<div>Such concerns are fully understandable, but is also a cause for concern. For one, it quite clearly demonstrates that many organisations  have difficulty in putting a maintenance costs on their data sets,  and that perhaps there sometimes is a sacrifice of quality over quantity of data.  Secondly, there are a number of costs not yet accounted for in holding your own data. In particular Marine Environmental data that will fall under the European INSPIRE Directive will be required to be made available through Discovery, View, Download and Transformation services. If each partner or data provider is to establish, run and maintain such services, it will be significantly more expensive than using the service offered by DAC&#8217;s.</div>
<div>So while MEDIN is certainly gathering pace, and now published more than 2,000 metadata records on their portal, with more coming in, and more work planned, there are certainly also challenges ahead. Not the least will be to create a more transparent process (and possibly even cost) for data providers to commit their (often multidisciplinary) data into (often discipline specific) DAC&#8217;s.</div>
<div>The time ahead will be exciting. There are clear indication that more organisations are sitting up and paying attention to the work done by MEDIN, and I think the next couple of years will be crucial to determine if the processes and standards can be made to work in a way that is streamlined enough to really encourage partners to get data up to those Archive Centres and to get them published so all will benefit.</div>
]]></content:encoded>
			<wfw:commentRss>http://sci.overchord.net/2011/11/medin-a-way-forward-for-uk-marine-science/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Did increased computing power break data management?</title>
		<link>http://sci.overchord.net/2011/09/did-increased-computing-power-break-data-management/</link>
		<comments>http://sci.overchord.net/2011/09/did-increased-computing-power-break-data-management/#comments</comments>
		<pubDate>Mon, 19 Sep 2011 12:46:00 +0000</pubDate>
		<dc:creator>Jens</dc:creator>
				<category><![CDATA[Data Management]]></category>

		<guid isPermaLink="false">http://sci.overchord.net/?p=106</guid>
		<description><![CDATA[With the risk of the title already having put you off, and labelling this post as lamenting progress over the &#8220;good old days&#8221;, I am simply offering a few observations. As part of a data resuce project, I&#8217;m currently reading the documentation for a plankton database system from 1994. The documentation pretty much covers everything [...]]]></description>
			<content:encoded><![CDATA[<p>With the risk of the title already having put you off, and labelling this post as lamenting progress over the &#8220;good old days&#8221;, I am simply offering a few observations.</p>
<p>As part of a data resuce project, I&#8217;m currently reading the documentation for a plankton database system from 1994. The documentation pretty much covers everything you would expect to find in a more modern documentation, e.g. a data model, work flows, methods and background code.  It was designed to run on an old VAX system, and of course there are some technical limitations that I am glad that we are over.</p>
<p>There are very clear limits on what could be put into tables, and the number of tables that were created. This is obviously a result of having to be careful about referencing. E.g. you did not reference long species names when an identifier was available as it would  slow your system to a crawl</p>
<p>However, looking at how very neatly data are structured in this old system, and looking to some of the more &#8220;modern&#8221; systems, it is actually far easier to re-extract and structure this data. All of the referential structure is intact even though the software itself is long gone, and I am left with nothing but the raw data. Looking at the raw data dump of many modern databases leaves something to be desired (IMO). The strict control of field formatting often goes out the window, and the &#8220;logic&#8221; of a system is frequently moved almost entirely to the front end, meaning that the DB itself sometimes doesn&#8217;t even contain all the referential information required.</p>
<p>Obviously you get what you pay for, and this is far from the case in many modern systems, but with the increased speed and power of computers, it has become ever so easier to throw clock cycles at a workflow problem rather than going back to the root causes.</p>
<p>Given the increase in computing power, I&#8217;m not sure if we have seen an equivalent increase in performance of databases. Granted we can store much, much more in there, and in a much wider range of formats, but does it encourage the cutting of corners rather than improvement of data models?</p>
<p>I&#8217;m happy to be proved wrong, and I suspect there isn&#8217;t a right or wrong, but there is certainly an observation that when you are forced to be economical with your clock cycles, there seems to be a higher attention to your data model.</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://sci.overchord.net/2011/09/did-increased-computing-power-break-data-management/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>And now for something completely different &#8211; Timelapse of Earth</title>
		<link>http://sci.overchord.net/2011/09/and-now-for-something-completely-different-timelapse-of-earth/</link>
		<comments>http://sci.overchord.net/2011/09/and-now-for-something-completely-different-timelapse-of-earth/#comments</comments>
		<pubDate>Mon, 19 Sep 2011 12:31:26 +0000</pubDate>
		<dc:creator>Jens</dc:creator>
				<category><![CDATA[General]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Earth]]></category>
		<category><![CDATA[ISS]]></category>
		<category><![CDATA[Space]]></category>

		<guid isPermaLink="false">http://sci.overchord.net/?p=109</guid>
		<description><![CDATA[This post is way off the leftfield of the normal topics on this blog, but this timelapse video of earth as seen from the International Space station was so impressive I just had to share it! &#160; &#160;]]></description>
			<content:encoded><![CDATA[<p>This post is way off the leftfield of the normal topics on this blog, but this timelapse video of earth as seen from the International Space station was so impressive I just had to share it!</p>
<p>&nbsp;</p>
<p><object width="500" height="375"><param name="movie" value="http://www.youtube.com/v/74mhQyuyELQ?version=3"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/74mhQyuyELQ?version=3" type="application/x-shockwave-flash" width="500" height="375" allowscriptaccess="always" allowfullscreen="true"></embed></object></p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://sci.overchord.net/2011/09/and-now-for-something-completely-different-timelapse-of-earth/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>XML, Metadata, and setting up for harvest</title>
		<link>http://sci.overchord.net/2011/07/xml-metadata-and-setting-up-for-harvest/</link>
		<comments>http://sci.overchord.net/2011/07/xml-metadata-and-setting-up-for-harvest/#comments</comments>
		<pubDate>Fri, 29 Jul 2011 13:19:30 +0000</pubDate>
		<dc:creator>Jens</dc:creator>
				<category><![CDATA[Data Management]]></category>
		<category><![CDATA[Online Data Sources]]></category>
		<category><![CDATA[eGMS]]></category>
		<category><![CDATA[GEMINI]]></category>
		<category><![CDATA[GeoNetwork]]></category>
		<category><![CDATA[MEDIN]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[OAI]]></category>
		<category><![CDATA[sharing]]></category>

		<guid isPermaLink="false">http://sci.overchord.net/?p=102</guid>
		<description><![CDATA[In rural Scotland, it&#8217;s the time where harvesters starts being a common image on the fields. But I am setting up for a different kind of harvest &#8211; the harvesting of my organisation&#8217;s metadata. Creating metadata is a great way to get an oerview internally, but there are increasing demands on public authorities to also [...]]]></description>
			<content:encoded><![CDATA[<p>In rural Scotland, it&#8217;s the time where harvesters starts being a common image on the fields. But I am setting up for a different kind of harvest &#8211; the harvesting of my organisation&#8217;s metadata.</p>
<p><span id="more-102"></span></p>
<p>Creating metadata is a great way to get an oerview internally, but there are increasing demands on public authorities to also share the metadata externally. Apart from the legal requirements for serving spatial data and metdata, there is a plethora of ways in which this can be done, and a lot of it comes down to how deep your pockets are, and how many staff hours/days/months/years you are willing to spend on it.</p>
<p>One of the characteristic of most metadata is that it adheres to a known standard. This standard defines the information fields used to describe data holdings, and also details if a fields is mandatory to fill in, optional or sometimes conditional, depending on the settings on other fields.</p>
<p>Common to a lost of these standards are some basic fields, like title, description/abstract, originator, publication and modificaiton dates for example. But some standards goes much further, and start providing much more detail on the source, formats and extent of the data sources.</p>
<p>There is generally considerable overlap between a lot of metadata stanards, and this is one of the sources of frustraiton for people beginning the process of working up metadata in my experience. With a bewildering array of standards, with sometimes only subtle differences, it can be difficult to get an overview of which one to start with. However, my strong recommendation is to spend some time on this aspect. Yes it involves reading long, tedious documentation and comparing across standards. But by getting this knowledge early on, you can avoid setting yourself up for disappointment. Some standards will be supersets of others, and as such require more information. Therefore, it is important to be clear on what your obligations and aims are in terms of capturing and serving information. There&#8217;s no need to capture more that needed- the only thing to come out of that is that either you or your staff spend more time on creating the records, or in worst case, abandon creating metadata, because it takes too long.</p>
<p>Next thing is to ensure that you have captured all the required fields for the stanadrds that you are going to deliver in. For example, in my organisaiton, we have a database which was natively in an<a href="http://www.esd.org.uk/standards/egms/"> eGMS</a> metadata format. However, we actually need to serve data externally in a <a href="http://location.defra.gov.uk/wp-content/uploads/2010/09/GEMINI2_1_published.pdf">UK GEMINI</a> and <a href="http://www.oceannet.org/marine_data_standards/medin_disc_stnd.html">MEDIN</a> metadata formats.  But rather than completely changing the existing 750 records, we could add some fields, and slightly modify existing ones. This allowed us to capture all the necessary information to comply with new formats even though we strictly speak still worked with an eGMS database.</p>
<p>In this post i&#8217;m not going into the creation and user engagement of metadata, but rather the &#8220;behind the scenes&#8221; element of getting things out. With a datbase, we got the ability to write out the data in XML format.</p>
<p>And this is where its currently at. We can export XML files form a SQL database. These are then validated in two ways. First they are validated for the standards that the XML format (in this case the MEDIN format) lists in its header, and the file&#8217;s conformity to those standards as well and generic XML formatting rules.</p>
<p>Once that hurdle is crossed we&#8217;re moving into actual publishing of the metadata, and as mentioned, there are many ways to do this, depending on the depth of pockets etc.</p>
<p>Commonly to comply with harvesters from other portals, we must be able to deliver the metadata in either a <a href="http://www.openarchives.org/OAI/openarchivesprotocol.html">OAI-PMH</a> format and act as a <a href="http://en.wikipedia.org/wiki/Catalog_Service_for_the_Web">catalogue server</a>. To comply with these abilities, we&#8217;re deploying <a href="http://geonetwork-opensource.org/">geonetwork</a> which will accept the individual XML records created locally, and serve them up up the web, both through a browsable and searchable interface, and runnign a CWS at the same time.</p>
<p>So in future posts, i will share about more about experiences about actually releasing and harvesting metadata, but to summarise the recommendations so far:</p>
<p>1. Get to know your standards. The documentation is dry, but necessary. You can save yourself a lot of work by investigating these early on</p>
<p>2. Don&#8217;t be afraid to mix&#8217;n match internally. If a combination of one or more standards suit your organisaiton better, then it&#8217;s not really a problem. As long as you can populate the standards that you want to share externally, you can have more or less fields (or even translate you internal formats to the standard) on the inside.</p>
<p>3. Consider the scale of distribution. How central is this to your business and who is going to use it? If metadata is at the core of your MDM, you prbably want full integration so the metadata is always close to the data, and perhaps even linekd directly to it in yur applications (e.g. GIS). But for others it may be a low number of records, and they can be hand edited and published.</p>
<p>4. More than one standard &#8211; or more than one harvester coming to visit? If you know who are going to be your main customers in terms of ingesting your accessible metadata you can set up to target collections for these. It also means that its quite easy to serve the same metadata to multiple groups, simply by &#8220;tagging&#8221; them &#8211; at least in Geonetwork.</p>
<p>5. Ask the devleopers. Wen you read metadata and methodology documentation, it can be easy to think of it all as terribly complex and &#8220;i probably don&#8217;t understand it&#8221;. But if something does not make sense- ask the writer or developers! In a couple of cases my own questions have turned out to be errors in the documentation and they have been updated and corrected as a consequence &#8211; thus makes it easier for the next person reading it.</p>
]]></content:encoded>
			<wfw:commentRss>http://sci.overchord.net/2011/07/xml-metadata-and-setting-up-for-harvest/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>TechTalk: WinMerge makes comparing files a lot easier</title>
		<link>http://sci.overchord.net/2011/06/techtalk-winmerge-makes-comparing-files-a-lot-easier/</link>
		<comments>http://sci.overchord.net/2011/06/techtalk-winmerge-makes-comparing-files-a-lot-easier/#comments</comments>
		<pubDate>Sat, 25 Jun 2011 22:00:12 +0000</pubDate>
		<dc:creator>Jens</dc:creator>
				<category><![CDATA[Software]]></category>

		<guid isPermaLink="false">http://sci.overchord.net/?p=97</guid>
		<description><![CDATA[Pardon me for a deviation into application specific topics, but as a working data manager in the middle of a massive migration of content between systems, I had to shared my recent admiration for the Open Source WinMerge product. When it comes to moving files around, it is a task that at face value seems [...]]]></description>
			<content:encoded><![CDATA[<p>Pardon me for a deviation into application specific topics, but as a working data manager in the middle of a massive migration of content between systems, I had to shared my recent admiration for the Open Source <a href="http://sourceforge.net/projects/winmerge/">WinMerge</a> product.</p>
<p>When it comes to moving files around, it is a task that at face value seems incredibly easy, but when you scale it to many servers in terrabyte volumes, and add permissions for hundreds of groups to the mix, then it start becoming a bit of a job.</p>
<p>First part was the straight forward bit of getting things copied, and then synched &#8211; Robocopy is the workhorse here &#8211; but ultimately, you will probably want to make sure everything got included in those scripts, and this is where WinMerge comes in. The product has several modes of comparison for very fast size comparisons between two sets of folders to full binary comparison of all content. This is a major strength when working large scale as oyu can run fast scans, and home in on any issues this way.</p>
<p>It also allow you to fluently merge across from one set of fodlers to the other if you determine problems, but I mainly use it as a diagnostic tool &#8211; and so far, I&#8217;ve not found anything near it in user friendliness or performance.</p>
<p>The fact that it is an open source tool (GPL) tool only makes me happier &#8211; it is great to see collaborative projects that really nail some of the vaccums in the market!</p>
]]></content:encoded>
			<wfw:commentRss>http://sci.overchord.net/2011/06/techtalk-winmerge-makes-comparing-files-a-lot-easier/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Data Qualia</title>
		<link>http://sci.overchord.net/2011/03/data-qualia/</link>
		<comments>http://sci.overchord.net/2011/03/data-qualia/#comments</comments>
		<pubDate>Wed, 02 Mar 2011 09:04:58 +0000</pubDate>
		<dc:creator>Jens</dc:creator>
				<category><![CDATA[Data Management]]></category>
		<category><![CDATA[Online Data Sources]]></category>
		<category><![CDATA[Data Qualia]]></category>
		<category><![CDATA[Data Quality]]></category>
		<category><![CDATA[Philosophy]]></category>
		<category><![CDATA[Qualia]]></category>

		<guid isPermaLink="false">http://sci.overchord.net/?p=94</guid>
		<description><![CDATA[First off &#8211; No i&#8217;m not so rushed that the heading is a type-o. It is instead the title of a short but interesting blog post by Jim Harris over at the OCDQ blog. In this post, Jim covers how the word &#8220;Qualia&#8221; is used to describe subjective quality of conscious experience, and how very, [...]]]></description>
			<content:encoded><![CDATA[<p>First off &#8211; No i&#8217;m not so rushed that the heading is a type-o.</p>
<p>It is instead the title of a short but interesting blog post by Jim Harris over at the <a href="http://www.ocdqblog.com" target="_blank">OCDQ blog</a>.</p>
<p>In this<a href="http://www.ocdqblog.com/home/data-qualia.html" target="_blank"> post</a>, Jim covers how the word &#8220;Qualia&#8221; is used to describe subjective quality of conscious experience, and how very, very, relevant it is in terms of data quality management.</p>
<p>Anything above and beyond the most basic data quality checking is really in the eye of the beholder. One of the common memes of DQ is that it is &#8220;fit for purpose&#8221;. Although fitness of data is important, it is the more qulia than quality related element of the purpose that raise issues, especially when it comes to sharing data.  Trying to predict the purpose of someone accessing data and ensuring that the data are fit for this purpose is virtually impossible. Yet it is an issue that becomes important when organisations and public authorities may otherwise have to include lengthy licenses to remove doubt about liability.</p>
<p>So perhaps the term Data Qualia is one we will see more often in the future <img src='http://sci.overchord.net/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://sci.overchord.net/2011/03/data-qualia/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Getting work done: Anywhere but the office</title>
		<link>http://sci.overchord.net/2011/02/getting-work-done-anywhere-but-the-office/</link>
		<comments>http://sci.overchord.net/2011/02/getting-work-done-anywhere-but-the-office/#comments</comments>
		<pubDate>Fri, 11 Feb 2011 08:40:29 +0000</pubDate>
		<dc:creator>Jens</dc:creator>
				<category><![CDATA[General]]></category>
		<category><![CDATA[efficiency]]></category>
		<category><![CDATA[Inspiration]]></category>
		<category><![CDATA[Management]]></category>
		<category><![CDATA[Meetings]]></category>
		<category><![CDATA[Time]]></category>
		<category><![CDATA[Work]]></category>

		<guid isPermaLink="false">http://sci.overchord.net/?p=89</guid>
		<description><![CDATA[In his TED talk, Jason Fried, pretty much nails it &#8211; why we don&#8217;t manage to get work done at work! Managers and Meetings are the prime source of interruptions and articifial divisions that means you never make it through a full &#8220;work-cycle&#8221; he explains in the talk. There is a lot of truth in [...]]]></description>
			<content:encoded><![CDATA[<p>In his <a href="http://www.ted.com/talks/jason_fried_why_work_doesn_t_happen_at_work.html" target="_blank">TED talk</a>, Jason Fried, pretty much nails it &#8211; why we don&#8217;t manage to get work done at work!</p>
<p>Managers and Meetings are the prime source of interruptions and articifial divisions that means you never make it through a full &#8220;work-cycle&#8221; he explains in the talk. There is a lot of truth in this, in that meetings very often tend to generate two things:  more meetings and changed objectives.</p>
<p>So why am i even including this on a data management blog? Well for starters, most data managers work in some kind of office with some colleagues, and quite likelyt can be the source of thew two major evils to other workers: management and meetings. As someone with the word &#8220;management&#8221; directly in the title, I guess we are perhaps jsut native interruptions to other workers in an organisation, and the need to call meetings to get combined strategies and systems in place, pretty much makes us walking interruption bombs if we&#8217;re not careful.</p>
<p>So what can we do in terms of getting good management of data, but without being a constant source of interruption to others? Wll i think that the job of a data managemer is working as a conduit, translating needs, and bridging the technology and business goals of the organisations we work for. Jsut like everyone else, we&#8217;ll need periods of uninterrupted work to be creative, and to come up with good solutions &#8211; but we also need to respect others need for the same. SO it&#8217;s a two-way street &#8211; reduce your own interruptions, and be careful about interrupting others. Perhpas use email or IM for something that does not require a room full of people for two hours to reach consensus on.  Perhaps consider that the door shouldn&#8217;t <em>always</em> be open. It&#8217;s good to be approachable &#8211; but you need time to concentrate as well.</p>
<p>I think preparation is key as well. When it finally comes to having some of the meetings that simply are necessary &#8211; make sure you are prepared. Identify what you need to do at the meeting before you sit in the conference room! Don&#8217;t let anyone run away with the agenda or hijack the meeting &#8211; make a plan and stick to it. There are meetings that are productive &#8211; but those are the ones where people walk out afterwards and have a clear idea of what was achieved, and who is doing what. Those kinds of meetings takes time to prepare for &#8211; uninterrupted time <img src='http://sci.overchord.net/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /> </p>
<p><!--copy and paste--><object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="446" height="326" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="allowScriptAccess" value="always" /><param name="wmode" value="transparent" /><param name="bgColor" value="#ffffff" /><param name="flashvars" value="vu=http://video.ted.com/talks/dynamic/JasonFried_2010X-medium.flv&amp;su=http://images.ted.com/images/ted/tedindex/embed-posters/JasonFried-2010X.embed_thumbnail.jpg&amp;vw=432&amp;vh=240&amp;ap=0&amp;ti=1014&amp;introDuration=15330&amp;adDuration=4000&amp;postAdDuration=830&amp;adKeys=talk=jason_fried_why_work_doesn_t_happen_at_work;year=2010;theme=new_on_ted_com;theme=not_business_as_usual;theme=a_taste_of_tedx;event=TEDxMidwest;&amp;preAdTag=tconf.ted/embed;tile=1;sz=512x288;" /><param name="src" value="http://video.ted.com/assets/player/swf/EmbedPlayer.swf" /><param name="bgcolor" value="#ffffff" /><param name="allowfullscreen" value="true" /><embed type="application/x-shockwave-flash" width="446" height="326" src="http://video.ted.com/assets/player/swf/EmbedPlayer.swf" wmode="transparent" bgcolor="#ffffff" allowfullscreen="true" allowscriptaccess="always" flashvars="vu=http://video.ted.com/talks/dynamic/JasonFried_2010X-medium.flv&amp;su=http://images.ted.com/images/ted/tedindex/embed-posters/JasonFried-2010X.embed_thumbnail.jpg&amp;vw=432&amp;vh=240&amp;ap=0&amp;ti=1014&amp;introDuration=15330&amp;adDuration=4000&amp;postAdDuration=830&amp;adKeys=talk=jason_fried_why_work_doesn_t_happen_at_work;year=2010;theme=new_on_ted_com;theme=not_business_as_usual;theme=a_taste_of_tedx;event=TEDxMidwest;&amp;preAdTag=tconf.ted/embed;tile=1;sz=512x288;"></embed></object></p>
]]></content:encoded>
			<wfw:commentRss>http://sci.overchord.net/2011/02/getting-work-done-anywhere-but-the-office/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Discussing Data: LinkedIn Groups</title>
		<link>http://sci.overchord.net/2011/02/discussing-data-linkedin-groups/</link>
		<comments>http://sci.overchord.net/2011/02/discussing-data-linkedin-groups/#comments</comments>
		<pubDate>Wed, 02 Feb 2011 10:00:08 +0000</pubDate>
		<dc:creator>Jens</dc:creator>
				<category><![CDATA[Data Management]]></category>
		<category><![CDATA[General]]></category>
		<category><![CDATA[Online Data Sources]]></category>

		<guid isPermaLink="false">http://sci.overchord.net/?p=85</guid>
		<description><![CDATA[Data Managers benefit from discussing with peers, jsut like in any other trade &#8211; and perhaps even more so because of the rapid technological development pace. However, as cash is strapped during this financial downturn, many may find travel options limited and will turn to the web instead (or in addition to the personal face [...]]]></description>
			<content:encoded><![CDATA[<p>Data Managers benefit from discussing with peers, jsut like in any other trade &#8211; and perhaps even more so because of the rapid technological development pace. However, as cash is strapped during this financial downturn, many may find travel options limited and will turn to the web instead (or in addition to the personal face to face meetings).</p>
<p>There are of course many, many places on the web where you can start discussing, but I often find that locating an active community can be a bit of a challenge. LinkedIn is a thriving community with many active groups, mainly focussed on the professional aspects of our lives. The benefit of some of these groups is that they include real life professionals with clear credentials, and have groups where people actively debate issues that are close to the hearts of data managers, ranging from how best to implement quality management through to provacy and ownership issues.</p>
<p>In this post, I have gathered a short list of som e of the groups i enjoy reading and participating in. Note that most of them you will have to request to join &#8211; but it helps reduce the amount of spam and keeps the discussion focussed amongst people with relevant interests. I hope you might find it useful, and if you have other sites/communities or groups, please do share <img src='http://sci.overchord.net/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p><a href="http://www.linkedin.com/groups?mostPopular=&amp;gid=1891037" target="_blank">Data Ownership in the Cloud</a></p>
<p>The Data Ownership in the Cloud group on LinkedIn is a global venue for multi-disciplinary networking between technologists and non-technologists interested in providing thought leadership on this critical issue. </p>
<p><a title="This group is members only" href="http://www.linkedin.com/groups?home=&amp;gid=41393&amp;trk=anet_ug_hm">IAIDQ Information/Data Quality Professional Open Community</a></p>
<p>LinkedIn.iaidq.org is an open community for Information Quality, Data Quality and Data Governance professionals (practitioners, consultants, academics, vendors etc.) to support collaboration, learning networking and interaction in a vendor neutral format. </p>
<p><a title="This is an open group" href="http://www.linkedin.com/groups?home=&amp;gid=1853443&amp;trk=anet_ug_hm">Obsessive-Compulsive Data Quality (OCDQ)</a></p>
<p>This is a networking group for Obsessive-Compulsive Data Quality (OCDQ), which is an independent blog offering a vendor-neutral perspective on data quality. </p>
<p>Vendor-neutral does not mean no vendor related content. When the products and services of vendors are presented or discussed, it will be done in an objective manner. </p>
<p>The goal is to foster an environment in which a diversity of viewpoints is freely shared without bias. Everyone is invited to get involved in the discussion and have an opportunity to hear what others have to offer.</p>
<p><a title="This group is members only" href="http://www.linkedin.com/groups?home=&amp;gid=812977&amp;trk=anet_ug_hm">Talend Open Source Data Integration</a></p>
<p>Talend is the first provider of open source data integration solutions, used primarily for ETL for business intelligence/data warehousing, data synchronization, data migration, operational data integration, data quality and MDM. </p>
<p>This group enables Talend users (and friends) to share information, news, discussions, and ideas about their Talend projects &#8211; and anything else of interest for the Talend community! </p>
<p><a title="This group is members only" href="http://www.linkedin.com/groups?home=&amp;gid=1845741&amp;trk=anet_ug_hm">Data Management Utopia</a></p>
<p>The purpose of this group is to bring together a professional group of individuals to collaborate about issues, problems, situations within the data space. </p>
<p>We all know that the world of enterprise data is sometimes hard to control and understand but together we can make a difference and learn from one another&#8217;s experiences. Whether you’re dealing with Oracle, SAP, IBM, etc. it doesn&#8217;t matter we still share some of the same pain points. </p>
<p>This group encourages everyone to share their lessons learned, thought leadership ideas, best practices, etc. that will educate and promote the development of world class solutions.</p>
<p><a title="This is an open group" href="http://www.linkedin.com/groups?home=&amp;gid=3374609&amp;trk=anet_ug_hm">Habitats &#8211; (Social Validation of INSPIRE Annex III Data&#8230;</a></p>
<p>Habitats (Social Validation of INSPIRE Annex III Data Structures in EU Habitats) related spatial data is critical in the management of Europe’s bio-diversity. INSPIRE needs work here, particularly in its Annex III data themes: 16-18 (Sea regions, Bio-geographical regions, Habitats &amp; biotopes, Species distribution).</p>
]]></content:encoded>
			<wfw:commentRss>http://sci.overchord.net/2011/02/discussing-data-linkedin-groups/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Data Management as a search term</title>
		<link>http://sci.overchord.net/2011/01/data-management-as-a-search-term/</link>
		<comments>http://sci.overchord.net/2011/01/data-management-as-a-search-term/#comments</comments>
		<pubDate>Thu, 13 Jan 2011 12:11:18 +0000</pubDate>
		<dc:creator>Jens</dc:creator>
				<category><![CDATA[Data Management]]></category>
		<category><![CDATA[Online Data Sources]]></category>
		<category><![CDATA[Google Trends]]></category>
		<category><![CDATA[Searching]]></category>

		<guid isPermaLink="false">http://sci.overchord.net/?p=77</guid>
		<description><![CDATA[When going out on the World Wide Web, searching for solutions, discussions, news, and just about everything else relating to a topic you might be interested in, we leave a little footprint behind. These footprints of what has been searched for can at times be interesting to study. It is not just newspapers that in the lead [...]]]></description>
			<content:encoded><![CDATA[<p>When going out on the World Wide Web, searching for solutions, discussions, news, and just about everything else relating to a topic you might be interested in, we leave a little footprint behind. These footprints of what has been searched for can at times be interesting to study. It is not just newspapers that in the lead up to the New Year reports on the most sought after words on the web by using these tools; they can also give you a little insight into topic areas and how they are developing.</p>
<p><a href="http://trends.google.com" target="_blank">Google trends</a> provides an option for combining multiple search terms and look at their trends, either on a global, or national basis (or even sub regions  if it is very popular search terms).  This tool can be quite useful for understanding a little about the field you work in. Enter some common search terms that you think would be logical  variations on a topic, and see how they stack up against each other.</p>
<p>No doubt people can argue long and hard about the selection of search terms that you want to combine to examine trends. In this case, I have taken the &#8220;data management term&#8221; to see how much it is used in comparison with two related subject &#8220;GIS&#8221; and &#8220;databases&#8221;. While in many ways they cover slightly different aspects, there is also a good degree of common ground. People who look for databases terms are involved in a form of data management or development, and GIS users are perhaps some of the most prominent users of data that require careful management of the source data.</p>
<p>I limited the search to UK, as I was mainly interested in looking at the development over time. I was half expecting a steady increase in the GIS term, given many more users, and increasing legal requirements surrounding spatial data and representation. However, when the three terms are combined, GIS is quite stable, while the data management term has developed in the early years, and then stayed relatively stable over the past 2-3 years. It is interesting to see that data management was not on the minds of many people prior to 2005, so it is indeed still a young field.</p>
<div id="attachment_80" class="wp-caption aligncenter" style="width: 560px"><a href="http://sci.overchord.net/wp-content/uploads/2011/01/trends.jpg"><img class="size-full wp-image-80  " title="trends" src="http://sci.overchord.net/wp-content/uploads/2011/01/trends.jpg" alt="" width="550" height="323" /></a><p class="wp-caption-text">Google Trends for 3 search terms in the UK: &quot;data management&quot;, &quot;GIS&quot;, and &quot;databases£</p></div>
]]></content:encoded>
			<wfw:commentRss>http://sci.overchord.net/2011/01/data-management-as-a-search-term/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>BODC&#8217;s New data catalogue</title>
		<link>http://sci.overchord.net/2010/11/bodcs-new-data-catalogue/</link>
		<comments>http://sci.overchord.net/2010/11/bodcs-new-data-catalogue/#comments</comments>
		<pubDate>Wed, 24 Nov 2010 11:52:12 +0000</pubDate>
		<dc:creator>Jens</dc:creator>
				<category><![CDATA[Online Data Sources]]></category>
		<category><![CDATA[BODC]]></category>
		<category><![CDATA[Data]]></category>
		<category><![CDATA[Data portal]]></category>
		<category><![CDATA[Oceanography]]></category>
		<category><![CDATA[open data]]></category>

		<guid isPermaLink="false">http://sci.overchord.net/?p=72</guid>
		<description><![CDATA[British Oceanographic Data Centre has jsut announced a new facility on their website to search and retrieve data series directly from the web. While a lot of data could be retrieved before, this catalogue truly opens up access across all categories and project, with over 76,000 data series being put online in a searchable format. [...]]]></description>
			<content:encoded><![CDATA[<p>British Oceanographic Data Centre has jsut announced a new facility on their website to search and retrieve data series directly from the web. While a lot of data could be retrieved before, this catalogue truly opens up access across all categories and project, with over 76,000 data series <a href="https://www.bodc.ac.uk/data/online_delivery/nodb/">being put online </a>in a searchable format. The series are mainly CTD casts, but also include bathymetry meterology, optical properties, wave data and more.</p>
<p>The great thing is that data is available in several recognised formats, NetCDF, ODV and ASCII files &#8211; so virtually everyone in the field can access this data in a preferred format.</p>
<p>There are some limitations in terms of the way you can refine searches, but most of them makes sense from the perspective of optimising searches and not hanging up the server in searches that return virtually everything.</p>
<p>By the time you have narrowed your search criteria to return 1,000 series or less, you can retrieve results. There&#8217;s the option of downloading a KML file of coverage, and you can retrieve data in your preferred format.</p>
<p>It is important to note that we&#8217;re talking data series, not individual points here, so even a single series can contain thousands of data points, giving you access to a seriously large amount of oceanographic data with a wide geographic coverage.</p>
<p>The initial map on the <a href="https://www.bodc.ac.uk/data/online_delivery/nodb/">start page </a>show waters around Britain, but make sure you either zoom out or pan around as there is data from a much wider region &#8211; virtually all of thw world - than what is shown on the map.</p>
<p>You do have to register an account with BODC in order to checkout your &#8220;data shopping&#8221;, but there is a huge amount of data freely available. The map tells you up front which data series are freely available.</p>
<p>BODC has truly made their data a lot more accessible with this exercise.</p>
]]></content:encoded>
			<wfw:commentRss>http://sci.overchord.net/2010/11/bodcs-new-data-catalogue/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

