Tuesday, October 30, 2007

Value Proposition is an unsexy turn of phrase, but...

Like many other folk working in IT, I'm occasionally inclined to throw around the occasional metaphor to clarify a dense point when needed. Comparison and analogy are the hammers that force the square peg of understanding into the round hole of blissful ignorance.

I wonder, in the thinking out loud way I like to do, how many people involved in critical decision-making in libraries use the comparative process to relate their own personal day-to-day usage of computers and digital resources back to their profession, to determine the relevance of what the library does, to the average individual. Consider these questions:

Q - How many librarians have found themselves looking aghast at their home computer as it suffered some digital fatality, taking documents, photos, and media with it?
A - Over time, a fair few.

Q - Next, how many of those librarians had the posthumous realization: "Gosh, these things really are quite fragile...if only there was some sort of central institution or resource that could professionally handle and keep accessible large volumes of information as needed!"
A - Based on the services currently available thru libraries, I'd guess either none, or few - made distinct by the looks of frustration on their faces.

[There would be many, many other questions that could be asked to explore the nature of the considerations of librarians about computers, but I'm going to leave it at these two particularly, because they are -

  • common usage scenarios
  • illustrative of need, a great and stubborn justification for pretty much anything.
End parentheses.]

Consider the librarian in the second question - the situation could be easily addressed in several ways: Firstly, by that librarian having a robust personal backup process for stuff on their computer, which obviously isn't always the case. Secondly, by that librarian having uploaded copies of valued data to a remote filestore via the internet - which makes a good case for the irrelevance of libraries in storing digital information. At present, libraries present a sort of rarified menu of digital services and objects - we store digitised items of cultural relevance based on a contemporary evaluation of the items. The stuff that doesn't pass our muster either finds a home on the interweb, or the deep net, or dies on the resource it was born on.

Digital storage is rendering quaint the concept of ingestion under a curatorial eye, not because the practice of the curator is obsolete, but because the amount of available storage for use is increasing dramatically faster than our ability to produce information. We no longer have a finite resource ("shelf space") that dictates selective ingest as far as digital objects go. Therefore, we are in a position to be able to ingest indiscriminately, and curate after the fact to present collections. No-one is in a position to authoritatively judge the relevance of contemporary information to future generations, and by making digital ingestion a much less discriminating process, we leave more information for future generations to interpret.

The natural fit of the curator in a digital age comes when the urge to subsequently access that information arises, and response can then come in the form of a selective collection.

There's been a question I've heard expressed mainly in rhetorical terms around libraries, and that is the question of "When does a digital thing become considered published?"
I don't know if this question has been given a definitive answer, but I'm going to assume that someone clever has responded with something along the lines of "When a digital content creator has put something unique in the public domain." The thing to keep in mind about this digital content creator is that they are overwhelmingly likely to be an individual, working anonymously from their home or place of web, as opposed to a more traditional publishing company.

I'm not even remotely librarian-y, but I have to ask: When we collect and make accessible every serial publication and relevant book to come out of this country, why are we even thinking about the whys and shoulds of digital ingestion when we potentially have no shortage of storage space for it for the foreseeable future, and no need (other than an instinctual one) to invoke any form of conscious appraisal of the material during ingestion? Why are we not just running an automated ingestion process via a web form to connect with those teeming multitudes of individuals creating digital content?

We live in the era of the digital incunabula, and it seems to me that we spend a lot of time with the questions Why? What? Who? around digital ingestion - when instead we could just have a user-directed upload facility on a website, guided by a library-developed submission form, to inform the librarian of the characteristics of the information needed to make it accessible; and curate it after the fact, before it ends up another digital fatality. This provides the library with a much more relevant role in the act of record-keeping in the eyes of the average person. Surely this is low-hanging fruit if ever there was in the library world!

Friday, October 26, 2007

The Source: news about digital libraries and library innovations from around the web

Introducing The Source

Sharing, Privacy and Trust in Our Networked World (Note: PDF)

Download individual sections of the report

From the OCLC website

The practice of using a social network to establish and enhance relationships based on some common ground - shared interests, related skills, or a common geographic location - is as old as human societies, but social networking has flourished due to the ease of connecting on the Web. This OCLC membership report explores this web of social participation and cooperation on the Internet and how it may impact the library’s role, including:
* The use of social networking, social media, commercial and library services on the Web
* How and what users and librarians share on the Web and their attitudes toward related privacy issues
* Opinions on privacy online
* Libraries’ current and future roles in social networking
The research provides insights into the values and social-networking habits of library users


Everyone's guide to by-passing internet censorship (Note: PDF)

From the Citizen Lab, University of Toronto (Ron Deibert)

This guide is meant to introduce non-technical users to Internet censorship circumvention technologies, and help them choose which of them best suits their circumstances and needs.


Broadband: What's All the Fuss About? (Note: PDF)

From the Pew Internet & American Life Project website

Today, with nearly half of all Americans having high-speed internet connections at home, online interactivity means something different for a lot of Americans than it did when it was mainly about email. Many-to-many communication is now buttressed by many-to-many participation in the online world through user-created media. Still, questions remain about the use of advanced communications networks. Among them: Why does access to a high-speed connection at home matter? The fuss about broadband extends beyond access to information to active participation in the online commons as people with shared interests or problems gather at various online forums to chat or collaborate.


Propositions about innovation and change


From Stephen's Lighthouse (Stephen Abram) website

Innovation is a favourite topic of Stephen Abram's. Here is a list of innovation and change propositions by Jack Martin Leith, designed to stimulate thought and discussion.
Propositions include:
* Replace desired future with desired present
* Realise that change is not a journey
* Recognise the four genres of value creation activity
* Conversations, not messages
* Replace buy-in with join-in
* Liberate people’s passion
* Make good use of people’s wisdom
* Find out what works and do more of it

Wednesday, October 24, 2007

Do we need comment moderation?

The National Library has now been blogging for about 2 months. To date, Create Readers has 90 posts and 57 comments; LibraryTechNZ has 36 posts and 39 comments.

Comments are one of the things we're watching as we test the blogging waters. As I noted in an earlier post about setting up these blogs, we're pre-moderating comments on Create Readers (checking the comments when they are submitted, and publishing them ourselves) and post-moderating comments on this blog (comments are published as soon as they're submitted, and we check them afterwards).

The most-commented post on Create Readers is 'What book got you hooked', and on LibraryTechNZ it's 'The long(itude) and short on maps'. Both comment threads include comments from National Library staff, which do plump up the numbers a bit, but which are also replies to readers' questions or feedback. So far, we've rejected just one comment - an author who submitted a comment to Create Readers touting his books. One of the things we agreed before launching the blog was that it shouldn't function as an advertising site.

On the upside - we've not had any spamming. On the downside, our own spam filters seems to be blocking some of the comment notifications, which are sent from Blogger to a central email address and then forwarded on to various staff. This means we don't always publish or respond to comments as promptly as I'd like.

One of the things I find most interesting is that some people have used this blog - which is externally hosted - to let us know when they can't access other National Library sites. Now we're considering redirecting web visitors to a notification on this blog when we have a scheduled server outage.

Another thing that I know our Create Readers bloggers find really rewarding is the interaction that the commenting encourages - for example, on this post about Readergirlz, an American online book community. Create Readers gets more shout-outs like that; LibraryTechNZ comments tend to be queries or requests for more information (and the odd correction!).

Come the end of the year we'll be evaluating these blogs, and comment moderation is on the list of things to discuss. Although the experience has been overwhelmingly positive, I'll be recommending that we retain the moderation on Create Readers, as the blog is being used in class rooms.

To be frank, I've been overjoyed with the feedback we've received from readers; I was totally prepared to face up to a comment-desert come the end of the first 3 months of blogging. It's encouraging and motivating to know that people are getting something out of what we write.

Friday, October 19, 2007

The Source: news about digital libraries and library innovations from around the web

Introducing The Source

The Future of Electronic Paper

From the Future of Things website

Thirty-five years in the making, electronic paper is now closer than ever to changing the way we read, write, and study — a revolution so profound that some see it as second only to the invention of the printing press in the 15th century. Made of flexible material, requiring ultra-low power consumption, cheap to manufacture, and—most important—easy and convenient to read, e-papers of the future are just around the corner, with the promise to hold libraries on a chip and replace most printed newspapers before the end of the next decade. This article will cover the history, technology, and future of what will be the second paper revolution.


Take the “Did You Know?” Quiz About Digital Preservation

From the Library of Congress website

Did you know that digital materials can be more difficult to preserve than physical ones? Take this quiz to test your digital preservation know-how. Each result offers links to learn more about various digital preservation topics.


Tagging over Time: Real-world Image Annotation by Lightweight Meta-learning (Note: PDF)

From Stanford University InfoLab

Automatic image annotation has been a hot-pursuit among multimedia researchers of late. Modest performance guarantees and limited adaptability often restrict its applicability to real-world settings. We propose tagging over time (T/T) to push the technology toward real-world applicability. The T/T framework consists of a principled probabilistic approach to meta-learning, which acts as a go-between for a black-box annotation system and the users. Inspired by inductive transfer, the approach attempts to harness available information, including the black-box models performance, the image representations, and the WordNet ontology.


Image Retrieval: Ideas, Influences, and Trends of the New Age (Note: PDF)
Addendum Information (study on publication trends and impact on different fields) (Note: PDF)

From Stanford University InfoLab

We have witnessed great interest and a wealth of promise in content-based image retrieval as an emerging technology. While the last decade laid foundation to such promise, it also paved the way for a large number of new techniques and systems, got many new people involved, and triggered stronger association of weakly related fields. In this paper, we survey almost 300 key theoretical and empirical contributions in the current decade related to image retrieval and automatic image annotation, and discuss the spawning of related sub-fields in the process.

Wednesday, October 17, 2007

33,000 web professionals speak: The A List Apart survey

A List Apart (one of my favourite web resources) have just posted the findings from their 2007 Web Design Survey.

33,000 web professionals answered the survey, which covered areas such as the kinds of organisations people worked for, job satisfaction, length of time in the profession, area of work, education, skills, skill gaps and perceived biases.

The resulting 81 page report (note: PDF) makes for interesting reading. The first 20-odd pages are basic findings; the rest are detailed break-downs by age, gender, salary, job title, ethnicity, geographic location ...

Some of the things that interested me:

  • Gender split of respondents: Male 83%, Female 16% [1% no response].
  • Overall, women and men who are working full-time were earning similar amounts.
  • Designers were most likely to perceive their education as relevant to their work; writers/editors the least.
  • 73% of respondents had their own blog or website. It didn't really matter if you were a man or a woman, earned under $20K or over $100K, were a writer or an information architect, under 21 or over 60: you most probably had a blog.
  • Mark-up (HTML, XHTML) was the most-claimed skill; writing the least.
  • Interface designers were the most likely to feel they lacked a needed back-end skill: developers the least.
  • 20% of respondents who felt they needed CSS coding skills didn't have them.

Normally, reading survey results isn't high on my list. But in browsing through this one, I suddenly got a sense of how I professionally fit into a global workforce, and what that workforce looks like, thinks of itself, has evolved. I recommend it, especially if you're in a management position and looking to grow this part of your institution/business.

Tuesday, October 16, 2007

nzresearch.org.nz beta now live for searching institutional repositories

Just released in beta is the Kiwi Research Information Service which is a gateway to research documents produced at universities, polytechnics, and other research institutions throughout New Zealand.


The site provides an aggregating service by harvesting metadata from institutional repositories, and allowing visitors to easily find research documents from across New Zealand. I had a quick play this morning and there are some nice features. Top of the list are the click-through reports which show the most popular research docs... it's very digg-ish. There is also an RSS feed of latest additions. And the search works a treat... check out all the research on libraries for a start.

There are currently 3843 documents from 17 institutions, and harvests include the likes of:

This is a joint project between the National Library of New Zealand and many of New Zealand's universities and polytechnics. More information is also available covering:
Just a reminder that the site is in beta, so if there are any comments or feedback, please feel free to pass them on.

Friday, October 12, 2007

The Source: news about digital libraries and library innovations from around the web

Introducing The Source


Playing Tag: An RFID Primer (Note: PDF)

From the Pacific Research Institute website

The new technology rolled out to great fanfare from promoters and corporate customers. This new technology, some of the activists said, was nothing less than the mark of the beast, a harbinger of the apocalypse. Others warned that it would empower Big Brother to keep all citizens in line. Such was the response to the bar code, a technology that has become virtually universal, with no sign that the sky is falling. A similar scenario is playing out with RFID, a technology that has a far wider range of benefits than the bar code, but which has activists sounding the alarm and legislators, state and federal, rushing to the rescue.


Legal Issues for Social Networking Sites and Users

From the FindLaw website


It seems that everyone is a member of a social network these days. Whether it’s kids on MySpace and Facebook, or colleagues on LinkedIn, people are taking advantage of these new online meeting spaces to make friends, communicate and expand business opportunities. But what are the legal obligations that arise out of the use of social networks, both for the user and the sites themselves? The law in this area is still relatively unsettled, but some recent developments have created intriguing precedent, and legislation in motion promises to keep things interesting for the foreseeable future.


Ontology and the Semantic Web (Note: PDF)

From the dLIST website

This paper discusses the development of a new information representation system embodied in ontology and the Semantic Web. The new system differs from other representation systems in that it is based on a more sophisticated semantic representation of information, aims to go well beyond the document level, and designed to be understood and processed by machine. A common theme underlying these three features, i.e., turning documents into meaningful interchangeable data, reflects a rising use expectation nurtured by modern technology and, at the same time, presents a unique challenge for its enabling technologies.

Tuesday, October 9, 2007

Here comes Creative Commons Aotearoa New Zealand

With Creative Commons approaching its 5th birthday worldwide it's nice that it is finally reaching our shores. The Council for the Humanities has been working steadily away on this for the past year - with support from the National Library of NZ and funding granted from the Digital Strategy's Community Partnership Fund. They recently announced a Wellington seminar and a launch date of the 27th of October. Before we get into what this may mean for libraries let's do a quick recap.


What exactly is Creative Commons?
Creative Commons Aotearoa New Zealand says that "Creative Commons aims to establish a fair middle way between the extremes of copyright control and the uncontrolled exploitation of intellectual property. It provides a range of copyright licenses, freely available for public use, which allow those creating intellectual property – including authors, artists, educators and scientists – to mark their work with the freedoms they want it to carry."

Essentially, it provides a free set of licenses so that creators can clearly protect and share their work. There are six main Creative Commons licenses available that mix and match basic rights such as whether attribution is required, whether copies can be used for commercial purposes, whether individual parts can be reworked into new creations, and whether copies need to retain the same license.

A simple example of how Creative Commons is in use today is on the photo-sharing site Flickr. When you upload a photo you get to choose the license - you can choose from various Creative Commons licenses, or alternatively All Rights Reserved. This makes it clear whether users can grab your photo to use in a presentation or do whatever it is that they had in mind. The National Library of NZ has been running a small pilot on Flickr by posting selected collection images. We've had to post them as All Rights Reserved... which isn't really in the spirit of Flickr... but I'll cover that in another post.

Why bother?
The other day I picked up a leaflet issued by Creative Commons Aotearoa New Zealand. The title of the inside cover read "Why? Participatory culture". The leaflet went on to say:
"Digital technologies are connecting people in ways that were never before possible. Creative commons aims to help ensure a participatory culture - a culture in which everyone can actively engage in the creativity that surrounds us, and in which access is assured to cultural, scientific, and educational content that has been pre-cleared for use by it's authors."
And why would you use Creative Commons... the leaflet continued...
  • To maximise exposure and increase distribution;
  • To rely on innovative business models rather than fully fledged copyright to secure a return on your creative investment; and,
  • To contribute to and participate in the public sphere.
It's the best of both worlds really, providing real value to creators, and well as being valuable to everyone else.

International applicability?
The original Creative Commons licenses were drawn up by reference to US Copyright law and, while they were designed to be jurisdiction-neutral, its clear that some aspects of the US derived license do not align well to other country's legislation. Hence the efforts to port the licenses to other legal frameworks. The Creative Commons Aotearoa New Zealand licenses have in fact been derived from versions in the England and Wales.

Now whether this means that, if we choose a CC license for a photo on the US-based Flickr service, we will now be covered by the NZ-derived license... I don't know. I hope so. I'll need to dig into this to find out... any lawyers out there care to comment? Update: So I've spoken with one of the lawyers involved in drafting the NZ licenses and the view is that the NZ license will only apply if creators specifically assign them (makes sense!). In the case of Flickr there is no option to choose country-derived licenses so that kills that idea. There's a thread on Flickr where this is mentioned if you want more. In any case, the unported licenses can still apply. Please don't take this as legal advice... because it would be silly to take legal advice from a web designer. Seriously, if you are thinking about applying Creative Commons licenses to your works, get some professional legal advice or do make sure you read all the details so you understand what it means.

What does this mean for Libraries?
Virginia (of Matapihi fame) and I got to thinking about what impact CC may have on those institutions holding special collections or heritage archives:
  1. Will collecting institutions have to grapple with a new type of Intellectual Property? For instance, the born digital content we collect may already carry these licenses. Maybe its time to add new rights options to the collection management database and the Web Curator Tool.
  2. Collecting institutions sometimes invite donors to assign copyright to the institution. What if policies were altered to take account of CC as well? That when someone donated an item they also identified that derivatives and sharing were okay. While the role of mediator between the wishes of the donor and the requester in the public remain, the nature of those requests will change. We may even find ourselves designing systems to encourage play rather than simply provide access.
  3. What are the opportunities to make life easier on our users? Let's design systems that make it clear and transparent which items you can do things with, and which items you need to ask for permission. Paul Reynolds has suggested that NZ heritage collections should get together and put out some demonstrator works under CC. Sounds like a great idea.
  4. The National Library of NZ is also a Government Department, so material that we publish can come out under Crown Copyright. No doubt other libraries also copyright their own digital content as well, and I can't help but wonder if we shouldn't all use CC licenses instead... as it may end up being clearer and more helpful.
I'd be interested to know if you think this will have an impact on you. Does anyone know if there's been any impact to libraries in other countries? Keen to hear your thoughts.

Be Heard. Forever.

In August 2006, The National Library of New Zealand (Te Puna Matauranga o Aotearoa) Act 2003 was supplemented by the National Library Requirement Notice which extended the scope of material collected through legal deposit to include electronic documents such as CDs and DVDs. Great news for New Zealand's documentary heritage, but legislative changes are not a very exciting message to sell to musicians and their publishers.

The legal deposit process in New Zealand has been in place for print publishers for decades and there is an established and formalised process for requesting new publications.

What makes the music publishing world so different from the print publishing world is the proliferation of independent music producers and independent labels. Many bands are self-published or signed with very small labels that represent maybe a half-dozen bands. The more formal approach to requesting new publications that has succeeded in the print world was not going to work with independent music producers.

Well, why not?! you might ask.

The primary method of contacting publishers to request new publications is by snail mail. Sending a letter certainly won't work if the band is on tour. Most bands don't live together in the same house, and if they're not represented by a manager (and most self-publishing bands are not), then who should we contact? the lead singer? the guitarist? the drummer? the ukeleleist?

We also wanted to open up a dialogue with New Zealand musicians, to give them information about the legal deposit process and explain the benefits of participating. People don't want to be threatened with litigation. They don't want to wade through pages of legal jargon.

So how could we educate NZ bands about the legal deposit process in a way that suits them?

Eureka! MySpace!

There are hundreds, possibly thousands of New Zealand bands using MySpace as a platform for promoting their music. They update fans about album releases, tours, changes in band members. For the Legal Deposit Office, creating a MySpace profile meant we could communicate with bands in an informal way and allow us to easily receive information about the goings-on of every band on our friends list.

We launched the profile in conjunction with a nation-wide campaign to educate independent music producers about their obligation to the legal deposit process. The Be Heard. Forever campaign was launched in May to coincide with New Zealand Music Month and received nationwide coverage, an interview with RadioActive, a Legal Deposit feature on Music 101, and information was placed on music sites like APRA and NZ On Air.

The campaign was a collaborative effort between staff in the Legal Deposit Office, the Communications Department and the Web Team. Our MySpace profile includes information about the legal deposit process, multiple links back to relevant information on the National Library website, and blogs detailing recent legal deposit submissions.

The most obvious advantage to using a social networking site to communicate our message is the level of personal interaction we have with individual groups. Engaging with bands in this space helps to give the Legal Deposit Office a more human face. It creates a very direct line of communication, from the band to a real-life librarian: someone who is passionate about New Zealand's cultural heritage and adamant that all New Zealand music producers contribute to it.

Are you thinking about how or if you should populate the Web 2.0 space? Here are a few tips and lessons learned:

  • Find your focus - The last thing you should do is commit energy and resources to setting up a Web 2.0 presence for NO reason whatsoever. Always have a target audience and a clear message. For our MySpace profile, we targeted a very specific group with a very specific message: Independent New Zealand music producers need to know about the Legal Deposit process and understand the importance of their contribution to the New Zealand's cultural heritage. Doesn't get much more specific than that!
  • Find the right Web 2.0 space - One size does not fit all. Conduct research to find which Web 2.0 applications work best for your purpose. Before choosing the MySpace website, we considered other social networking sites such as Bebo and Facebook. While Bebo is clearly the favourite among young New Zealanders, we found that New Zealand musicians choose MySpace for promoting their music.
  • Find support from peers - Never work in isolation. Find people in your library who share your enthusiasm and collaborate with them! Seek out staff members with skills you don't have that might come in handy when creating your Web 2.0 presence (Web managers, Communications advisors, Web 2.0 enthusiasts).
  • Ride on coattails - If possible, associate the launch of your Web 2.0 presence in conjunction with something bigger than yourself. If you have a new library blog that includes reviews of recently published books by staff librarians, perhaps you could align the launch of the blog with New Zealand Book Month. A large part of the success of our Be Heard. Forever campaign was due to the alignment of the campaign with New Zealand Music Month. Newspapers, radio stations and music-related websites were abuzz with music news and were all the more willing to embrace our campaign and publicise it.
  • Don't underestimate the work involved - Social networking sites are just that: SOCIAL. There is a lot of time and energy spent communicating with people. For it to work you have to be an active participant. There are messages to send, emails to be checked, friends to make, profiles to maintain, comments to approve, friend requests to accept. The list goes on.

Monday, October 8, 2007

The Long(itude) and the short on maps

First up, a confession. I like maps.

There, I've said it. But don't change the channel just yet, this just might get interesting. Maps have been long, some might say unjustly, relegated to the too hard basket in libraries, a point driven home by Waikato University's Map Librarian John Robson in a presentation at LIANZA titled Shhh! - if we don't mention maps, they might go away. I'm not going to go into the rights & wrongs, he said/she said of that here, but rather try and articulate my thinking around using maps and geographic data as another entry point into our digital collections.

TAPUHI, our database of unpublished material in the Alexander Turnbull Library, has authority records for places and geographic features. If we can plot this data on a map, or series of maps, we should be able to come up with a nice visual, geographic browse for all of the images on Timeframes. As I see it (and please correct me if I'm mistaken), there are 2 ways we can do this

  • the kinda hard & clunky way &
  • the slightly less hard more inclusive, social way.

Catchy, huh? I'm not asking you to choose, though, as I'm not proposing that one way is better than the other. I am in fact proposing that we do both!

One Way: The kinda hard & clunky

How do we get geodata into, or out of the records that exist in Timeframes? It just so happens that Land Information New Zealand publish a downloadable dataset of delimited text tables which have been derived from the New Zealand Geographic Place Names database. This dataset contains information such as Place Name, Northing & Easting from the Universal Transverse Mercator coordinate system , Land District Code, and Latitude and Longitude.

We could do this by matching the place name data from the LINZ dataset with the Turnbull Library's Geographic Name Authority list (whose data has come from the LINZ database/gazetteer) and we should get Latitude & Longitude data for place names which we can then add to our metadata so we can start representing our digital images on a map. Simple, huh? This actually isn't that hard, it just involves a bit of data crunching, and then adding the latitude & longitude to the metadata. Then building the map.

Another Way: The slightly less hard more inclusive, social way

Let the people help! There are a fair few sites around that enable users to add data to their 'own' maps online. For example Zoomin, Yahoo via Flickr and of course, Google Maps. Anyone can create their own maps, and just to prove it, here's one I made earlier adding some Timeframes images to a Google map of Ngaio.

We don't need to harvest the source of these pages to get all the geospatial data. Along with the links to print/save etc on the map there is an intriguing option KML, which just happens to stand for Keyhole Markup Language, an XML-based language for managing the display of three-dimensional geospatial data in the programs Google Earth, Google Maps,Google Mobile and WorldWind. In other words, Latitude and Longitude and stuff.

Example:


<kml xmlns="http://earth.google.com/kml/2.0">
<Placemark>
<description>New York City</description>
<name>New York City</name>
<Point>
<coordinates>-74.006393,40.714172,0</coordinates>
</Point>
</Placemark>
</kml>

The really interesting to think about is the option of having people create their own maps & sending us the KML, en-masse, rather than us trying to geocode the images from the place authority (which would result in hundreds of images stuck on one spot in Ngaio, for instance). Apparently some folk are already making such maps IRL so there is a place to start from - it just needs some thought and co-ordination.

Things to think about:

In general, the metadata attached to an image on Timeframes records the location of the view, not where the photo was taken from. If we are geotagging where the photo was taken, can we/should we not also get data relating to the view? If we could search on views of specific objects eg Mt Taranaki, as opposed to where the photo was taken from eg
Stratford, it may be better for search/retrieval of accurate results.

This may also be too hard, or too labour intensive. I'd be interested to hear what you think.

Interesting things are being done with geographic information. Rheinhold Behringer has created an EXIF Editor/GeoTagging tool for Windows. This integrates with Google Maps for geotagging (scroll down near the bottom of the page) & is also capturing the field of view of the camera, based on parameters stored in the EXIF data about the camera chip and the focal length. In addition to the object coordinates, the camera viewing direction is computed and stored in the EXIF data explicitly.

He does note, however:

Many programs and website support the geo-tagging of the camera position. However, it is not known if any software supports the encoding of the object location. Therefore, this feature must be considered experimental and unique to this particular EXIF GeoTagger software.
Definitely some potential there, but strictly for images taken with digital cameras, not much help for our digitised photos. But, as more and more of our collections will be made up of born digital images, this is definitely worth keeping an eye on.

This has been a bit of a ramble, and I'm still trying to solidify my thoughts - there's so much going on. With the expansion & development of map APIs, there are more and more ways we can provide access to our collections. Now you can even map a plate of pasta!

Has anyone else out there done some thinking on this that could help me out? Your comments and suggestions are most welcome.

Friday, October 5, 2007

The Source: news about digital libraries and library innovations from around the web

Introducing The Source

Blogging and Libraries (An evaluation and impact of social media On libraries) (Note: PDF)

From Stephen Abram's blog, Stephen's Lighthouse


In the new digital age of computers, blackberries and Internet access the rise of blogging has been phenomenal. It has been reported that there are as many as 15.5 million active blogs online, although this seems to be now peaking. In this dissertation the author discusses what impact this new media is having on libraries (if at all), and why people actually blog (or do not).


AskNow Instant Messaging: innovation in virtual reference (Note: PDF)

From the National Library of Australia website

National and State Libraries Australasia’s (NSLA) AskNow virtual reference service has been in operation for over four years. It has proved to be a successful service and continues to receive an average of 2000 enquiries each month. In November 2006, the National Library of Australia (NLA) recognised an opportunity to launch a trial IM service which would coincide with the redevelopment of the AskNow website. What follows is an analysis of the key findings from the first phases of the trial and the issues encountered.


Guide to Online Participation (Note: PDF)

From the E-government in New Zealand website

E-government is about putting people first. It’s not just about technology – it’s about making government work for everyone. People who are affected by public policy and services are in a good position to help improve them. Online participation is one way of seeking their views. It also responds to people’s expectations that they will be more involved in the design and delivery of policy and services. The 'Guide to Online Participation' will help State servants navigate the complex and exciting terrain of technology-enabled public engagement and is meant as a first step in an evolving area of theory and practice.

Wednesday, October 3, 2007

Buzz Monitoring: Using Google Alerts to track the use of your digital products

Call me a member of the late majority if you will, but over the last few weeks I've been experimenting with using Google Alerts through my feedreader to track blog mentions of some of our digital products (including Papers Past and the Transactions and Proceedings of the Royal Society, this LibraryTechNZ blog and our Create Readers blog) .

[If you haven't tried this, go to Google Blog Search and enter your search term. At the end of the results page are three options: create an email alert, add a search widget to your Google homepage, or add a search feed to your Google Reader.]

Most days I'm getting around a dozen alerts. It's interesting seeing where mentions pop up. Some are what I'd expect - for example, the Auckland University Library History blog, Paul Reynolds and the Family Tree Forum (a genealogy discussion board) noting the relaunch of Papers Past with improved search functionality. Some are less so, like this mention of the site on the Furman University Libraries blog.

And some, in the words of an ex-manager of mine, surprise and delight me. Like articles from the Transactions and Proceedings being cited in discussions of global warming. The people who just love the Royal Society pictures. Or, my No. 1 favourite, Papers Past surfacing on John Adcock's Yesterday's Papers, in a post about Deadwood Dick.

It's not a comprehensive answer to the challenge of tracking what people are saying about your digital offerings, or how they're being used. And Google Alerts do seem to pull up old content (posts from August 2005???) . But it's time efficient, once you've selected your search terms. And it provides a window into the world of those users who don't love/hate you enough to get in touch directly.

Here's a list (via Marketing Pilgrim) of 26 free tools for monitoring buzz. Got another one? Add it in the comments .....

Virtually Ideal

One of the noble, iconic, pastimes of the internets is the opportunity it gives us to laugh at the media and creations of previous generations. Hence the transplantation from the realm of medicine to humour, of ads for Thorazine ("for prompt control of senile agitation!"), or insta-classic slogans like "More doctors smoke Camels than any other cigarettes", and other gems.

The subsequent understanding that develops over time around a subject, can allow us to re-interpret information we were once more prepared to accept at face value, and see that information in a more informed light. Matured understanding is to us the clearer vision revealing the Emperor's New Clothes, and this understanding can similarly leave us reaching for the eye-bleach, by giving us insights we wish we'd had forewarning of.

In this vein, consider our past and current expressed feelings around such wonder-products as Asbestos, 2-4-5-T, DDT, lead-based facepaint, or Television. TV is a particularly interesting example, because one of the many ways it became embedded in our culture was the promise it had as an educative tool, and this particular use was one of the key marketing points behind the indefatigable push the TV made into our living-rooms, bedrooms, studies, luxury cars, arm-rests, bathrooms, fish'n'chip shops, sports bars, and other great areas of intellectual dissemination.


Who, reading this, now watches telly for their intellectual nourishment? Every once in a rare while, we get a great BBC series like “Civilisation” by Kenneth Clarke or “The Planet”, that fulfills this original promise. However, for the other 99% of the time, there’s nothing on but the trash used to decorate prime advertisement timezones, and people watch it nonetheless.

Television was often shown in newsreels typically being discussed by either a guy in a white labcoat, who we took seriously because he was standing beside a bench with test-tubes on it, or by an authoritative man in a tweed suit, palpably reeking of higher education and Camel cigarettes. These patriarchal figures representative of the high state of human progress would cite fantastic examples "...and so, if you want to learn about Plato's Symposium, you could just turn on your tele-vision and watch a program about it being broadcast.", and the intense technological lust would spin our heads with the potential of the device. I'm still waiting for a decent broadcast on Plato's Symposium, btw.

Even the internet has been around long enough for us to start seeing it in a less idealistic light than when it wormed its way down our telephone lines at 300 baud. Yes, we use it for a great range of useful, practical, and enlightening things, but how much of that bandwidth is porn, useful, practical, enlightening, or otherwise? The point of this, is that there can be a disparity that becomes evident over periods of time, between initial intentions and subsequent realities.

Where is this Abe Simpson ramble going?

The next obvious wonder-technology (relating to what I do), that I think will show a similar disparity over time, is virtualisation. Virtualisation is the promising silver bullet to the technologist tasked with maintaining a digital information system across a large period of time, in that it can theoretically prevent the information archive from having to become a technology archive in order to remain an information archive. Over the course of the next year, the two major CPU vendors, Intel and AMD, will be making a big deal about their new hardware-based virtualisation models, dubbed Intel VT and AMD-V (they really burnt the creative midnight oil with those names, you can tell). Even the existence of these technologies shows the perceived importance of virtualisation to tomorrows computing world, and the respective manufacturers’ intended inference is the importance of choosing the "right" implementation, which is of course their own.

To illustrate the allure of virtualisation technology: If, in 2017, I have some published resource that only ran on Windows 95 that I have to make accessible, but I can't buy/hire/borrow/steal any hardware old enough to be compatible with Windows 95 and the resource in question, then the possibility of running an instance of Windows 95 on top of virtual compatible hardware has immense appeal to me. The capacity for simulation by digital information systems is unparalleled and self-evident. With digital simulation we can examine and re-examine any information we have suitably represented, in a consistent manner, allowing us to slow it down to the bite-sized pieces necessary for fruitful human interpretation.

So, in this way, I can simulate a computer presenting parameters acceptable to Windows 95, which means I can theoretically install Win95, the required supporting app/s for the digital object, and thereby make accessible the published resource I wanted. This is a concept with immense potential, and has been promoted favourably by very smart people with lots of extra letters after their names - Cornell University are heavily behind it as a key technique in digital preservation management, to name but one group of proponents.

Therefore, I feel somewhat of a grinch in saying that the archivists’ hopeful faith in this solution is misplaced. It is true that it would allow us to extend the effective access period to a given form of a digital resource - there is a huge degree of complexity supporting any digital thing, and the ability to virtualise some of that complexity is a way to apply a technological shorthand to a very, very, long chain of logic and knowledge. This is why virtualisation has emotional appeal (ok, a cold, geek, emotional appeal) - it is the implied promise of not having to get your head around that entire chain of supporting complexity.

The unconcious notion of computers as digital systems in the abstract sense, pure, precise, and clean-room ISO9001 compliant, under diffuse lighting in a white room, is a total defiance of the reality of short-lived electrolytic capacitors, mechanical storage devices dependent on aerodynamic principles in order not to "crash", vendors fighting each other with standards to sequester themselves a captive consumer base...anyone who hopes that virtualisation will give us back an echo of the willing, archive-friendly computers science-fiction promised us is clearly smoking something much more potent than Camel.

Then what's the problem? Why can't they smoke what they want?

The problem is this: Ordinarily, in the non-virtualised world, to support a digital object you start by looking at what it needs to run - does it need a specific application to run? Does that application have specific system requirements? Can those system requirements be met by the supporting hardware/software platform? Is that hardware/software platform a commodity item that can be accessed as required?

In a virtual environment, in which, lets assume that all of these requirements can be met with a virtualisation product of some sort, all we have done is make the virtualisation platform an extra part of the complexity supporting the digital object. This means that we still have an initial set of supporting conditions to meet, but they are the conditions allowing the virtual environment to exist. Then we have the secondary set of supporting requirements to be met by the simulated environment. All we have done is create the ability to transplant a layer of legacy dependencies into the virtual layer, and created the need to support them with contemporary dependencies. Net complexity increases. And then, once you get to the next major technology refresh, a digital generation later, it has the potential to get really recursive - do we need to virtualise the environment supporting the virtual environment to maintain the data?

Virtualisation is generally invoked when no other options exist – when data or its dependencies have proven non-portable in the real world. What will happen to our Windows 95 virtual machine if the physical machine underneath it reaches the point of replacement - do we end up with a host running an emulator running an emulation of an emulated environment? Extrapolate that model out ten generations of digital system refresh cycles....ye gods, that gets ugly. Or do we instead re-invent ways to virtualise a target OS one layer above successively newer platforms, requiring a dedicated VM development cycle to explicitly accompany our technology refresh cycle? What a pain.

With a virtual machine, we are still dependent on the goodwill and whim of a vendor or developer to go in a direction that suits us - and we work with the hope that their intentions will correlate with our needs. The lack of standards around virtualisation mean that there is still an interdependence between a specific application or kernel (ie VMWare or XenSource, Intel VT or AMD-V hardware) and a corresponding set of data representing the logic in the virtual object (like a .VMDX file for vmware, or a xen-aware OS installation on a partition, or an Intel VT/AMD-V compatible technology stack).

Now the key reason I'm not shouting optimistically about how a drive for virtualisation standards can solve these problems, is that in nearly every prior example in human inventive history, success within an innovative field is most widely obtained by those that most efficiently benefit from it economically. I'm stung by the time I believed in the wonder of asbestos. That 2-4-5-T sure did get rid of the mosquitoes at the beach, just like the ad said it would. I saw a smart guy on the newsreel telling me that I could gain profoundly efficient learning capacities by watching TV.

Now, I've read quite a few digital archive authorities say that virtualisation is the answer to the short and fragile lives of digital objects. I'm inclined to disbelieve them. Virtualisation is a fantastic tool to a geek like me, it's a highly useful thing to have for some of the stuff I do (making software images, application re-packaging), but it shouldn't in its current form be viewed as the tool of the digital archivist. Until we learn to make self-descriptive digital objects, independent of applications, OS, and hardware - digital objects will keep disappearing from the collective digital memory. What is required is a deeper look at the way people create and store digital things, not another layer of digital things.

Monday, October 1, 2007

Beginner's guide to beginning: upskilling in the digital environment

When I joined the Library in January 2006, I was hardly web-savvy. After getting involved in the project to redevelop the Library's main site, www.natlib.govt.nz, I decided I needed to upskill myself pretty promptly. In true geek form, I set myself the challenge of learning something new about the web everyday, and blogging it so that it stuck in my head.

Posts like this one by Ryan Deschamps, Under the Hood of Web 2.0: the top ten programming concepts for librarians to understand, have been invaluable to me. Here, Deschamps gives a brief introduction to:

  • Object-Oriented Programming
  • Client-Side Scripting
  • Relational Database
  • Server-Side Scripting
  • Http Protocol
  • Open source Software
  • The Document Object Model
  • Encryption and Digital Rights Management
  • Platforms
  • Stylesheets

Now that I'm a bit more up to speed, and have found the areas that I'm interested in, I've given up the daily blogging in favour of my trusty feedreader and ever-expanding Ma.gnolia bookmarks. Here are some of the sites I follow, broken down by area:

Writing for the web

Contented NZ's own Rachael McAlpine
Copyblogger
Problogger both this & Copyblogger have a strong focus on tips that will help monetise your blog/site, but the advice stands even if you're not looking to leave you day job and become a full-time blogger (umm).

Online communities, user behaviour & usability

Creating Passionate Users (now sadly defunct, but worth going back over)
Common Craft (makers of the now-famous paper videos)
The Future of Communities
Online Community Report These latter two are not so regularly updated, but the posts are often enlightening

General goings-on

TechCrunch Multiple daily updates of new products, buy-outs, people news and more
ReadWriteWeb Great source for daily reviews of programmes and products
Fresh & New(er) From the web team at the Powerhouse Museum, Sydney

Web development

These three magazines-style sites are regularly updated with articles on everything from designing for the iPhone to writing good alt text

A List Apart
Vitamin
Boxes and Arrows