Repository deposit from the desktop

Thinking about repository workflows for staff – put a deposit client where their resources live, on their desktop…

What I have:

A (slightly unwieldy) set of files comprising:

Quick drop file set

How it works:

The VB script was written by Boyd Duffy at Keele University and, as a non-developer, I know only that I need to edit  sword_deposit.vbs with my SWORD DEPOSIT_TARGET. It’s then simply* a matter of dragging and dropping a file (or multiple files) onto the VBS icon for them to be uploaded into the repository (workflow can obviously be configured in the repository itself, to be published immediately**, for example, or, more likely, go into a workflow where metadata can be added according to a particular Application Profile).

** I think Keele use it as a quick and dirty method for image files to be transferred from desktop to repository from where they can be immediately accessed via a VLE PowerLink.

Here is a screen capture that I did a while ago: http://www.leedsmet.ac.uk/inn/repository/video/SWORD_drop_from_desktop/

* Re simple – I can, in fact, only make it work from a Leeds Met IP!  Perhaps something to do with PROXY_HOST / wireless?

What I need:

METADATA of course!

The current tool is of limited use as it just pushes a file into the repository. In fact, it will quite happily push a Content Package – a Zip comprising a file and some metadata as XML – either an IMSMANIFEST (which I would need for intraLibrary) or METS for DSpace (i.e. Jorum.)

Though I don’t have the skills myself, I’m hoping someone can tell me how we might develop a desktop app to integrate a way of capturing the metadata associated with a resource, converting it into an IMSMANIFEST and/or METS, zipping the whole lot up and pushing it to a repository (or multiple repositories) via SWORD …

If we were to use our current ukoer AP we would need to capture:

  • Title
  • Description
  • (Uncontrolled) Keyword(s)
  • Author / owner / contributor
  • Date
  • Type of resource
  • Technical format
  • Licence information
  • Subject classification (HEA and JACS)

Click link below for an example IMSCP:

http://repository-intralibrary.leedsmet.ac.uk/IntraLibrary?command=open-package-download&learning_object_key=i3605n162666t.zip

Or link below for METS (with cut-down metadata); this package has been successfully deposited in Jorum (dev) via SWORD:

http://repository-intralibrary.leedsmet.ac.uk/IntraLibrary?command=open-preview&learning_object_key=i3128n92902t

N.B. A practical issue with this approach might be including such an application on an institutional staff build and I have heard rumours that it might be possible to achieve similar drag and drop functionality with a web-based app using HTML5 – browser support still inconsistent though I think.

Learning How to Play Nicely: Repositories and CRIS

My report on the “Learning How to Play Nicely: Repositories and CRIS” event organised by the Welsh Repository Network and supported by JISC and ARMA back in May has now been published:

http://www.ariadne.ac.uk/issue64/wrn-repos-2010-05-rpt/

JIF10 at Royal Holloway

I’m not sure I was fully aware of the memory until I wandered past the Earth Sciences Building at Royal Holloway last Wednesday that it was in that very building that I first experienced the internet in 1994. I was a feckless undergraduate with little interest in technology having recently grown out of side-scrolling shoot ’em ups on my Commodore Amiga.  I had a scientist friend, however, a Geologist, who seemed excited by it and took us into the lab to surf the nascent world wide web.  I have no idea what spec of machinery we would have been using at that time.  Or browser.  Would it have been Mosaic back then or did Netscape come on the scene fairly quickly?  Anyway, I don’t think I was terribly impressed and soon wandered off for a game of pool and did not, so far as I recall, re-visit cyberspace for the rest of my university career.  At some point in the intervening years, of course, along with several billion others, I did venture back onto the Web (it’s got much better) until I found myself back at RHUL with JISC for their Innovation Forum 2010; on the down-side I no longer have much time for pool.

I arrived bright and early on Wednesday morning to set up my exhibition stand and was particularly looking forward to speaking with Alison Pope who had invited me to look at Equella which Royal Holloway have recently implemented to manage the full range of digital resources including OA research and learning objects – a set of use-cases not too dissimilar to Leeds Met.  It is interesting that, like intraLibrary, Equella was originally designed as a Learning Object repository and, like intraLibrary, this heritage is evident in Equella being an authenticated system (does this, perhaps, reflect how quickly the zeitgeist has now shifted towards open educational content not just OA research?)  We get round the issue, of course, with our Open Search interface utilising SRU, whereas both RHUL and Oxford Brookes automatically log visitors into Equella as guest-users.

I had several conversations throughout the conference around the nature of “blended” repositories like ours; although institutional repositories increasingly store different types of resources and may have an implicit remit to manage the full range of digital material, it’s still far from the norm, I think, for them to be explicitly tailored to multiple collection types and browsing OpenDoar, under “content”, the majority still list “Articles; References; Conferences; Theses; Books” though some do include “Learning Objects”, “Multimedia” or “Special” (how many of these are aspirational rather than fully operational services?)  To my knowledge, in addition to ourselves and Royal Holloway, Oxford Brookes are developing a blended repository with their RADAR project (also using Equella after shelving intraLibrary).  Coventry have also implemented Equella for their CURVE project though I think currently for research material only.

(N.B.  The EPrints crowd are coming at the OER from the other direction with Edshare and HumBox, bespoke implementations of the de facto OA research platform.)

The first session I attended was Thunderbolt and Lightning, an Open-Mic session run by the inimitable @dfflanders to share the “big thunderbolts” in institutions today (all sessions from JIF10 are summarised on Cloudworks and there is a Twitter archive at http://twapperkeeper.com/hashtag/jif10).  The great thing about JISC events is the different styles of discussion that are encouraged and while, personally, I find the lightening talk format exhausting (and that’s just listening) it’s certainly an effective way to rapidly share a lot of information and generate (preliminary) discussion.  Lightening talks included Rufus Pollack from the Open Knowldge Foundation and Cambridge University on open data and content, Steve Hitchcock from Southampton on why it’s still too hard for people to put things in repositories and Sarah Porter on the fact that we need a collaborative university network, not a competitive one.  For a live blog of the session see http://cloudworks.ac.uk/cloud/view/3408.

I missed the first afternoon session as I was having a guided tour of Equella after which I had half an hour to kill so took a trip down memory lane back to Kingswood, the halls of residence where I stayed in my first year before rejoining my fellow delegates for a panel discussion “The Impact of Open: transformational possibilities for education and research”.  The session was chaired by Craig Wentworth (JISC Programme Director – Organisation and User Technologies) and the panel were Rufus PollockRoss GardlerHelen Beetham, Sal Cooke and Hugh Look.  See http://cloudworks.ac.uk/cloud/view/4183 for a blog-post from the session.

The evening was given over to dinner and (for some) dancing – an opportunity for social and professional relationships to be cemented before rest and another packed schedule on day 2…which began with a search for bacon and eggs, as hapless JISC folk (one or two a little green around the gills – possibly correlated with those that had been dancing but this is just a hypothesis) finding themselves in a breakfast hubbub of Italian teenagers before being herded downstairs by events staff to our own more sedate dining environment.

I had intended to visit as many of the project showcases and demos, some of which I was already aware of and some which were new to me.  In the end, however, my day was rather more tangential and I never got round to seeing all sorts of interesting stuff that I shall at some point try to follow up online, in particular, One Share (University of Southampton) and Xpert (University of Nottingham) which though I missed the demo, I have, in fact, liaised with already in the context of RSS aggregation for ukoer (see http://blogs.cetis.ac.uk/lmc/2009/12/09/oer-rss-and-jorumopen/).  I was also able to speak with Pat Lockley from the project who told me that Xpert can now harvest OAI-PMH as well as RSS – after a little bit of hacking this week on Pat’s part, all Unicycle material has now been harvested from the Leeds Met repository into Xpert (will try to blog this in more detail at some point).

I did manage to make it to the MIMAS stand and speak with Amanda Hill of the Names Project and to man named Vic about the Intute repository search demonstrator which I was interested to learn will be developed further (last I heard development was suspended.)  Having not been to the site for quite a while, it was gratifying to see that searching for “ukoer” returns 146 results from just 3 repositories and 144 of them are from Leeds Met (though there are in fact 279 resources in our repository tagged ukoer so I’m not sure what’s happened to the other 135 – perhaps we haven’t been harvested for a while?).  The other small issue from my perspective is that research results are not always displayed properly which is an artifact of the fact that intraLibrary is based on IEEE LOM and I’ve used multiple description fields to accommodate ISSN/published/refereed.

I really hope the Names project continues to make progress and I would urge the community to get behind this project which has developed a prototype to tackle the thorny problem of name disambiguation – an issue in all sorts of online domains, especially repositories.  There is a blog of the session that I missed at http://cloudworks.ac.uk/cloud/view/4243 and you can see the prototype and search for names (harvested from Scopus I think) at http://names.mimas.ac.uk/

In the afternoon there’s a rather jolly Dragon’s Den style session where 3 presenters pitch their innovative ideas to a panel of JISC Dragons who opt, 3 to 1, for Mark “bearded Dave” Stiles’ (call me Dave) Cameron inspired Big Education idea, the audience, however, disagree (this is a democracy after all) and it’s a land slide for Yorick Wilks’ possibly-cuddly-artificially-intelligent research companion to help the academic drudge sift through the crushing weight of online information.  Dave the Librarian’s idea (something about libraries being an interchange where learning and teaching takes place) gets limited support from either the dragons or the audience – all seemingly persuaded by the ConDems that libraries aren’t really necessary at all in the broadband age – and he is dismissed by one of the dragons (clearly relishing her role) because libraries are just “glorified coffee shops”… ‘we just need a virtual library”…”people only go to libraries because the wifi is free.”

The final keynote is delivered by Professor John Potter on the subject of leadership and which causes one or two heckles to rise – personally I’m inclined to agree with @ambrouk who found sentiments like “give your people a sense of purpose”, and “people copy what they see” somewhat patronising and argued on her blog that, rather, the world of HE is a “world of intelligent people who don’t need leading, they just need managing, as in, given the time, space and mandate to do things.”

That is certainly true of many of the colleagues I engaged with over two enjoyable days – people, in fact, that “work” as they play, sharing information and ideas.  Innovating.

JISC Innovation Forum 2010 (JIF2010)

I’m exhibiting at JIF2010 next week at Royal Holloway University of London and looking forward to it, not least for reasons of nostalgia as I graduated from RHUL in 1996 (BA English Lit) and haven’t really been back in over a decade.

If I’m honest, I wasn’t the most conscientious of students, exhibiting at the time a degree of post-adolescent rebellion and fancying myself as some sort of undergraduate libertine (if we re-define libertine as someone who stays in bed all day and drinks a bit too much beer).

While it’s all too easy to cast a repository manager as Byronic Hero (revolt against society, pursuit of individual goals, romantic expression and the constant experience of strong emotion)  I now get up early and have worked considerably harder over the past 3 years (nearly) on our various JISC projects at Leeds Met than for my degree and arguably know more about Open Access and repositories than I ever did about the canon of English Literature – there’s a lesson here I think – if you don’t work hard at University you might end up as an (un-Byronic) repository manager!

Our JISC projects, in chronological order, are the Repository Start-up itself, Streamline, PERSoNA, Unicycle and Bibliosight and Dawn has put together a fantastic poster for my exhibition stand featuring a time-line and showing how these 5 projects relate to one another – it also includes our current ACErep project which is funded by HEFCE rather than JISC but which very much builds on our JISC supported projects (click for larger image):

Repositories: Back to the Future

The annual Intrallect conference will be held on 28th-29th April 2009 at the Scottish Storytelling centre in Edinburgh.  I attended the event last year which was excellent and will be of interest to repository folk from accross the ukoer spectrum – not just Intrallect’s customers.

“Repositories?  Where we’re going we don’t

need repositories!”

The “Back to the Future” theme provides scope for considering how we got to where we are today, could we have done things differently, what have we done well, what would we never want to repeat? How should the future be shaped? Will it be driven by technology or by educational need?

For more information see http://www.intrallect.com/index.php/intrallect/news_events/events

UKCoRR meeting

I wasn’t able to attend the UKCoRR meeting held in Kingston on Friday, as much as I would have liked to.  It sounds like I missed out on a really good day with an excellent programme.

A thorough summary and all the presentations from the day are available from the UKCoRR website:

http://www.ukcorr.org/events/aug2009-event.php

In addition, there is a summary on the UKCoRR blog:

http://ukcorr.blogspot.com/2009/08/after-our-meeting.html

I was particularly interested in Theo Andrews’ presentation on Central Funds for Open Access and ensuing discussion around institutionally designated funds for OA – both Gold and Green routes.  I hope UKCoRR don’t mind me reproducing some of the issues discussed here:

1) Concern about the costs: these might escalate, and sometimes amount to “double dipping” (some publishers are paid by authors and subscribers because they charge authors for OA article publication but don’t reduce their subscription fees).
2) Publishers who are aware of funder mandates for OA within 6 months, might introduce 12 month embargoes on post-print availability in OA repositories, in order to force authors to pay for OA publishing of the final version or miss their funder’s mandate. (NB the point here is that funders are paying, as authors can claim such costs from funders. But we’re all struggling to set up mechanisms by which this can be done – see Theo’s presentation for a summary of the issues.)
3) An institutional response might be to set up an OA fund, or it might be to encourage authors to deposit post-prints into the OA repository, rather than paying such publishers’ fees. Some researchers object to the fees being charged.
4) The Wellcome Trust does seem to prefer that the authors pay for OA publication, and indeed it suits authors better than depositing themselves because a part of the Wellcome mandate is for PubMed deposit. By paying, authors can leave the PubMed deposit up to the publishers to do. Is the Wellcome Trust’s mandate skewing the OA landscape in the way publishers have responded to them, whilst other academic disciplines are no way near as well funded?

The inimitable @llordllama has also posted summaries of the day on the UoL Library blog:

http://uollibraryblog.wordpress.com/2009/08/18/ukcorr-summer-2009-meeting-pt-1/

http://uollibraryblog.wordpress.com/2009/08/18/ukcorr-summer-2009-meeting-pt-2/

On the strength of this I’m certainly looking forward to attending future UKCoRR events – maybe even oop North next time?!

Edinburgh repository fringe – links

I wasn’t able to attend the Edinburgh repository fringe but have been scouring the blogosphere for info – unlike the real Edinburgh fringe there doesn’t seem to be much mainstream media coverage, not even a late night slot on Channel 4.

I haven’t found masses beyond the main website and the event wiki – http://wiki.repositoryfringe.org/index.php/Main_Page – though may be I’m jumping the gun a little.

The best live blogging seems to be on the jisc-datashare blog

Posts are tagged #repofringe09:  http://jisc-datashare.blogspot.com/search/label/%23repofringe09

The only other posts I’ve found so far are:

Lorna’s JISC CETIS bloghttp://blogs.cetis.ac.uk/lmc/2009/07/31/repository-fringe-2009/

And

The Open Knowledge Foundation Bloghttp://blog.okfn.org/2009/07/31/open-data-session-at-repository-fringe-2009/

For pics there is also a flickr group at http://www.flickr.com/groups/repofringe09/