Implementing the Symplectic API

We’ve made real progress implementing the Symplectic API which I hope will help motivate academic staff to update and maintain their Symplectic profile and, who knows, perhaps even encourage them to upload full-text to the repository! Kudos to web-developer Mike Taylor who has done all the clever stuff (though this summary reflects my understanding so may contain errors!)

As can be seen in the screen-shot below, Mike has been able to submit a query to the API (using Leeds Met username as a parameter) and differentially parse the resulting XML by publication type including, where available, links to DOI and full-text in the repository (currently labelled as Public URL). Symplectic also has the option to “favourite” records which is flagged in the XML and which we’ve use to identify “Selected publications” in order to give academics greater control over their profile (there is also a “make invisible” option to prevent specific records being exposed from the API.)

The next step will be to liaise with the corporate web-team to explore how the feed can be embedded in the institutional CMS. We’ve already picked a few brains and it shouldn’t be too difficult though there are still one or two technical issues including how best to submit a query; we wouldn’t want to use username as that would be a privacy issue and the preference would be email address though this will require a layer of translation from email address (which isn’t searchable)to either Leeds Metropolitan username or Symplectic internal user id. In addition, the API isn’t designed to be hammered dynamically so results need to be cached so there are questions how best to refresh that cache to reflect changes that academics may wish to make to their profile.

Advertisement

An institutional tangram – musings on developing an integrated research management system

“The tangram (Chinese: 七巧板; pinyin: qī qiǎo bǎn; literally “seven boards of skill”) is a dissection puzzle consisting of seven flat shapes, called tans, which are put together to form shapes. The objective of the puzzle is to form a specific shape (given only an outline or silhouette) using all seven pieces, which may not overlap.”

http://en.wikipedia.org/wiki/Tangram

Having implemented an institutional repository at Leeds Metropolitan and learning by experience some of the difficulties associated with advocacy around the use of that repository (both for OA research and OER) I have become all too aware “that repositories are ‘lonely and isolated’; still very much under-used and not sufficiently linked to other university systems”. So said JISC’s Andy McGregor at an event called “Learning How to Play Nicely: Repositories and CRIS” in May 2010 at Leeds Metropolitan (see my report for Ariadne here). This quote is still relevant, though  perhaps a little less so than when I heard it nearly 2 years ago, thanks to the ongoing work of JISC and particularly the RSP. In any case, the event was a revelation for me and I have coveted a so called Current Research Information Management systems (or CRIS for short) ever since!

And now, in Symplectic Elements, I have one…or at least the components of one (click on image for full size.)

The finished tangram? (click on image for full size)

It’s a puzzle though. A tangram if you will…one with considerably more than seven pieces:

intraLibrary, Symplectic, institutional website, University Research Office (URO), faculty research administrators, The Research Excellence Framework (REF), academic staff, web-developers, bibliographic information, research outputs, Open Educational Resources (OER)…

In fact, this may well not be all the pieces…pretty sure a few have been pushed down the back of the settee. I’ll look for them later.

Anyway, tortured metaphors aside, I have become increasingly aware that working in a large institution, in a role that encompasses technology and institutional policy (though I’m not, by any means, a policy maker…or indeed a real techie) is largely about communication and getting the right people, with the right skills, in the right place at the right time! Absorb policy and technical requirements from senior stakeholders and communicate those requirements to the proper techies – while also trying to ensure any motivating passions of one’s own don’t get lost along the way – Open Access to research and Open Education in my case.

For various reasons, individual user accounts have never been implemented for our repository and historically it has been administered centrally from the Library. In Symplectic we now have a system that is populated with central HR data; all staff will have an account they can access with their standard user name and password from where they can manage their own research profile including uploading full-text outputs directly to the repository*. In addition, administration by the University Research Office and faculty research administrators will be more easily centralised (particularly for the REF).

* In actual fact this functionality is not yet available in lieu of development work from Intrallect to capture the Atom feed from Symplectic and transform with XSLT to a suitable format for intraLibrary. I think.

One of the clever bits of functionality used to sell the software is automatic retrieval of bibliographic data from online citation databases – we are currently running against various APIs, Web of Science (lite), PubMed and arXiv – but I think this may actually be a bit of a red-herring for an institution like Leeds Metropolitan – at least until more (preferably free) data sources are available (JournalToCs API please!); early testing has shown, at best, it will only retrieve a subset of (the types of) outputs that we will need to record and it will be necessary to manually import existing records (e.g. EndNote) as well as implementing other administrative procedures at faculty level to capture information at the point of publication, especially for book-items, monographs, conference material, reports and grey literature.

More important, I think, to ensure that academic staff actually engage with the software rather than just seeing it as a tool for administrators, is to re-use the data to generate a list of research outputs – a dynamic bibliography – on a personal web-profile which has the potential to dramatically increase the visibility of research including Open Access to full-text.

Developing staff profiles of this type has been something of an obsession of mine for a while; we explored doing so from the repository (using SRU and email address as a Unique Identifier) and did develop a working prototype. Symplectic, however, integrated with central HR data and with its more sophisticated API, should make it much easier, at least from a technical perspective, and we are currently liaising with the central web-team to develop something similar to this example from Keele University – http://www.keele.ac.uk/chemistry/staff/mormerod/ (like us, Keele run Symplectic alongside intraLibrary.)

N.B. From the Symplectic interface, a user is able to “favourite” a research record and a flag comes out in the xml from the API which I understand is used on this page to display “Selected Publications”. DOI is also available from the API to link to the published version and if a user uploads full-text to the repository from Symplectic, this link is also in the xml – the first two records on this page include links to the full-text in Keele’s intraLibrary repository.

Our own Library web-dev Mike Taylor has been looking at the Symplectic API in detail and has put together a couple of prototype pages on a development server and after a meeting this week with a representative of the central web-team I’m reasonably confident we can move forward with this work fairly quickly…though there’s still a bit of a chicken & egg situation in populating the Symplectic database to then be re-surfaced via the API in this way.

There is also the question of whether we might alter our repository policy to become full-text only; one limitation of repositories across UK HE from an original conception (in the arXiv mould) of holding, disseminating and preserving full-text research outputs, is that they have in effect become “diluted” by metadata records for which it has not (yet) been possible to procure full-text or copyright does not permit deposit and “hybrid” repositories like ours, of full-text and metadata typically contain more metadata records than full-text (see figures from the RSP survey here). As I have argued on the UKCoRR blog, I think is makes sense to separate a bibliographic database (in Symplectic) from full-text only in a repository.

N.B. As Symplectic does not have the same search functionality as the repository, this approach has the potential disadvantage that it makes it more difficult to search across the entire corpus of research records (though one potential solution may be along the lines of that implemented by City Research Online which, in my view is rapidly becoming an exemplar of a research management system (Symplectic) + full-text repository (EPrints). Another good example is  St Andrews (PURE + DSpace) who presented a case study at “Learning How to Play Nicely: Repositories and CRIS” (video here.)

And what of OER? Along with our EasyDeposit SWORD interface, using OER to resource the refocus the undergraduate curriculum and the soon to be released intraLibrary 3.5 that will enable us to harvest OER from other repositories…for now I think they may be the bits down the back of the settee…

British Library special collection: ‘Race’, Ethnicity and Sport

Hylton, K. (2008) 'Race' and Sport: Critical Race Theory. Routledge.

Dr. Kevin Hylton, Course Leader – MA Sport, Leisure and Equity here at Leeds Met, is working with the British Library to assemble a special collection of material around ‘Race’, Ethnicity and Sport.  Dr Hylton has already collaborated with the British Library on their website Sport & Society – the Summer Olympics and Paralympics through the lens of Social Science which includes a synopsis of his book ‘Race’ and Sport: Critical Race Theory published by Routledge and which “takes on the controversial subject of racial attitudes in sport and beyond. With sport as his primary focus, Hylton unpacks the central concepts of race, ethnicity, social constructionism and racialisation, and helps the reader navigate the complicated issues and debates that surround the study of race in sport.”

The new collection will be archived at www.webarchive.org.uk which, under the auspices of the BL, aims “to collect and permanently preserve the UK web” – more info here – and the Public Call states that “we hope that the ‘Race’, Ethnicity and Sport Collection will provide a valuable resource for researchers now and in the future.”

As far as I understand, Dr. Hylton is currently at the stage of identifying suitable material for the archive and asked me whether it was possible to cross-search UK Institutional Repositories to discover relevant full-text research material in this area (having, on numerous occasions, had the [mis]fortune to hear my advocacy on Open Access and repositories!).  As far as I am aware there are two services currently available – the UK Institutional Repository Search from MIMAS and the custom Google Search at OpenDoar (I’d be interested to know of any others) and some preliminary searches yielded a few relevant results – though there is no way of specifying full-text only, of course, which means many results are bib records only.

It’s perhaps still a moot point whether there is real value to a fully functional IR cross-search tool (in the style of http://rian.ie/en for Irish repositories) and the MIMAS and OpenDoar tools are described respectively as “demonstrator” and “beta” but, as Dr. Hylton’s interest supports, I’m inclined to think that such a tool, properly promoted and combined with a fully realised system of Green OA would indeed benefit the academic community, especially since Google abandoned support for OAI-PMH; I do think it would be necessary, somehow, to be able to filter by full text however which perhaps keeps the idea moot for now…

In the meantime, if anyone does have appropriate full text material archived in their repository please let us know and/or pass the call on to interested colleagues.

How to Build a Case for University Policies and Practices in Support of Open Access

Briefing paper written by Alma Swan and Frederick Friend on behalf of JISC:

http://www.jisc.ac.uk/media/documents/publications/programme/2010/howtoopenaccessfinal.pdf

Repository week

From 1st – 5th June is repository week when colleagues can come together to learn more about the repository and how it will contribute to the information infrastructure of the institution.

Projects that are supported by the repository include an Open Access research archive and research administration; reusable learning objects/Open Educational Resources and Personal Curriculum Creation and provision of distance learning.

The final programme of events is yet to be finalised but will be a series of learning lunches at Headingley and Old Broadcasting House. For more information or to register your interest please contact Nick Sheppard or Kirsty Maw.

Digital Britain needs access to science journals, not YouTube

Pertinent article in the Guardian:

http://www.guardian.co.uk/technology/2009/feb/05/science-journal-access

Launch party today!

At 2pm today, as part of the Library Festival, we are hosting a launch party for The Repository at Civic Quarter library. There will be speeches and cake and Dominic Tate from RSP is joining us to help us put our repository into its institutional and national context.

There is still a lot of work to do before the end of the project in March but I am very proud of what we have achieved so far and today’s event should certainly raise awareness as we consolidate the system and build up the repository content as the March deadline approaches.

Click the image for video

Click the image for video (PowerPoint & Camtasia as .wmv)

Show and TEL

Yesterday I spoke to colleagues at the monthly TEL network meeting which, technical problems notwithstanding, was an opportunity to engage, in particular, with learning technologists who should prove invaluable as intermediaries back to their respective faculties.

I wasn’t actually able to show them intraLibrary itself as the network seemed to have developed bahavioural problems but I did show the search interface, described how we had come to this point and what further development work there is still to do on the interface and the associated infrastructure that will surround intraLibrary – a lot of interest in the PowerLink to X-stream.

It was also a chance to plug The Repository Day which is taking place at Old Broadcasting House on November 10th when colleagues are invited to participate in one of 4 workshops that I will be running throughout the day. There will be a software demonstration (I hope!) and folk will have the opportunity to use the system and to inform ongoing development and to try out some new tools produced by our collaborating projects Streamline and PERSoNA. I also hope that people will bring their own material for upload to the repository which can be copies of published research papers or Learning Objects that they have created and would like to share with colleagues.

At the meeting we were also shown the new ALT wiki – I’ve got myself an account and started building a repository page – there’s not much there as yet!

Staff Development Festival

Leeds Met is a University of Festivals, we are often told, and the next fortnight will be the Staff Development Festival which will provide a unique opportunity to promote The Repository.  And I’m going Scuba diving, albeit in a swimming pool.

All the repository pieces are now in place and I intend to demo the search interface and the repository proper, presenting it as a semi-blank canvas that now needs to be painted upon by the University community.  I’ve already had some feedback from Jonathan Long, Director of the Carnegie Research Institute and a member of the consultancy group who would like a search to return formal Harvard references for each item emphasising that one of the reasons for setting up the repository is to increase the number of citations – the interface is just ‘out of the box’ at the moment and returning results in the manner of the IRISS interface – I’ll gather input over the next fortnight and Mike should be able to do some customisation when we have a clearer idea of what people want.  I might even have a go myself although, with no knowledge of php, I can’t make head nor tail of the site files, my web skills having stalled at basic HTML, CSS and (very basic) Java Script.  Mike has already added browse functionality which is, for the moment, based on faculty structure – I’ve set up a collection within intraLibrary for each faculty and it is these that Mike is using to generate the results although it should also be possible to use metadata fields – I think we will map DC ‘subject’ onto LOM ‘keyword’.  It might be tricky to incorporate numbers of records after browse links and I’m waiting to see what Mike has to say on this.

Anyway, for now, I have 5 citations per faculty which is adequate for initial demonstrations and I’m working on some full text content – several of the citations I’ve uploaded are RoMEO green/yellow so it’s just a matter of getting hold of author versions.

As for promotional material, I’ve ordered a big purple recoil stand similar to that for the Library and I’ve three info sheets to print up in quantity:

The Repository is an introduction to the project and to IRs specifying our dual remit for the Leeds Met repository.

Open Access: What’s in it for you? emphasises the evidence that OA increases citation (using a graph from Steve Lawrence’s seminal article Online or Invisible? (2001) which is a bit out of date but by far the clearest visual representation I have been able to find.)

And

Copyright presents a very simple flowchart of the (self)-archiving process.

I shall also try to put together a narrated presentation to run when I’m not there.  A couple of lap-tops and we’re away!

Incidentally, here is a link to the search interface:

http://repos-dev.leedsmet.ac.uk/main/index.php

(Currently only accessible from a Leeds Met IP)

Guardian Article

There is a section in today’s Guardian about the future of academic libraries which includes an article on Open Access.