Metadata – some platforms include subtitles, others do not; “date” may refer to date published, copyright date, or date posted online; editors are sometimes named as authors; etc. In none of the cases they examined did the platform-generated “MLA citation” actually match MLA format.
Searching – different platforms may return search results at the word, page, or chapter level. Most (61%) were chapter-level, which is probably the least useful for searchers.
Pagination – system page numbers often don’t match the page number displayed on the PDF (probably due to how front matter is counted); in EPUB format, page numbers are often missing altogether.
The presenters showed examples of how search results may vary wildly from one platform to the next. The differences can stem from search functionality such as auto-stemming, from how the platform treats hyphenation, or from whether it defaults to AND or OR searches. They also found problems caused by OCR spacing errors, e.g. “Japa nese” or “infl uential”, or words joinedtogether withouta space. See their slides on SlideShare for side-by-side examples.
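To make the search point concrete, here is a minimal Python sketch of how the same query can return different pages depending on the AND/OR default and on hyphen handling. Everything in it is hypothetical: the three-page `PAGES` corpus, the `tokenize` and `search` helpers, and the `split_hyphens` switch are invented for illustration and do not represent any platform the presenters tested.

```python
import re

# Hypothetical mini-corpus (page id -> text); not the presenters' data.
PAGES = {
    "p1": "An influential study of Japanese print culture",
    "p2": "Print-culture studies were infl uential in Japan",  # OCR split "influential"
    "p3": "Studying culture",
}


def tokenize(text, split_hyphens=True):
    """Lowercase the text and return its set of words.

    With split_hyphens=True, "print-culture" becomes "print" and "culture";
    otherwise it stays a single token.
    """
    text = text.lower()
    if split_hyphens:
        text = text.replace("-", " ")
    return set(re.findall(r"[a-z\-]+", text))


def search(query, mode="AND", split_hyphens=True):
    """Return the sorted ids of pages matching all (AND) or any (OR) query terms."""
    terms = tokenize(query, split_hyphens)
    hits = []
    for page_id, text in PAGES.items():
        matched = terms & tokenize(text, split_hyphens)
        if (mode == "AND" and matched == terms) or (mode == "OR" and matched):
            hits.append(page_id)
    return sorted(hits)
```

In this toy setup, searching for “print culture” finds a different set of pages under each combination of settings, and a search for “influential” misses the page where OCR split the word in two.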
Michael Matos of American University shared his analysis comparing library journal holdings to works referenced in faculty publications. The goal was to use the data to demonstrate the extent to which faculty rely on the library for their research. I confess that his complex methodology lost me. Next steps include looking more closely at the referenced materials that are not held by the library, then comparing those to ILL data (which would demonstrate that the researcher also used the library for those materials).
In “Strategies for Expanding eJournal Preservation,” Shannon Regan, from Columbia University, described a Mellon Foundation grant-funded project to identify e-journals that are not currently being preserved by a trusted third-party repository, learn why they are not being preserved, and explore ways to get them preserved. I was kind of surprised, and kind of not-so-much, at the amount of content, even from major publishers, that is not being preserved. As for why, I came away with the impression that the most prevalent reason is a question of rights/permissions. In some cases, a publisher may not have secured rights from the authors; in other cases, publishers (typically smaller ones) have no understanding of the need or the process for preserving content, or may fear a loss of control over the content (thinking, for example, that permitting an archiving agency to preserve the content would be equivalent to making the journal open access). Other times, the step of preservation may just slip through the cracks. Regan recommended that librarians make preservation a part of the conversation with publishers, vendors, consortia, faculty, and other stakeholders.
In a fun presentation, Kristen Garlock of JSTOR and Eric Johnson of the Folger Shakespeare Library described some projects/products developed as an outgrowth of usage data. The first was JSTOR Classroom Readings (http://labs.jstor.org/readings), a free tool intended to give educators a list of articles for core courses. Developers had originally wanted to gather college syllabi and curate a list of articles from those, but there were too many obstacles. So instead they looked at usage data for signs of “teaching use” (short bursts of use at a single institution). Though not perfect, and not yet considered final, Garlock seemed pleased with the methodology and the resulting product. Johnson talked about (among other projects) a JSTOR tool called Understanding Shakespeare (http://labs.jstor.org/shakespeare). The user can select a play, then choose a line in that play and get a list of articles in JSTOR that quote that line. Again, not complete (only includes 12 plays so far), but a pretty nifty tool.
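For the curious, the “short bursts of use at a single institution” idea can be sketched in a few lines of Python. This is only my guess at the general shape of such a heuristic, not JSTOR's actual method: the `teaching_use_candidates` function, the event format, and the thresholds (`min_hits`, `window_days`) are all assumptions made up for illustration.

```python
from collections import defaultdict
from datetime import date, timedelta


def teaching_use_candidates(events, min_hits=5, window_days=7):
    """Flag articles with a burst of use at one institution.

    events: iterable of (article_id, institution_id, date) tuples.
    An article is flagged if any single institution accessed it at least
    min_hits times within a span of window_days (both thresholds are
    hypothetical defaults).
    """
    by_pair = defaultdict(list)
    for article, institution, day in events:
        by_pair[(article, institution)].append(day)

    window = timedelta(days=window_days)
    flagged = set()
    for (article, _institution), days in by_pair.items():
        days.sort()
        start = 0
        # Sliding window over the sorted access dates.
        for end in range(len(days)):
            while days[end] - days[start] > window:
                start += 1
            if end - start + 1 >= min_hits:
                flagged.add(article)
                break
    return sorted(flagged)


# Made-up usage events for three articles.
events = (
    # Article "A": five hits at one institution within three days.
    [("A", "inst1", date(2016, 2, 1) + timedelta(days=i % 3)) for i in range(5)]
    # Article "B": five hits on one day, but spread over five institutions.
    + [("B", f"inst{i}", date(2016, 2, 1)) for i in range(5)]
    # Article "C": five hits at one institution, a month apart each.
    + [("C", "inst1", date(2016, 1, 1) + timedelta(days=30 * i)) for i in range(5)]
)
```

Calling `teaching_use_candidates(events)` on this sample flags only article "A", since "B" is popular everywhere at once and "C" is used steadily rather than in a burst.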
In other sessions, I learned a few new Excel functions to try out, plus a couple of things to try with CORAL. I was also pleased to hear EBSCO Chief Strategist Oliver Pesch say very plainly and repeatedly that EBSCO supports customer choice and is actively seeking ways to optimize customer choice. I felt encouraged when he said that “no one vendor can offer libraries all the resources” they need, and that if you want to use an EBSCO product for one part of your workflow and a competitor’s product for another part, the workflow should not only work, it “should be optimized.”
Finally, my favorite quotes from the conference:
Scott Vieira, Rice Univ., referring to typical functionality in e-resource management systems: “Forcing the acquisition of e-resources into a linear workflow is like trying to train tortoises to walk in a straight line.”
Marcella Lesher, St. Mary’s Univ., about a journal weeding project (I’m probably paraphrasing): “We’re talking here about the care and feeding of print resources … although at this point we’re probably starving them to death.”