BHL
Archive
This is a read-only archive of the BHL Staff Wiki as it appeared on Sept 21, 2018. This archive is searchable using the search box on the left, but the search may be limited in the results it can provide.

TechCall_23jan2017

Agenda
Joe, Katie, Pam, Susan, Trish, Mike, Ari, Carolyn, Joel

1) Dima Mozzherin in St. Louis this Thursday (January 26) to talk about name services.
Meeting with Mike and William to discuss enhancements to name services and potential for re-indexing BHL. Potential to get feedback more quickly.

2) PDF generation UI enhancements. Is everyone satisfied with the changes? Do we have enough ideas for what to do if users requesting entire books becomes a problem?
Flakiness with images loading in; if scroll around a lot, images are not necessarily loaded in. Not necessarily related to the changes.
Are we comfortable with pushing new feature out?
Yes.

Other PDF things - could be an opt-in for newsletter and Donate button (per Grace's Gemini ticket)

Also, what do we want to do with the article metadata we're collecting?
Joel will see if there's already a ticket about this.

If they've come into from part / number,

If they're sitting on an article, title of book and title of article.

Could probably find a way to pre-select the pages.

People want title page of journal and article. Kind of common practice in ILL community

Technical issues that might come up would be the book reader code was written by IA about 7 years ago, and the other part by Australia frmo 4 years ago. It's a little obsolete.

Does BHL Australia have any plans to update their code? They wrote the code for our portal; they were part of the re-design
No discussion about moving to a JQuery


3) Transcriptions in BHL (Katie and Joe, MCZ)

Taking a look at tools that were used for Purposeful Gaming, DigiVol and FromThePage. Reviewing the methods for producing transcription files. And looking at in general what's being used in other places.

What do we want to do with a tool of this sort?

Do we want to develop our own tool using open source software? Integrated with BHL Portal?

Perhaps it's simpler to use an external one and then importing.

Susan - would like to arrange a longer call focused on this topic. NYBG started a digitization with a transcription component a few months ago to digitized and crowdsource transcriptions. Selected FTP, already have a site created.

After transcriptions are complete, Ben Brumfield will put transcriptions into zip file so one transcription page per file. Transcription text will occupy where OCR text is.

We'll have several sources of transcriptions. Even if integrate a main one into the portal, we'll likely continue to collect through other external systems; for example, SI transcription center.

Once transcription text is "done" is it really done? What do we do when we get gemini issues about corrections.
So thinking about versioning.
Overall project is not just how to collect them, but also how to display and deal with crowdsourced corrections.
And thinking about validation for corrections.

If corrected later, what does that mean for data mining for scientific names?

FTP supports transcribers picking out scientific names

Virtual intro for Katie and Nicole Kearney

How do we want to handle mark up in transcriptions?

Whether the markup from FTP, does that interfere with data mining?

Ben Brumfeld was in NYC on Friday; Can get a TEI export, mark up language files .md, or plain text

We'll be indexing transcriptions for search, so if it has a lot of different mark up it could interfere with that

If we just replace the OCR files, the markup will just be displayed as is.
Or we'd have to consider how else to modify the interface.

Becomes a whole separate issue.

Would be helfpul to have a matrix of approaches with pros and cons of each.
Ties into Ari's work with images
She'll be more focused on making images more searchable. Begs question of having users tag images outside of BHL, or also within BHL?

Susan would like to arrange a meeting with Joe and Katie about the NYBG grant.



Susan: Linnaeus Link - mission , assigned numbers to each of these works. Gather info on libraries around world that have Linnaean publications on shelves and where digital copies can be found. Linnaeuslink.org Approached NYBG and Chicago, and Harvard Bot; via a meeting in UK last year. We've been working on rounding up copies and making urls available to Linnaeuslink. Would be good to support assigning the number for these; we support it for TL2, but not yet Soulsby identifiers for bib records and articles.


Full Text Search Server - lost in the mail! Aaaargh!