Aug 25 2014
Tech review meeting 8/25/14
Present: William, Trish, Mike
BHL
Migration of CB content to BHL is pretty much complete - Mike says AMNH OAI feeds are setup but Mike found an issue with ingest so will tweak this week. JSTOR is still a question for ingest since they didn’t respond to Trish about how to filter biodiversity content from IA. Trish cc Chris about his role in next steps.
GNRD name service won’t stay up and running – Dima hasn’t responded to Mike’s emails. New stuff coming in isn’t getting indexed. We could switch back to general public GNRD but not sure how well that is working
Bianca needs Mapping document between MARC record and display in BHL - still to be done by William. Mike has added in adm portal the MARC names next to each field in BHL which could provide a good starting point for William’s document. -
Aug 25, 2014 Thanks guys!
Art of Life
Need to ramp up volunteers for classification – Trish followup with library schools that Richard Hulser recommended.
Need to followup with Gaurav on his summer project and what was learned about extracting data from Wikimedia Commons
Requested phone call with British Library last week to talk with them about their challenges uploading to Flickr – no word yet.
Will be setting up a phone call with ARTstor, Martin and myself in the next few weeks to discuss the policy issues of sharing BHL images with ARTStor. Martin said there was push back from the members when this was brought up “You may recall the mini-firestorm when JSTOR/Plants wanted to PAY US for content that was shouted down by the members”
Conversation with big data researcher Kalav Leetaru – Thursday 28th 1-2pm CST. What are we interested in asking him? Could he identify the colors of the images? Could he help us with bulk uploads to Flickr and extracting data out? We should incorporate our conversations with him into our final report. Possible future grant collaboration?
Purposeful Gaming
Next steps with Tiltfactor
William sent OCR outputs
Subcontract agreement – Trish sent edits to Dartmouth
BHL team is going to decide how to handle accents, special characters, and fractions. We would love a decision on this by next week, if possible. User should treat it as a transcription. If word contains both large and small caps we would want user to transcribe as all large caps because they won’t be able to do formatting while typing. Diacritics are currently ignored in BHL when searching . We’re not sure what solr does.
Decision: We will not require accent characters when users type in the box. Fractions – ¼ =1/4
Automated text to image linking tools - No word from TILT2 folks (sent email Tuesday) Desmond Schmidt, maybe have wrong contact emails? Twitter for Desmond @bltilt
Joe sent some updates on tools hes looking into
Set up meeting with 3 MOBOT team to discuss comparing the OCR – yes. Wed at 10am CST. Mike B will look into parameters for Tesseract to see if he can improve some output like Latin.
Mining Biodiversity
Martin, Grace and William decided we will incorporate AddThis and Google Scholar Md tags to BHL Put on beta and Give EC one week to review. Mike wants to know how we want to implement. Should it just replace the FB like and Tweet buttons at the top within the page viewer? Probably better than sidebar otherwise it will cover up content on BHL
Google scholar md tags – need this along with AddThis in order to know the article citation from which a page is referenced.
New BHL Privacy policy – how different is this from old policy? Old policy says we wont use commercially but now users have to opt out in order to not have their info used commercially. If you opt out you have to do it on each computer that you use and every browser on that computer. Its based on the cookies that get applied from each browser.
Survey to users about semantic searching – out and getting feedback.