Dec 8 2014
Tech meeting 12/8/14
Mike, Trish, William, Bianca
BHL
OTS followup? We need to let him know we cannot update their data unless they set up OAI. Bianca will followup
Notes discussion with collections – Notes fields each field is stored individually and Mike L recommends all subfields concatenated into a single string.
User generated PDFs – status? We said we wanted to include them. About a year since we discussed. Bianca will talk with Martin as to when we want to do it.
Followup to OCLC ingest of BHL data? no news from Suzanne
Art of Life
We decided to shut down processing at IA because we are spending too much time trying to keep it running with little benefit. Exports from past 3 week have been empty. Records are being copied by Joel and processed at SIL.
Update of Kalev’s code on Flickr – Joel is reviewing code now and will make recommendations on what needs to be changed based on the metadata we want to be there. We are limited to what books he decided to process – he did some filtering down of all books on IA but we don’t know his decision process. At some point we will need to reconcile the list of images he found with the list of pages with images we found. E.g. what do we do with stuff he identified but our algorithms said no image? Do we want to take all of his images as is or further filter?
ConSciCom/Zooniverse – next meeting Friday 9am
Nov 24th Jim sent update on UI
Hi everyone,
I've pushed an update to the site at
http://demo.zooniverse.org/bhl/ This should fix the bug where you can't mark the page if you click directly on the image to draw a rectangle.
I've added an extra marking step, to demonstrate adding text to points on the image, which might help where an illustration has parts that need marking. There's also a review step at the end, before the classification is submitted.
Trish needs to send fields to include in interface to Jim.
Mike Progress on getting them our images? Mike did a utility for grabbing all images for a particular item.
*Trish take weekly meetings off calendar.
Purposeful Gaming
All staff meeting 10am Wed
IMLS reports due at end of month
Next steps with generating better OCR – Mike L created utility to view outputs for Tesseract and game outputs. Lets meet 1pm Thursday to talk about the output
Mongodb and Wikimedia install – Mike W. has not had time to do yet
Tiltfactor work – they are back onto game design. They asked for extension due to our delays with OCR outputs
Mike L would like to get actual outputs from game for testing.
Text to image linking – we haven’t heard from Desmond about the status of his tool. Not sure if he will be developing an editing tool. Without that and if output doesn’t improve we may not be able to use it. He is presenting on tool at British Library Dec 18th so maybe that will push him to do more development
Mining Biodiversity
Riza’s team testing term finding in corpus – needs years of items. Choosing books to start with – they have experience with abstracts. They have been playing with different ontologies and thesauri. They would prefer to focus on English. Anatoly and Grace – reviewing tools they developed to determine where social media conversations are happening (netlinks, tweets)
Annotations – next steps? We will work on Dec and Jan
Next monthly meeting this Friday at noon.
Other
William out Dec 14-Jan 6th
Trish out Dec 24-30th
Mike is not taking days off so if BHL is down Mike can followup