Jan 20 2015
Tech meeting 1/20/15
Mike, Trish, William, Bianca
BHL
User generated PDFs – status? We said we wanted to include them. About a year since we discussed. Bianca will talk with Martin as to when we want to do it. Martin would still like to include – can we do by June? Yes. This would come in similar to Biostor data – article md will be searchable and just link to pages not pdfs. We should do a soft launch to make sure its incorporated well first and we don’t’ see a lot of messy metadata. Someone will need to alert Rod Page before it goes live .
JEANH – Bianca asks could we scan actual volumes and then blend the article md with the scanned volumes? Something to consider for the future.
RJB – links to their PDFs have not been working recently because of staffing issues. Seems to be back now but what is long term reliability of their links?
Contributor browse – new layout suggestion from Bianca. Mike hasn’t done anything with it yet. No rush. Removing institutional collections since we have contributor browse now. What about Museum Victoria’s landing page – is there a way to move the splash page info to the contributor browse? We could move that content to the More Info button from the Contributor List View if a provider requests it.
Bianca talking to NAL on Friday - they have some md record issues that she will be working through with them that relate to how catalog their content. Some MARC records have such small differences between then that they are getting merged together when they shouldn’t be.
Art of Life
Update of Kalev’s code on Flickr – Joel is done. Mike says there are about .5 million images in Flickr with BHL collection:biodiversity
ConSciCom/Zooniverse –
Mike sent them 23 GB of content files (took 11 hrs to push to Amazon S3 bucket)
Purposeful Gaming
mongoDB ready and mediaWiki is to be done
Next steps with generating better OCR – Mike L will start generating Tesseract outputs. He will use the criteria of English publications from 1900-1923. Mike did a query - 3300 items
Tiltfactor work – waiting on them to send us next games for testing
Text to image linking tool – no updates on Desmond’s site. Trish will email him to get definitive answer on whether toll will be further developed.
Mining Biodiversity
Annotations – Trish and William annotating now.
NGram algorithm – trying to evaluate the algorithm results. William sent them truth files from Europe to compare against what they found.
Looking at social media success in past campaigns - Grace will write an article on the social media aspect of the project.