This is a read-only archive of the BHL Staff Wiki as it appeared on Sept 21, 2018.

Oct 12 2015

Tech meeting 10/12/15
William, Trish, Mike
BHL general
IA ingest failed this weekend, but Mike got it going again today, so we are only a couple of days behind.
Disqus updates are ready to go. Mike sent the latest changes to Nathan. They will go live next Monday.
Finding inconsistencies with GNRD (name-finding algorithms). Bianca and Mike have emailed Dima repeatedly but are getting no response. Mike noticed that the uBio and TaxonFinder services are both offline today; not sure how long they’ve been offline. William has been in contact with Dima about his plans for GNRD. Should we bring the systems in-house and support them here? We’d have to know more about how the systems interact. Mike is concerned that setting it up could be a challenge. Source code is available, so we could take a shot at setting it up; the problem is that the systems are written in 3 different languages. William will talk to Dima about a timeline and will talk to Martin about his preferences.
DOIs: Mike says these are still in limbo, on pause until we decide what to assign DOIs to. Articles without ISSNs: CrossRef says to deposit them as “reports”, but then you can’t find the article in CrossRef if you search by its title. How do we want to handle this? Also Rod Page’s issue with DataCite and CrossRef: he found a journal article with IDs from both and is wondering how it has 2 different DOIs. The end user should not have to reconcile these IDs. We need to have a larger discussion with Martin and others before Mike can move forward.
Mike set up a process for grabbing Flickr tags once a month. The tags are pulled into a table in the BHL database for when we want to do something with them.
Mike talked with Joel the week before last. Joel was going to meet with their IT folks to verify, but we think the timeline for the servers would be late October, and we would still need to have everything installed and configured. Strategy for moving OCR files: looking at syncing tools. MS SyncToy can’t handle our volume, and BitTorrent Sync will be blocked on the SIL network. Worst case, we’ll ship hard drives and will have to stop and restart ingest.
Database server at MOBOT: maintenance plans are taking 12+ hrs per night. The databases may be too big and the memory on the machine may not be sufficient, but Mike L has investigated and can’t seem to find a specific cause. Mike L is working with Mike Westmoreland to resolve this.

Purposeful Gaming
Promotion of the games is an ongoing effort, but Trish is running into a lot of challenges trying to get articles and press releases out there. She is working with Patrick, Grace and Max. Trish got MOBOT to do a press release. Still no luck getting IMLS to promote the games on their blog.
Trish contacted the Post-Dispatch to ask if they could cover the game award. The reporter was excited about the idea but doesn’t currently cover the museums in STL, and there are no reporters covering them right now. He will try to see what he can do. He did ask us how many players we need to get the content corrected. This became an exercise for Max, Trish and Mike to see if we could come up with stats for that and use them as a challenge. We estimated that if we could get 9k players to play for at least 60 minutes, we’d correct 1k books, and the corrections could then be applied to other BHL content. Should we have Mike process more books for the game? Yes: for now let’s have 500 books ready in case we get the big promo that draws a lot of players.
Mike moving corrected text from MongoDB to BHL: Mike is creating a tool to pull OCR out of Mongo and put it into a packed file. Eventually uploading will be done through the admin portal once this functionality is added (this will not happen until after the move is complete and things are stable). It will be possible to upload all the pages for a book.
Monthly meeting this week is on Wed
Trish is currently reviewing budget expenses and what money is left to spend. We have to submit all expenses to IMLS by Nov 30th.
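
The player estimate above (9k players at 60 minutes each to correct 1k books) can be turned into a rough per-book figure. The sketch below is illustrative only: it assumes the correction effort scales linearly with the number of books, which the notes do not state, and the function names are hypothetical.

```python
import math

# Figures taken from the estimate in the notes.
PLAYERS = 9_000
MINUTES_PER_PLAYER = 60
BOOKS_CORRECTED = 1_000


def player_minutes_per_book(players: int, minutes_per_player: int, books: int) -> float:
    """Total play time divided by the number of books it corrects."""
    return players * minutes_per_player / books


def players_needed(books: int, minutes_per_player: int, minutes_per_book: float) -> int:
    """Players required for a batch of books, ASSUMING linear scaling."""
    return math.ceil(books * minutes_per_book / minutes_per_player)


per_book = player_minutes_per_book(PLAYERS, MINUTES_PER_PLAYER, BOOKS_CORRECTED)
print(per_book)                                             # 540.0
print(players_needed(500, MINUTES_PER_PLAYER, per_book))    # 4500
```

Under that linear assumption, the 500 books held ready for a big promo would need about 4,500 players at 60 minutes each.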

Science Gossip
Blog post and challenge tomorrow asking folks to help us finish the existing journals so the new ones can be made public.
Data sets are now available on GitHub, so anyone who wants to mine the data collected from Zooniverse can grab it there. Zooniverse is OK with us posting the data there.
Coordinate information coming out of SG did not match the original BHL images because Zooniverse scaled the images down. By applying a scaling factor to the coordinates, Mike was able to match them back to the original BHL images.
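
The coordinate fix described above can be sketched as follows. This is not Mike's actual code: it assumes the images were scaled down uniformly (same factor for width and height), that the factor can be derived from the original and scaled image widths, and that regions are stored as x, y, width, height. The `Box` type and function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Box:
    """Rectangle in pixel coordinates (hypothetical structure)."""
    x: float
    y: float
    width: float
    height: float


def scale_factor(original_width: int, scaled_width: int) -> float:
    """Factor that maps scaled-image pixels back to original-image pixels."""
    if scaled_width <= 0:
        raise ValueError("scaled_width must be positive")
    return original_width / scaled_width


def to_original(box: Box, factor: float) -> Box:
    """Map a box drawn on the scaled image onto the original image."""
    return Box(box.x * factor, box.y * factor,
               box.width * factor, box.height * factor)


# Illustrative numbers only: an original 2000 px wide, shown at 1000 px.
factor = scale_factor(2000, 1000)
print(to_original(Box(10, 20, 30, 40), factor))
# Box(x=20.0, y=40.0, width=60.0, height=80.0)
```

If the width and height had been scaled by different amounts, separate x and y factors would be needed.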


Mining Biodiversity
Meeting Friday with Mining Biodiversity. The visual resources group is working again. Grace was invited to present at the LITA conference.
Meeting at the end of January for DID awardees in Glasgow, organized by JISC.
No-cost extension request - William will be sending a formal request to Trevor Owens.

Expanding Access to Biodiversity

IMLS project to NYBG that officially started October 1st. Trish got an email from Susan Fraser announcing the start, but no meetings are set up yet. She is working on subcontract agreements and will send them out shortly. Trish, Martin, Susan F. and Susan L. will all be at the IMLS Focus meeting in New Orleans in Nov.