2009 Architecture Meeting Minutes
printer friendly
Project Plan: BHL_DevPriorities_09.pdf (PDF) or
BHL_DevPriorities_09.mpp (Microsoft Project)
last updated 3/18/2009
Monday, March 9
Review Action Item: Review thumbnails of PDF generator
- Assigned: Mike L
- Date: April 30
Action Item: Creation of Dissemination/Publicity Group
- Assigned: Execs & IC
- Date: March 31
Action Item: Outreach for more effective use of PDF Generator
- Assigned: Dissemination/Publicity Group
- Date: Apr 30
Action Item: Size files from MoBot in the PDF Generator
- Assigned: Mike L & Chris M & Chris F
- Date: April 30
Action Item: Articlizing metadata review of capture to see how it is going and for development
- Assigned: Bianca & Mike L (assist with data export)
- Date: July 31
Action Item: Add more links from Wikipedia.
- Assigned: Dissemination/Publicity Group
- Date: Ongoing
Action Item: Wikipedia as content delivery for every title. Highlight specifics with pages.
- Assigned: Dissemintation/Publicity Group
- Date: Ongoing
Development Priorities
Action Item: Key dates send to Chris F. Revisit this priority list with more time lines and milestones.
- Assigned: All attendees
- Task: Create wiki page (Chris F)
- Date: April 30
Action Item: Investigate with Zotero Commons / Dev Roadmap
- Assigned: Martin K
- Date: April 30
Action Item: Chris revises list with discussion from meeting
- Assigned: Chris F
- Date: Before IC Meeting
Action Item: Review Richard Pyle’s Zoobank and use of LSIDs
- Assigned: Chris F
- Date: October 31
Action Item: by July OCLC ISBN discussion
- Assigned: Martin & Suzanne
- Date: July 31
Action Item: BHL to EOL from species discovery bibliography.
- Assigned: Mike L & Patrick
- Date: July 31
BHL-Europe
Action Item: Europeana bringing in the citizen into the BHL-Europe and in-fact EOL demand. Impact on selections. EOL needs.
- Assigned: Paddy, Henning & Tom
- Date: Completion May 1 [revised to 12/31?]
Action Item: October European Digital Library meeting. Need to potentially identify participants.
- Assigned: Exec Committee
- Date: March 21 abstract due date
Action Item: eBiosphere potential networking
- Assigned: Remind Exec Committee/Institutional Council attendees
- Date: May 29
Action Item: Start a FAQ on BHL wiki responding to common user complaints/suggestions, with goal of migrating to public-facing BHL-E project wiki. Post known problems and status of issues. Help frustration levels. Explore staffing for promoting what we have and what we know we don’t have or need to fix
- Assigned: Martin & Chris F & Connie, then all join in the fun
- Date: April 30
Action Item: BHL – Europe membership to the BHL Wiki space.
- Assigned: Henning to advertise to BHL-E participants
- Date: April 30
Scanning Priorities
Action Item: Index Animalium use as prioritization. BHL Portal key value pairs with Title with an identifier (TL-2). Is able to fuzzy match.
- Assigned: Suzanne & Bianca, Patrick dragged kicking and screaming
- Date: June 30
Action Item: Reconstitute a new Collections Group. Bianca lead. Exec/IC committee will work on formation of subgroup.
- Assigned: Bianca lead subgroup formed by Execs
- Date: April 30
Action Item: Look at taxon specific groupings/classification/counts of what content has been represented in BHL.
- Assigned: Mike L & Chris with Bianca & BIG
- Date: July 31
Content Acquisition
Action Item: EOL’s LifeDesk connection to the Article Repository
- Assigned: Phil, Chris M, Chris F, Bianca, BIG, Cyndy Parr
- Date: Wednesday, April 30
Action Item: Policy discussion around copyright/use re: Article Repository
- Assigned: Execs, IC
- Date: April 30
Action Item: Johann Bollen Los Alamos use of triple stores and the semantic web use of citations.
- Assigned: Ahmed, Chris F
- Date: April 30
Action Item: Search over BHL Portal and the Article Repository – Article articlized source from BHL into the article repository keep “open” and not locked down into group.
- Assigned: BHL Dev Team
- Date: July 31
Action Item: Identify the connection of BHL Portal titles and articles.
- Assigned: BHL Dev Team
- Date: July 31
Action Item: Gather test group & use cases to help define functionality seed groups to encourage community building. ZooBank’s citation needs of handles in the Biblio (Richard Pyle)
- Assigned: Tom & Bianca & Chris F
- Date: April 30
Action Item: Can biblio pull the xmp data out from pdf?
- Assigned: Phil & Chris M & Mike L, Suzanne
- Date: July 31
Action Item: All sign up for the Article Repository & provide feedback
- Assigned: All
- Date: April 30
Action Item: Name the Article Repository -
Captain Kidd Spitball, caveat linktor
- Assigned: Execs & Creative people
- Date: April 30
Action Item: Small workgroup/ task force to examine requirements for a dedupping and bid list. Report to Henning. Executive Committee designed. Review Open Library’s dedupping algorithms.
- Assigned: Bianca, Diane, Matt, Bernard, Ryan, Joe advise the BHL Europe
- Date: April 30
Action Item: Google Docs BHL Digitization Specs link send to group and any updates that have been created on the wiki (Bernard?) It needs to be reviewed to see if we have a common lowest denominator of needs. Develop Strawman for discussion. Deliver to Henning
- Assigned: Martin, Chris F, Bernard, Mike L, Adrian
- Date: April 30
Action Item: Survey of BHL – Europe like we surveyed ourselves to find out what the data is like, what do they have. Needs to go out before BHL – Europe meeting.
Questionnaire.doc
- Assigned: Tom will post to wiki & everyone review
- Date: April 30
Action Item: Define METS profile for SIP/scanned content
- Assigned: Chris F & Terry Catapano
- Date: July 31
Action Item: Europeana wants to ingest – Documentation of ways to get data out – Repositories that want our data. Static export done over times but not a feed, OAI, SOAP, REST, etc.
- Assigned: BHL Dev Team
- Date: July 31
Action Item: Take sample from California Digital and dedupe what we have done already. Then would be to bring it into our prescanning dedupping processes.
- Assigned: Mike L (& Bianca Lipscomb)
- Date: June 30
Action Item: Review Open Library dedupe algorithm, possible change of existing BHL practice.
- Assigned: Mike L & Chris & Suzanne
- Date: July 31
Action Item: Identify sources of scan material. Who they are the type of material. Potential contributors by class of donator. BHL Europe, BHL China, IA partners, Publishers, Back Files, BHL Partner libraries.
- Assigned: Tom & Bianca to establish page wiki
- Date: April 30
Content Management
Action item: Follow up with Robert Miller at IA about insertion page – dummy book with insertion before files and after files. See what they do and tell what they do.
- Assigned: Martin (done), Mike L to review, Tom & Cathy to follow up
- Date: April 30
Action item: Cathy find out what Brewster meant by the Migration comment. What migration to what?
- Assigned: Cathy
- Date: April 30
Action Item: Exec. Committee decide if we need to work with IA on correcting data & harmonize data with other locations, ie: Open Library
- Assigned: Exec Committee
- Date: July 31
Action Item: Mike L to look at what records might have mistakes because of diacritics. NY Bot skipped the letter so not as easy to find. NY Bot might be able to give a MARC dump of the date ranges.
- Assigned: Mike L, with suggestions on how to bulk update
- Date: April 30
Action Item: Review portal editing needs with Mike L and Chris F
- Assigned: All attendees/users, Suzanne already started wiki page here.
- Date: Ongoing
Action Item: Notification of record changes. RSS. Tracking, logging of changes.
- Assigned: Mike L to inform what can be done, all to review and discuss what is needed.
- Date: July 31
Action Item: Implications MARC records in to BHL portal. Pointing two records to same scan etc. Describing one set of scanned items as both a monographic series & a serial.
- Assigned: Mike L, Diane & Suzanne
- Date: July 31
Action Item: Volume information normalized. If find examples send to Chris. F and Mike to see on multivolume sets for fill in gaps to get sequence.
- Assigned: All users
- Date: Ongoing
Action Item: Write best practice document for multi-part complex bibliographic items as used by biodiversity scholars.
- Assigned: Tom, Bernard, Matt, Suzanne
- Date: Draft by April 30
Action Item: Exec Committee retrospective portal clean up data – expectations?
- Assigned: Exec Committee to take up with Institutional Council
- Date: Raise with Execs on Tuesday, March 31
[from here up, dates updated according to MicroProject file on 3/19--B. Lipscomb]
Tuesday, March 10
Content Delivery
Action Item: Suzanne adds Portal Editing Wiki Page to the action item
- Assigned: Suzanne
- Date:Done
Action Item: Enhance pdf deliverable and email to explain information
- Assigned: Mike L
- Date: April 30
Action Item: Scim open source pdf reader for Mac’s. Review needed.
- Assigned: Mike L & Chris F
- Date: April 30
Action Item: Better integration of OCR to the PDF deliverables. Prices etc.
- Assigned: Chris F & Mike L
- Date: July 31
Action Item: Everyone needs to gather information PDF forms and deliverables. Dissemination group to announce the pdf and collect information.
- Assigned: All
- Date: July 31
Action Item: Dissemination group way to facilitate the gathering of user feedback and end users on portal development
- Assigned: Dissemination group
- Date: July 31
Action Item: BHL Dev Team review indexing MARC for title-level access
- Assigned: BHL Dev Team
- Date: July 31
Action Item: BHL Dev Team Solr index of keyword across all OCR text. Is this really needed. Data mining tools might be over kill. Review what implications are.
- Assigned: BHL Dev Team
- Date: July 31
Action Item: Investigate techniques for place name searching.
- Assigned: Ahmed & BHL Dev Team
- Date: Oct 31
Action Item: Work with IA on the page types from IA for helping in identification of Illustrations.
- Assigned: Mike L & Chris F
- Date: July 31
Action item: Revisit delivery of thumbnails or small page images for browse – visual exploration of BHL portal. Adobe side board.
- Assigned: Mike L, and is also addressed by inclusion of link to FlipBook (beta)
- Date: April 30
Hardware Infrastructure
Action item: Further discussions with BHL – Europe Adrian with BHL Dev team on data moving and speed etc.
- Assigned: Adrian & BHL Dev Team
- Date: April 30
Action item: Chris F. and Adrian discuss the way to have BHL Europe builds its first deliverable mirror and planning the next platform and load balancing. draft a possibilities to immediate solutions with the goal of the Darwin and Datanet as the long term
- Assigned: Adrian, Chris F, Tom G, Phil, Henning
- Date: April 30
Action Item: Datanet Tom G discussion with Datanet as potential use as a dark archive for BHL
- Assigned: Tom G
- Date: July 31
Action Item: Phil looking at BitTorrent as an alternative distribution model
- Assigned: Phil
- Date: July 31
Action item: Using Fedora with petabox solution redundance IA content
- Assigned: Chris F, Phil, Ahmed & Sam (IA)
- Date: April 30
Action item: Chris and Adrian and Henning to talk before 2-3 Europeana kick off Netherlands OCR conference where AIT (Austrian) in the Hague in April 6-7– Adrian and Henning. Determine Chris’ availability.
- Assigned: Chris & Adrian & Henning
- Date: By March 20
Action item: Tom find out the SI resolution by end of march
- Assigned: Tom G & Chris F
- Date: March 30
Action item: Chinese Academy of Science discussion continues
- Assigned: Tom G
- Date: Ongoing
Action Item: Cathy to monitor the resource space that is in the Otis Airbase request
- Assigned: Cathy N
- Date: April 30
ViTaL
Discussed at lunch.
Action Item: Evaluate BHL exports for incorporation into ViTaL/SFX
- Assigned: Bernard
- Date: April 30
Action Item: Evaluate ingest of AnimalBase into Drupal/Biblio via OAI
- Assigned: Chris F, Phil & Chris M
- Date: April 30
Action Item: Evaluate
Falx, which is ingesting Scratchpad bibliographies via OAI; determine shared vision for global bibliography management (including LifeDesks & Scratchpads)
- Assigned: Chris F, Phil, Bernard, David S, Vince Smith, Simon Rycroft
- Date: April 30
Data Mining
Action Item: Chris M and Chris F provide Amhed page identifier Evaluation set for last summer evaluation of Taxonfinder 2J
- Assigned: Chris F
- Date: DONE!
Action item: Amhed to report evaluation to Cathy and Chris
- Assigned: Ahmed
- Date: March 30
Action item: Drupal module for TaxonFinder2J
- Assigned: Chris M, Ryan, Ahmed, Phil, Chris F
- Date: July 31
Action item: Stable url verify with Amhed and Ryan for the Taxonfinder 2J
- Assigned: Chris F
- Date: Tomorrow
Action item: Mike results of Zea mays number count seemed off from search and discover bibliography.
- Assigned: Mike L
- Date: April 30
Action item: Follow up with Patrick L about the development of the API on EOL of the name synonymies service
- Assigned: Patrick & BHL Dev Team
- Date: July 31
Action item: taxonfinder 2J against gray names in Mobot’s storage of names
- Assigned: Mike L & Ahmed
- Date: July 31
Action item: Tom G. to follow up with Lee Giles check in to see what is going with the latest Penn State request
- Assigned: Tom G
- Date: April 30
Action Item: Cathy, Holly and others George Toma at NLM follow up on markup
- Assigned: Cathy, Holly
- Date: April 30
Action item: Tom G. investigate partners for research grant for automated article metadata structure. Google summer of code. Berkley I school.
- Assigned: Tom G + IC
- Date: July 31
Action item: Dissemination group look at the use of invitation the pdf articlizer.
- Assigned: Dissemination Group
- Date: July 31
Action item: Chris F. to contact Vince Smith about google contacts.
- Assigned: Chris F
- Date: March 30
Portal development
Action item: LC Flip book. Mike Hand at LC, Joe , John M., Chris F. and Martin to work with the move to modular approach.
- Assigned: Martin & Chris F
- Date: April 30
Action item: IA flip book beta in the BHL Portal beta site for bhl members to look at. Testing swap out ability
- Assigned: Mike L
- Date: March 30
Action item: exports include all pages or pages only with names? Mike L. Can we give everything that we offer.
- Assigned: Mike L
- Date: March 30
Action item: TDWG literature interest group on specs citation and title resolver for biology literature. John M. and Chris F. David Remsen.
- Assigned: Chris F, John M, Remsen
- Date: October 31
Action item: Article Repository – DOI’s from articles in the repository. Proof of concept some hight impact titles. David Shorthouse and Jim Edwards and DOI deal. Linnea monographs. Chris F., Tom.
- Assigned: Chris F, Tom, Mike L, David Shorthouse, Jim Edwards
- Date: July 31
Action Item: Outreach group social networking web2.0 opportunities. Institutional Counsel or list. Tom G to do the discussion list.
- Assigned: Tom G
- Date: March 30
Action item: Search engine optimization. Phil and John M. planet software Code4Lib – aggregate that blog on similar topics.
- Assigned: Phil & John M
- Date: April 30
Outreach
Action item: Martin will create list of all of the social networking BHL is participating in currently.
Action item: Strategic plan of BHL has communication parts. Susan F. include some of the social networking group.
- Assigned: Tom, Susan F, Nancy
- Date: March 30
Action item: One page list ideas of monetizing the bhl data for incorporation in the sustainability plan.
- Assigned: Tom
- Date: March 30
Action Item: Add to monetizing in sustainability plan the Kirtas deal. Investigate the Abe book thing.
- Assigned: Tom
- Date: April 30
Action Item: Email list more active – technical list created with Henning and Adrian attached to the technical list. Technical committee. Form the committee with specific technical group.
- Assigned: Phil & Chris F
- Date: March 30
Recap & Administrivia
Action Item: Suzanne send action item to Tom
- Assigned: Suzanne
- Date: March 13
Action Item: Tom sends to everyone the action item telling those with assignments.
- Assigned: Tom G
- Date: March 16
Action Item: Technical committee to decide on follow up next actions and meetings etc.
- Assigned: Chris F, Tom G
- Date: March 30
Action Item: Chris prepare one page executive summary for IC
- Assigned: Chris F
- Date: March 19
Wednesday, March 11
Name Finding
NameFinding notes - Patrick & Paddy
Action Item: BHL to describe high level description of needs (framed during discussion) and input into Jira; determine a date by which we need this
- Assigned: Tom G & BHL Dev Team
- Date: March 30
Action Item: Organize meeting around NameFinding (with vernaculars) & NameReconcilliation before June 30th
- Assigned: Tom, Paddy, Chris
- Date: March 30
Nomenclatural Acts
Action Item: Review search interface to make it more article-friendly with existing metadata present in BHL
- Assigned: BHL Dev Team & Patrick L
- Date: April 30
Action Item: Tom to describe Nomenclatural Acts issue in Jira
- Assigned: Tom G
- Date: March 30
Automated Markup
Action Item: Chris to pick back up with reCAPTCHA
- Assigned: Chris F
- Date: March 30
Action Item: Tom to discuss OCR rekeying with Chinese Academy of Science
- Assigned: Tom G
- Date: April 30
Action Item: Chris & Mike L, with Patrick, to think about/look at leaving tags for names in OCR
- Assigned: Tom G & BHL Dev Team
- Date: April 30
Action Item: Reevaluate wiki for rekeying/markup as OCR is major barrier
- Assigned: Phil & Chris M
- Date: April 30
LifeDesk/BHL Integration
LifeDesk/BHL integration notes - David S & Paddy
Action Item: Write a document for a shared vision & identify coding needs; where is overlap what works needs to be done.
- Assigned: Chris F & David S
- Date: March 19
Decisions
Review discussion from Day 1 & include decisions here
Decision: Volume item and date information can be done by anyone.
Parking Lot
Statistics of EOL needed Patrick and BHL – number of species BHL link. We also need BHL volumes not title.
Interface with Image server with Jpeg 2000 help user
IA metadata issues and redundant
ISBN and Metadata for digital manifestation records (Cornell)
Looking at the article repository to see how data is being used and how the cross information
Copy right issues/fair use and safe harbor concepts
NLM DTD wrapper
Images from Abby documentation from IA – there but need to pull out.
Tuesday?
Redundant issues
Content and PDF delivery
Can’t find articles they want
Files too large
OCR problems
Wiki for every page or wiki for every book for public editing.
Wednesday
Taxonfinder development – tomorrow