Nov2008ActionItems
BHL Scanning & Metadata Workshop
ACTION ITEMS:
printer friendly
Serial Bidding/Mashup
- Wonderfetch (tm) needs a place for the Mashup identifier of the merged serial title
- Chris, Keri, Bernard, and Diane
- Mashup needs to expose the id of the merged serial title
- Editing capabilities in mashup, title merger and clean up of data as needed functionality added
- Definitions of Bids (partials, full, recording needs in mash up). Group formed to work on these topics
- Clean up and merging of serial titles
- All BHL staff as bidding Serials
- Retrospectively merging titles. Making sure bids are reviewed. If you notice overlapping bids contact libraries.
- Portal communicates to Mashup to know what has been scanned
- Bernard, Michael and Chris
- Series/Serial: check series in serial mash up to use that id to group in portal. If no serial record in mashup, just keep only monographic flow. Bid even if you do not have holdings attached in mashup
Monographs processing and general workflow issues
- Bound withs – all titles will be cataloged and processed to the dedupper.
- Suzanne needs to review the Smithsonian procedures
- Monographing DeDuping: Fuzzy matching on titles, bulk deletions
- Monographic DeDuping: Retrospective of all monographs scanning and all monograph rejects need to be processed on the dedupper
- Recording of information that we know we lack to deal with in the future. Wiki page for now
- Everyone record their own way what is being rejected for future comparison.
Portal Searching, Editing, Display, etc. Enhancements
- Searching: date sort, date search, complex searching and expanded full text of all record. Review WorldCat.org ; MBL’s advanced search ; ZooRecord ; Itunes.
- Diane , Doug, Don, Michael, Chris, Suzanne
- Editing the Portal: First one in has to let it “spin up” – Bernard’s shop will potentially be the first daily.
- Editing Portal: All items from one to the other. Be able to add more than one. Pull out some if all.
- Display of items (by 14th) : Group on to portal, view, volume, download, more information, general item level design
- Michael, Chris, John M., Maggie R.
- Monographic separate with series records, how do we reference both series and monograph in the portal without one “winning” or being the primary. The catalog separate title and the volume within the series
- Kevin, Chris, Michael and Diane
- IA - BHL linking: if a title falls in the biodiversity grouping, IA should point to the BHL URL
- Portal edited: change to not let leader be edited; review uniform title field data in records current in portal; institution name does not need to be edited; add multiple language options looking at the data supplied in MARC fields; break apart the 260 data so that parts can be edited; removal of the TL2 hold over from Tropicos; add preceding and succeeding titles, splits and mergers data.
- Send URL and how to edit portal to group
- Record date and users of editors. Review needs and make recommendations to the group by next Friday about what the portal needs to keep in regard to data editing history
- Duplications is fine. Merging into one record.
- Michael makes merging work
- BHL Staff
- Portal gives explanation of how separate serials vs monographs can be determined with the file that is exported
- Verify where the new oclc record for the digital manifestation stores old OCLC number
- BHL Portal needs to incorporate the new digital manifestation, OCLC record, ISBN, etc.
- Michael, Chris and Suzanne
- OCLC assign OCLC numbers for digital manifestations outside the Bowker group – how, what does the new record look like etc.
- Cathy and Suzanne discusses Bill C
- Bowker need corrected data as we clean up? How or does this effect identifier.
- OCLC can we edit the digital version, editing level in OCLC, do we need to keep in synch?
- Review of OCLC’s Bowker mapping of print to e-version and Bowker metadata needs
- ILS overlay over portal, investigate graduate student to work with MoBot team
- Martin, John F. and John M
- Searching: Date sort, date search, complex searching and expanded full text of all record. Review WorldCat.org and MBL’s advanced search. ZooRecord, Itunes.
- Diane , Doug, Don, Michael, Chris, Suzanne
Data sources outside our current membership scanning
- Investigate IA content (subset or other scanning places): processing with the DeDupper and the Serial Bid list. Having these tools and the portal and IA data working together
- John F., Bernard, Michael and Chris
- Explore ramification so incorporating grabbing data from IA.
- Institutional Council importance of communication down to us about decisions regarding the acceptance of data outside of our established workflow (e.g. California Digital Libraries, etc)
- Volunteer hours from people interested in the project; “in kind gift” to help and get them “credit” to the BHL contribution.
- Framework established that can then be used to fit the idea and projects for interns and graduate students in the development of BHL portal and data sets.
- Develop in parallel new workflows – BHL Europe with money to help us to learn from our pilot project. Task to BHL funding now or BHL Europe with start in January. Tools need maturity. Workflow documentation is needed to show how this works. Scanning vendor neutral.
- Bernard, Chris and Martin
- Remind EC about paper reviewing OCLC collection analysis tool and recommendation using money from OCLC Analysis and redirect to new ways of working with our data and establishing robust tools for upcoming incorporation of BHL Europe and other data sources.
Additional Scanned Data in IA
- Investigate IA content (subsets such as California Digital Library) and incorporating into DeDuper and Serial Mashup Bidding List. General communication of these tools with IA and the Portal
- John F., Michael, Chris, Bernard
- Investigate ramification of incorporating data from IA – grab and go
Internet Archive
- Definitions of Quality Assurance, what it means across all the scanning centers and operations, statistical analysis, 100 percent; review PDFs vs Images.
- Identify if IA has made an identifier dark, semi-dark, missing, suppression.
- All BHL Staff inform Michael and Chris
- Redirects as needed from IA made dark or darkish
- Every other week call with IA with a representative from each scanning libraries. Topics to include ASAP: IA Issues: Bound withs – NY and DC not. Boston yes.; Electronic lists of rejected information. NY not Wonderfetching ™. Boston’s communication with book loaders to BHL library on questions and issues. Images of pages and questions and emails.
- Martin to find representatives from each scanning library.
- IA clarity on the ability to completely dark (identifier does not pull anything up at all) and partially dark – (dig in identifier to metadata to find “frozen” or something). We need to know how to go completely dark and communicate to Portal when dark.
- Boston, NY, DC foldouts, doing them and quality. Foldout summit to include Library of congress
- Martin to organize with Bernard, Diane, Don
- Billing structure needs to be reviewed by the EC group to look at the BHL portal as the billing amount; statistics are recorded etc.
- Scanning consistencies with pods and individual set ups: Wonderfetch™; bound-withs, electronic invoices, electronic pick/packing lists returned to sites; fold outs to a common standard, consistencies of prices, serial volume stop start with in bound works
- Martin with Tom G on IA calls. Martin, Chris and Doug emphasize to Executive Committee
- Executive Committee clearly informed on the issues the individual institutions are finding with the scanning and set up and changes need to be done with in a specific time frame
- Alternative scanning vendor. Investigate other options. We need to start a group to investigate other vendors – some BHLers have alternatives NYBot and IMLS and others.
- Bernard, Kevin, Doug, John M. , Keri and Martin investigators.
- Contribute to the Wiki page of concerns IA page issues and concerns. Review and add general and specifics
Vetted bibliographies of wish lists to digitize
- Sub group to look at combining bibliographies: Bib app from Michigan potential help with bibliographies code4 lib code.google.com/p/bibapp/ ; Zotero commons another group ; add to the new bhl employee
- Pull all lists together in master list publish it out Reference management system to manage bibliographies until something more robust.
- Julius at NH London; Bernard, Chris, John F.
- The era of Ready Fire Aim is now over!
- All BHL Staff coordinate efforts
Article Repository topics
- Continue build your own IR. Investigate aggregation of these to pull into BHL Article Repository an development
- Chris and Michael and Phil
- Status AMNH repository plan
Executive Council Issues
- Communication with the Executive Council and decisions and projects to BHL Staff specifically the inclusion of the other IA data (CDL, etc.)
- Investigate volunteers who are excited about our project who can participate in by time and expertise and get credit as a “member” of BHL
- Chris and Martin and Doug
- Framework of where projects could be plugged in for volunteers and grad students to work on BHL portal and data sets
- BHL become a legal entity could help with some legal issues etc. Possibly time to review that status by EC.
Future Meetings and Events:
- Request Executive Counsil to budget a BHL Staff meeting once a year
- Schedule a monthly BHL Staff call 10 Central, 11 Eastern, 4 GMT. Doodle a standing date a month. Ask Betsy K if she would like to join. Coordinate to have the call before the Executive Counsel call so items can be brought to their attention.
Parking Lot
- Parking Lot: What should we do about the things we know we lack, can’t scan, etc that we are going to (for now) record on the wiki?
- Parking Lot: Future comparison of what has been rejected and why. Everyone will be recording reject their own way for now.
- Parking Lot: Monograph rejects - Fill in process – NexGen BHL
- Parking Lot: Future we need user analysis of BHL
- Parking lot: tracking the article repository concept. Fedora as the repository pulling together other sources as a proof of model. BHL Portal could search across articles and book structured data. Youtube model of not a free for all but more of an ability to take down things “in appropriate” – copyright etc. ArcX ? group. Safe harbor model?
- Parking Lot: Adding in missing pages down the road –out side of the IA workflow
- Parking Lot: Frankenbook investigation.
- Parking Lot: Other vendors for scanning, how we can edit the metadata etc.