tag:blogger.com,1999:blog-19756814.post6484295824252443064..comments2024-02-24T21:57:45.025-08:00Comments on ChrisFreeland.com: BHL metadata improvements & impact on exports / cached dataChris Freelandhttp://www.blogger.com/profile/04291470081749543282noreply@blogger.comBlogger8125tag:blogger.com,1999:blog-19756814.post-88424868812037810952022-04-28T02:39:21.022-07:002022-04-28T02:39:21.022-07:00Hey, I am Albert sir. Awesome Article you have sha...Hey, I am Albert sir. Awesome Article you have shared. Thank you! Many people ask, <a href="https://india-visas.org/indian-e-visa-fee/" rel="nofollow">How much does an e visa cost for India</a> & how much is an e visa for India? The cost of an India visa depends on your nationality and your visa type.<br />Alberthttps://www.blogger.com/profile/09248744782323379317noreply@blogger.comtag:blogger.com,1999:blog-19756814.post-69866232548404586392021-12-05T10:46:47.572-08:002021-12-05T10:46:47.572-08:00This is a wonderful inspiring article. I am practi...This is a wonderful inspiring article. I am practically satisfied with your great work. You have really put together extremely helpful data. Keep it up... Continue... The current process is hassle free, and you can read all kenya visa requirements for US citizens through the Kenya Evisa website, and it only takes 5-10 minutes to complete the Kenya visa application online It will take<br />kenya visa requirements for us citizenshttps://evisakenya.net/kenya-visa-for-us-citizens/noreply@blogger.comtag:blogger.com,1999:blog-19756814.post-67625372419908467462021-11-17T23:37:11.532-08:002021-11-17T23:37:11.532-08:00Your style is unique compared to other people from...Your style is unique compared to other people from whom I have read material. Thanks a lot for posting when I get a chance, guess I'll bookmark this site. US Citizen Travel to Turkey. Complete the online application to Turkey form on the Turkish eVisa website and confirm your application. Within minutes.<br /><br />US Citizen Travel to Turkeyhttps://turkey-e-visa.com/turkey-visa-for-united-states-of-america-citizens/noreply@blogger.comtag:blogger.com,1999:blog-19756814.post-88014329434579165902010-04-03T13:17:30.461-07:002010-04-03T13:17:30.461-07:00"You can thus describe and cite this digitize..."You can thus describe and cite this digitized book as either 'Some mollusks from Afghanistan' or as 'Fieldiana. Zoology, new series, No. 1'".<br /><br />Is this true? Even though the content may be identical, they are different publications, different references and will need different IDs. The apparent duplication is unavoidable because you cannot assume that, although declared to be the same thing, minor 'corrections' have not been made between one version and the next.<br /><br />And as you allude, in matters of taxonomic priority, 'there can be only one'. It is surprising how frequently this parallel publication occurs in the real world. From my perspective, once is too often. :)<br /><br />jimJimhttps://www.blogger.com/profile/02771667084922742749noreply@blogger.comtag:blogger.com,1999:blog-19756814.post-57514321123475709332010-04-03T05:21:03.652-07:002010-04-03T05:21:03.652-07:00Cataloging (metadata) issues, apart from outright ...Cataloging (metadata) issues, apart from outright errors such as the one mentioned, can be nebulous and subjective. Cataloging is as much art as science, developing over many decades within each institution to fitted to their particular collections, preferences, and patrons. BHL faces a remarkable challenge to integrate all this diversity, inconsistency, and duplication into one homogeneous and useful collection. But at the same time, BHL has the unique quality of being a digital library, a virtual library, with all that implies. In a conventional library, a monograph in series can only be shelved as a separate monograph or as a member of the series. In BHL, it can be both. It can, in effect, be in two places at once. Journals may be cataloged under any number of titles, first, best known, or current, American or British style. For example, is the journal called Nachrichtsblatt der Deutschen Malakologischen Gesellschaft, Nachrichtsblatt der Deutschen Malakozoologischen Gesellschaft, <br />or Archiv fuer Molluskenkunde? On BHL, it’s the second. In another library it is the first and third. A search of BHL for any one of them could, and should, find the journal.<br /><br />There are both advantages and problems in multiple copies of items existing in BHL. I think the downside of duplication, messing with statistics of the sort Rod Page likes and Pivot displays and duplication of index terms, is outweighed by the positives. Shortcomings in one copy may be corrected in another. Eccentricities in one volume may be confirmed as general, or not, by a second. In some copies, issue wrappers (helpful in dating the contents) are bound in, while not in others. Useful manuscript notes, indexes, and so on are sometimes to be found in one copy but not another.<br /><br />OCR errors in two copies of the same work are different, resulting in different indexing and search results, thus improving the chances of a name being found in at least one of them. In the three instances that I counted, the number of name page instances that the second copy added were 42+2 (5%), 107+20 (19%), and 33+7 (21%), 16% overall.<br /><br />A final thought, and I will stop babbling. It would be useful for there to be a virtual notes page in each Item on which patrons could record useful information about the item such as “Issue wrappers bound at end”, “Title page and table of contents inserted before index” or “Exact dates of publication listed in front matter of volume 32.”<br /><br />Pat LaFolletteAnonymousnoreply@blogger.comtag:blogger.com,1999:blog-19756814.post-76587493532691763982010-04-03T05:19:53.231-07:002010-04-03T05:19:53.231-07:00Chris, you have opened the door for a flood of acc...Chris, you have opened the door for a flood of accumulated notes and comments.<br /> <br />BHL is an entirely new kind of library, something never before seen. This presents an array of problems not faced by traditional libraries, but also creates the opportunity to experiment, and to solve problems in new and useful ways. BHL fills a variety of needs, and creates new needs not anticipated before BHL made them possible. <br /> <br />Need for link stability: In the application I am collaborating with Evgeniy Meyke to develop, we link taxon names to the specific page images on which they occur. Many of these pages are from BHL. Others are from other Internet sources, author’s PDFs, and my own literature scans. Regarding BHL content, I extract relevant papers from BHL Items, and keep them locally. I also store the BHL PageID and ItemID, but these only come into play when sharing data with others. For raw BHL search results, however, only the PageID and ItemID are stored. These search results may age for some months while I evaluate and process them. Thus far I have encountered only one missing item, Proceedings of the USNM, volume 41 (http://www.biodiversitylibrary.org/item/53811). I inquired about what may have happened to it a month ago today. You said that it had been “marked offline” due to serious errors, and would be rescanned and reindexed. (As of today it is still offline.) This was the first indication we had that there might be a problem with BHL identifier durability. <br /><br />I also store the sources of bibliographic citations, and reviews of papers that I may need to consult again, from, for example, Revue Critique de Paléozoologie or Archiv für Naturgeschichte. For these, I store only the BHL page link, not the page or article itself. I now understand that these references are fragile, in that if the pageID changes, the source will be lost. <br /><br />For my purposes, then, the reliability of PageID and ItemID references are most important, TitleID not so much. The needs of others are, of course, different. One example: Robert Moore or the Pacific Conchological Club has been compiling a catalog of works pertaining to Mollusca that are available on the Internet for more than a year. His bibliography, in the form of five PDF documents (http://mysite.verizon.net/tjrutkas/page7.html ) includes many BHL items, though I believe all of the links are by way of IA URLs. If an item goes dark on IA, his link would be broken.<br /><br />It is clear that some changes are unavoidable. The question, then, is what can BHL do to mitigate the impact? There is no getting around the copyright issue you mentioned. The item must be removed. But rather than simply deleting all trace of it, perhaps the title, its item(s), and the pages could be replaced with a message explaining why it is no longer available. <br /><br />Regarding BHL items with errors, is it really necessary to completely remove and replace them? In some cases, perhaps. But I am guessing that in most cases it is a matter of tradeoffs and the relative value placed on expedience vs. stability. With the existing workflow and division of labor, it is surely easier to replace an entire volume than to fix it. Based on the small sample of BHL items I have personally examined (~2%), the few problems I have encountered consist of a few missing or poorly photographed pages, or the original book itself is defective. Defective items could be repaired, preserving the existing itemID and pageID links. The books should not be removed from availability while being repaired. If it is the book, and not the copy, that is defective, it should be left as is, a second copy added if opportunity presents. When a volume must be replaced, the original item and pages should be excluded from indexing but left accessible to external links and include a referral or link to the replacement Item. I no longer report content problems because I don’t want to be responsible for causing the items to be removed, even temporarily, from availability on BHL.<br /><br />To be continued...Anonymousnoreply@blogger.comtag:blogger.com,1999:blog-19756814.post-30153996277889545922010-04-02T02:15:51.415-07:002010-04-02T02:15:51.415-07:00Thanks for the explanation Chris. I guess I encoun...Thanks for the explanation Chris. I guess I encountered these problems because for various reasons (such as changing structure of the BHL data dump) I've found it easier to use the original download of BHL data I got last year, and then scrape the BHL web page for specific items as I need them. Sometimes I get caught out by the changes you describe, so that my local version of BHL is out of sync with BHL itself. <br /><br />ItemIDs being reassigned to new TitleIDs can cause problems, as my code has a set of rules for some titles that require special treatment (e.g., in cases where BHL doesn't have enough metadata for me to accurately find an article). I guess there's no solution other than manually investigating when things go wrong.<br /><br />In the example you describe, I hadn't bothered to check that the problem was at the title level. I saw "Lietuvos TSR Mokslu Akademijos Darbuotoju Knygu ir Straipsniu Bibliografija" and assumed the item was part of that title, rather than being a volume of Proc Zool Soc assigned to the wrong title.Roderic Pagehttps://www.blogger.com/profile/00269598293846172649noreply@blogger.comtag:blogger.com,1999:blog-19756814.post-31892218018803046652010-03-27T09:37:59.214-07:002010-03-27T09:37:59.214-07:00Random first thought (which plays havoc with "...Random first thought (which plays havoc with "unique". Could multiple BHL identifiers (for items that fall into the classes described above) all resolve to a new "standard" BHL identifier?UDCMRKhttps://www.blogger.com/profile/14068983914064117898noreply@blogger.com