<?xml version="1.0" encoding="utf-8"?>
<feed version="0.3" xmlns="http://purl.org/atom/ns#" xmlns:dc="http://purl.org/dc/elements/1.1/" xml:lang="en">
  <title>Control freak</title>
  <link rel="alternate" type="text/html" href="http://blog.lib.umn.edu/s-hear/control/" />
  <modified>2008-07-01T23:59:51Z</modified>
  <tagline>Library authority control: Q&amp;A and comments</tagline>
  <id>tag:blog.lib.umn.edu,2008:/s-hear/control//1227</id>
  <generator url="http://www.movabletype.org/" version="3.33.uthink">Movable Type</generator>
  <copyright>Copyright (c) 2008, s-hear</copyright>
  <entry>
    <title>I sometimes see odd codes at the beginning of call numbers when doing a call number search. What are they? Do we need them?</title>
    <link rel="alternate" type="text/html" href="http://blog.lib.umn.edu/s-hear/control/133703.html" />
    <modified>2008-07-01T23:59:51Z</modified>
    <issued>2008-07-01T17:53:25-06:00</issued>
    <id>tag:blog.lib.umn.edu,2008:/s-hear/control//1227.133703</id>
    <created>2008-07-01T23:53:25Z</created>
    <summary type="text/plain">Most of these are location codes for other libraries which came in as part of records from RLIN. We don&apos;t need them, but they&apos;re hard to get rid of. For more detail, read on....</summary>
    <author>
      <name>s-hear</name>
      
      <email>Stephen.S.Hearn-1@tc.umn.edu</email>
    </author>
    <dc:subject>Call numbers</dc:subject>
    <content type="text/html" mode="escaped" xml:lang="en" xml:base="http://blog.lib.umn.edu/s-hear/control/">
      <![CDATA[<p>Most of these are location codes for other libraries which came in as part of records from RLIN. We don't need them, but they're hard to get rid of. For more detail, read on.</p>]]>
      <![CDATA[<p>Aleph uses an expand program to copy the 852 on holdings records linked to a bib into the extended version of the bib record--the one that appears in the lower Cataloging pane when you search, select, and click Show. This 852 is used to create the various call number indexes, and needs to be indexed. At the same time, many of our old RLIN records have retained a bib 852 for the original inputting library. These numbers have converted with the same coding used in our local bib 852s; so these codes appear in the index, too. For example, the code "sml" converted as a Yale sublibrary code, so if you do a LC/NLM Shelflist search for beginning "sml ..." you see lots of Yale 852s. </p>

<p>We don't need to retain these, but neither can we eliminate all the bib 852s, since the local ones are necessary. From time to time Database Management has had a student working at deleting the bib 852s for non-local institution codes, but that takes a certain familiarity with our local codes; and since the public call number indexes are not prone to displaying these, it's a low priority problem.</p>

<p>They really only appear in the Shelflist indexes, where Aleph includes the 852 $b $c in the index string. If the search you do in a shelflist index begins with a local sublibrary/collection code combination (e.g., twils gen ...), these old RLIN codes aren't a problem. They only appear when you look past the numbers beginning with local loc codes. Again, that makes these lower priority for maintenance. Still, it's annoying to see these pop up, and eventually we'd like to get rid of them.</p>]]>
    </content>
  </entry>
  <entry>
    <title>What is NACO normalization? Should I use it?</title>
    <link rel="alternate" type="text/html" href="http://blog.lib.umn.edu/s-hear/control/089846.html" />
    <modified>2007-09-26T16:47:13Z</modified>
    <issued>2007-09-26T09:41:05-06:00</issued>
    <id>tag:blog.lib.umn.edu,2007:/s-hear/control//1227.89846</id>
    <created>2007-09-26T15:41:05Z</created>
    <summary type="text/plain">NACO normalization is a set of rules for eliminating small differences from character strings. Normalization is good, but NACO normalization isn&apos;t good in all cases. For more information, please click &quot;Continue reading.&quot;...</summary>
    <author>
      <name>s-hear</name>
      
      <email>Stephen.S.Hearn-1@tc.umn.edu</email>
    </author>
    <dc:subject>General</dc:subject>
    <content type="text/html" mode="escaped" xml:lang="en" xml:base="http://blog.lib.umn.edu/s-hear/control/">
      <![CDATA[<p>NACO normalization is a set of rules for eliminating small differences from character strings.  Normalization is good, but NACO normalization isn't good in all cases.  For more information, please click "Continue reading."</p>]]>
      <![CDATA[<p>Normalization is the practice of simplifying character strings for comparison and sorting.  Typically normalization removes case differences from letters by making them all capitals or all small letters; removes punctuation that might vary accidentally between two headings for the same thing, or might complicate sorting; changes combined characters and characters marked by diacritics into a simpler, unmarked forms; etc.</p>

<p>NACO normalization is primarily a set of rules for normalizing MARC encoded name and title headings.  This means that NACO normalization goes beyond the rules listed above to include some more specialized rules needed for its particular task.  For example, the first comma in a personal name heading usually marks a distinction between surname and forename, and NACO normalization retains it.  A name heading in inverted form with a comma and in direct order form without a comma are not considered to be the same, e.g., "Chandra, Prakash" and "Chandra Prakāsh" are both valid NACO headings, and the first comma rule allows them to be distinguished after normalization.  </p>

<p>MARC headings are divided into subfields.  In name headings, there are cases where the same data element can be found coded with different subfield values, one current and the other obsolete.  NACO normalization deals with this by removing the subfield value, but leaving the subfield position marked; i.e., $a $b and $c get normalized to $-, $-, and $-.  This means that insignificant or erroneous differences in subfield code values do not affect comparison and sorting, so "$a Microbiology Conference $b (1st ...)" and "$a Microbiology Conference $n (1st ...)" can index the same; likewise "$a Smith, John, $d 1953- " and "$a Smith, John, $c 1953- "</p>

<p>However, outside of MARC name and title headings, the NACO normalization rules do not necessarily lead to good results.  Subject headings sometimes use the same words to mean different things, with the difference signified by the subfield code used.  For example, under current coding practice, "... $x Portraits" and "... $v Portraits" mean different things when they appear in a subject heading string (the former is a work about portraits of a person, while the other is a collection of those portraits).  Treating them as identical is not necessarily the best choice--though neither is indexing them separately with no indication of what the difference between them is, but that's a different discussion.  Similarly,  the first comma in a topical subject string does not mark a valid distinction between to headings; e.g., "Short stories, African" and "Short stories African" would not both be valid, separate headings, and allowing normalization to treat them as identical prevents an inadvertantly omitted comma from disrupting filing order.</p>

<p>There are also kinds of normalization which NACO normalization does not include, but which have proven very useful.  NACO normalization makes no changes to numbers, but there are standardized changes that can be made to numbers in specified subfields that will enable them to file in a numeric rather than a decimal order (i.e., 3, 21, 111 instead of 111, 21, 3).  This type of normalization may be judged appropriate, despite not being part of NACO normalization.</p>

<p>More details about NACO normalization can be found at http://www.loc.gov/catdir/pcc/naco/normrule.html .</p>]]>
    </content>
  </entry>
  <entry>
    <title>How should I cite a name which is used on older LC bib records, but not established?</title>
    <link rel="alternate" type="text/html" href="http://blog.lib.umn.edu/s-hear/control/045164.html" />
    <modified>2006-05-02T18:41:52Z</modified>
    <issued>2006-05-02T12:10:39-06:00</issued>
    <id>tag:blog.lib.umn.edu,2006:/s-hear/control//1227.45164</id>
    <created>2006-05-02T18:10:39Z</created>
    <summary type="text/plain">Older forms found on LC bib records raise a number of questions. Click &quot;Continue reading ...&quot; below for an extended discussion which covers a number of them....</summary>
    <author>
      <name>s-hear</name>
      
      <email>Stephen.S.Hearn-1@tc.umn.edu</email>
    </author>
    <dc:subject>Names</dc:subject>
    <content type="text/html" mode="escaped" xml:lang="en" xml:base="http://blog.lib.umn.edu/s-hear/control/">
      <![CDATA[<p>Older forms found on LC bib records raise a number of questions.  Click "Continue reading ..." below for an extended discussion which covers a number of them.<br />
</p>]]>
      <![CDATA[<p>For 670s generally, you cite the source you saw. When a 670 cites "LC manual cat.", it means that someone at LC actually looked in their old card catalog.  If you found information by looking in the online LC catalog (which I did for the example used here), that can be reported as follows:</p>

<p>670  LC database, May 2, 2006 $b (hdg.: Fisher, Warren Samuel, 1878- ; usage: W.S. Fisher)</p>

<p>Note that we're citing all the information we found in the file related to the heading, not just a single record. (Source for this in Cataloger's Desktop: Descriptive Cataloging Manual: Public Sections/Z1. Name and Series Authority Records/Variable Data Fields/670 Source Data Found/Special types of citations/LC database.)</p>

<p>The first 670 in an authority record should be for the work being cataloged.  In this example, that work shows "Warren S. Fisher."  That usage form combined with the additional information found in the LC database and constructed according to AACR2 gives the heading form</p>

<p>100 1  Fisher, Warren S. $q (Warren Samuel), $d b. 1878</p>

<p>The usage information in the 670 above would justify an added 400:</p>

<p>400 1  Fisher, W. S.$$q(Warren Samuel),$$db. 1878</p>

<p>LC would not make a reference for the unestablished "old catalog" heading "Fisher, Warren Samuel, 1878-", since it's not constructed according to AACR2 rules. It would only be added as a pre-AACR2 reference with "$$w nna" if it had been established with an online authority record in that form. If we submit an authority for Fisher through NACO, it will omit the "Fisher, Warren Samuel, 1878-" form. I know that doesn't make practical sense, since this is the form most likely to be encountered on older catalog records; but that's the rule we work under. Once the bib headings are updated in MNCAT, our need for the pre-AACR2 reference will drop off, too.</p>

<p>Even if you have a source where the author appears as "Warren Samuel Fisher," LC would be disinclined to add a 400 for "Fisher, Warren Samuel, b. 1878", since the rule is to disregard differences past the first forename except in unusual cases when making references; so this reference would be considered--well, not exactly the same as, but insufficiently different from "100 Fisher, Warren S. (Warren Samuel), b. 1878".<br />
</p>]]>
    </content>
  </entry>
  <entry>
    <title>Which libraries hold the analytic volume I&apos;m looking for?</title>
    <link rel="alternate" type="text/html" href="http://blog.lib.umn.edu/s-hear/control/039773.html" />
    <modified>2006-03-02T16:53:57Z</modified>
    <issued>2006-03-02T08:58:29-06:00</issued>
    <id>tag:blog.lib.umn.edu,2006:/s-hear/control//1227.39773</id>
    <created>2006-03-02T14:58:29Z</created>
    <summary type="text/plain">When you look for holdings for an analyzed volume--i.e., a volume that&apos;s part of a cataloged set or series, but that also has its own catalog record--it matters which place you look for the holdings information in the MNCAT public...</summary>
    <author>
      <name>s-hear</name>
      
      <email>Stephen.S.Hearn-1@tc.umn.edu</email>
    </author>
    <dc:subject>Aleph</dc:subject>
    <content type="text/html" mode="escaped" xml:lang="en" xml:base="http://blog.lib.umn.edu/s-hear/control/">
      <![CDATA[<p>When you look for holdings for an analyzed volume--i.e., a volume that's part of a cataloged set or series, but that also has its own catalog record--it matters which place you look for the holdings information in the MNCAT public catalog.  The "Availability Link" list on the brief display will generally be more complete than the "Availability" line holdings on the full record display. For details, please read the full entry.</p>]]>
      <![CDATA[<p>I'm learning more about how Aleph works in the process of relinking a number of analytics to their set or serial record, where their item records are stored.  One recent discovery is that the holdings listed for an analyzed volume in the "Availability Link" column in the brief MNCAT public catalog display are more complete than the list of holding locations that appear on the full record display.  Understanding how Aleph stores and links to item records can help explain this difference.</p>

<p>In Aleph, the bibliographic record is linked to other records, including the holdings record and the ADM or "administrative" record.  The holdings record gives information about a location where a copy of the title is held and summary information about what parts of a set or serial are held.  The ADM record is the record to which all the item records for the title are attached.  An item record can also be linked to a particular holdings record.  When such a link is in place, Aleph is able to display holdings just for that location when asked to do so in the public catalog.  Still, all the items for all locations are tied primarily to the ADM record.  Lastly, an analytic bib can be linked via a LKR ("linker") field to an item on a set/serial record.  Each piece can have only one item record.  Aleph's ability to link two bib records to the same item has given us linking options we never had in our old NOTIS system.</p>

<p>When an analyzed volume's bib record is linked to the corresponding item record tied to the corresponding set or serial record, the item information can be displayed in the public catalog when the analytic record is viewed.  There must be a match between the analytic record's holdings record's sublibrary/collection codes and the codes on one of the set/serial record's holdings records; but once that match on sublibrary/location is made, all the items linked to the set/serial ADM record which match the analytic LKR field's enumeration data will be displayed in the brief display's "Availability Link" column.  Each item carries sublibrary location data which it borrows from the holdings record to which it's linked, but every item tied to the ADM whose enumeration values match the LKR field will be displayed; so effectively, items in several locations can appear in the "Availability Link" column even though the analytic bib record has a holdings record for only one of the locations.  </p>

<p>By contrast, when the full bib record is displayed in the OPAC, it shows data only from the holdings records linked to it in the line labeled "Availability."  If only one location has a holdings record, only that location will show in the full display, and only its items can be called up by clicking the link.  Since many analytic records were originally created for a specific collection, many do not have holdings records for all other copies in the system.</p>

<p>So, for the most complete listing of copies held, check the Availability Link column on the brief display.  There can still be other circumstances (e.g., differing Enum values on the items) which will cause one or more location's copies not to display here; but the absence a holdings record tied to the analytic bib record will not be one of them.</p>]]>
    </content>
  </entry>
  <entry>
    <title>Adding death dates</title>
    <link rel="alternate" type="text/html" href="http://blog.lib.umn.edu/s-hear/control/037059.html" />
    <modified>2006-02-02T17:52:35Z</modified>
    <issued>2006-02-02T11:46:31-06:00</issued>
    <id>tag:blog.lib.umn.edu,2006:/s-hear/control//1227.37059</id>
    <created>2006-02-02T17:46:31Z</created>
    <summary type="text/plain">The full entry below includes the announcement from LC&apos;s Cataloging Policy and Support Office of how LC will implement the new policy regarding adding death dates to personal name headings with open birth dates, and how we will be implementing...</summary>
    <author>
      <name>s-hear</name>
      
      <email>Stephen.S.Hearn-1@tc.umn.edu</email>
    </author>
    <dc:subject>Names</dc:subject>
    <content type="text/html" mode="escaped" xml:lang="en" xml:base="http://blog.lib.umn.edu/s-hear/control/">
      <![CDATA[<p>The full entry below includes the announcement from LC's Cataloging Policy and Support Office of how LC will implement the new policy regarding adding death dates to personal name headings with open birth dates, and how we will be implementing the policy locally. </p>]]>
      <![CDATA[<p>Please note two things in the LC announcement below:</p>

<p>1. There are pieces of the process for managing these updates (an RSS feed from OCLC, the list of 300 headings that LC will be doing) that are not yet in place. NACO participants, please refrain from making these death date additions until these pieces appear.</p>

<p>2. LC is limiting this practice in general to cases where a personal name heading already has an open birth date; and that at LC, except for a list of "300 prominent persons" to be posted soon as a special LC project, the practice will be limited to names that come up on current cataloging or conflict resolution. Through our participation in NACO we will be able locally to add death dates to LC/NACO Authority File (LNAF) headings; but I would encourage NACO participants to be similarly conservative in their approach to this until the volume and impact of these changes in the LNAF is clearer.</p>

<p>As these changes occur, Database Management will endeavor to update the authorities in UMN10 and on the corresponding bib records. We still do not have a means of automatically loading or overlaying new or updated individual authorities from LC, and have not yet implemented routine batch updating of the authority records in UMN10. For now, staff should feel free to edit the date fields in personal name authority records when they find that the LNAF record at http://authorities.loc.gov has been updated, or to report such cases to Database Management, where we will also be working to keep up with the changes as they are reported on the OCLC RSS feed.</p>

<p>If you have questions about implementing this revised practice, please let me know. I'll be posting a version of this message to the "Control Freak" authorities blog available via the Communications/Blogs & Mailing Lists link on the Tech Services home page.</p>

<p>Stephen</p>

<p>Date:         Wed, 1 Feb 2006 16:09:27 -0500<br />
From: CPSO <cpso@loc.gov><br />
Subject: [PCCLIST] LC Policy statement on LCRI 22.17 and BFM reminder</p>

<p>The LCRI 2005 Update, no. 3-4 is now available on Cataloger's Desktop.  Included in this update is revised LCRI 22.17, that now contains an option for catalogers to add death dates to personal name headings with open dates.  Although NACO institutions may apply the option (or not) in the manner that best utilizes their cataloging resources, the following information on how the Library of Congress will implement the policy change is provided for your information:</p>

<p>1.  In order to begin application of this policy in a manner that minimizes the impact on LC's catalog (and the ripple effect on catalogs worldwide) and maximizes the efficient use of cataloging resources LC's policy on addition of death dates will be the following:</p>

<p>Unless working on *authorized special projects, LC catalogers are          asked to apply the option to add death dates to existing personal name authority records with open dates only when a personal name heading is being used on bibliographic records currently being cataloged and/or in the course of routine authority file maintenance (e.g., resolving conflicts between different persons). </p>

<p>Once the impact of this change is fully understood, a more liberal policy for adding death dates may be announced in the future.</p>

<p>2.  Please be reminded that dates (either birth or death dates) should not be added to personal name headings on existing authority records that currently do not have any dates, except in cases of conflict, according to current policy.</p>

<p>3.  If a name authority record is being established for the first time, birth and death dates may be added to the heading, even if a personal name heading without dates for the same person exists on bibliographic records in LC's catalog.</p>

<p>Bibliographic File Maintenance for NACO Libraries</p>

<p>As long as the only change to the heading is the addition of a death date, NACO libraries are not required to report these changes to LC.  In response to a request from LC, OCLC has  agreed to provide an RSS feed that will serve as an alert service for authority records to which death dates have been added.  LC and the PCC are grateful to OCLC for providing this service that will greatly assist libraries in the maintenance of their catalogs-- an announcement regarding the details of the service will be forthcoming from OCLC. </p>

<p>Note that if the heading warrants other changes (besides the addition of a death date), and is thus no longer a one-for-one change, then BFM should be reported to the coop liaison as usual (cf. BFM FAQ at: http://www.loc.gov/catdir/pcc/naco/bfmguide.html  This BFM FAQ and other relevant FAQs will be updated to include information in regard to the addition of death dates.</p>

<p>Catalogers are reminded that when making a change to a date in a 1XX all references must reflect the change and all related NARs must also be changed, e.g., name/title NARs, and BFM reported if appropriate. </p>

<p> * CPSO will shortly be announcing a special project to add death dates to the headings of about 300 prominent persons.  To minimize duplication of effort, the announcement will include the names of the persons involved.</p>

<p>Questions, comments, etc. should be sent to cpso@loc.gov </p>]]>
    </content>
  </entry>
  <entry>
    <title>Is ontology overrated?</title>
    <link rel="alternate" type="text/html" href="http://blog.lib.umn.edu/s-hear/control/022014.html" />
    <modified>2005-11-28T19:13:37Z</modified>
    <issued>2005-05-23T09:50:38-06:00</issued>
    <id>tag:blog.lib.umn.edu,2005:/s-hear/control//1227.22014</id>
    <created>2005-05-23T15:50:38Z</created>
    <summary type="text/plain">Clay Shirky has posted a piece titled &quot;Ontology Is Overrated: Ontologies, Links, and Tags&quot; at http://shirky.com/writings/ontology_overrated.html The piece has attracted some interest among library staff. The rest of this entry is a set of notes in response to points raised...</summary>
    <author>
      <name>s-hear</name>
      
      <email>Stephen.S.Hearn-1@tc.umn.edu</email>
    </author>
    <dc:subject>General</dc:subject>
    <content type="text/html" mode="escaped" xml:lang="en" xml:base="http://blog.lib.umn.edu/s-hear/control/">
      <![CDATA[<p>Clay Shirky has posted a piece titled "Ontology Is Overrated: Ontologies, Links, and Tags" at http://shirky.com/writings/ontology_overrated.html  The piece has attracted some interest among library staff. The rest of this entry is a set of notes in response to points raised by Shirky.  Though I agree with his basic point about the viability of uncontrolled user tagging of web content as a means of extending access, I also contend much of his case that ontology is "overrated"  is overstated.  The comments in the extended entry here will be easier to follow if the Shirky piece is read first.</p>]]>
      <![CDATA[<p>It is not universally true that library categorization/classification systems try to anticipate all new categories.  Deweys system was originally designed to contain all possible topics, but LCC and LCSH both were designed to be responsive to literary warrant, i.e., to expand and incorporate new categories over time as they emerge, and NOT to try to map out all knowledge in advance. Even Dewey is responsive to the emergence of new categories of information.  The difficulty Dewey editors have squeezing large new areas of research into narrow predetermined class ranges is evidence of the hazards of attempting too fine an analysis in ones original classification.  But the mistake was Deweys, not LCs.  The LC system is more flexible, though it too has its overcrowded areas.</p>

<p>Noble gases is just an arguable label.  The category is consistent and coherent. The fact that its been arguably labeled doesnt call the success of the categorization/classification into question.</p>

<p>Classification places single items for retrieval.  It also places related items together.  Shirky seems to acknowledge only the former function, and to ignore the latter.</p>

<p>Before making assertions about how Yahoo views its users and how Yahoo perceives reality, it might be good to know more about what constraints their data storage and retrieval software impose on their definitions of categories and assignment of items to them.  The decision to assign categories to a single rather than multiple hierarchies may be a software constraint, not a philosophical position.</p>

<p>The assignment of an object to multiple categories is really managed by subject headings, not the addition of secondary classifications.   Are we going to get to LCSH? Apparently not.</p>

<p>Google search does not categorize at all.  It just retrieves on user-specified terms.  It sorts the results, but the sorting is not categorical.  On the other hand, Googles Directory is classificatory, and works a lot like Yahoo.   (Oh, but fewer people use it, so it must not have any value.)</p>

<p>Decisions about how to build brief representations of objects (i.e., catalog records) because the full content is inaccessible to machine manipulation are different in kind from the decisions made possible when using computers to analyze fully machine readable documents for retrieval.  Its like comparing the physical skills needed for professional football to those needed for PlayStation football. </p>

<p>Search vs. categorization is a false dichotomy.  Presuming that a user preference for search over the categories in a majority of cases proves categories have no use and are not desirable is a false leap of judgment.  If categorization is a bad idea, why does anyone use Yahoo?</p>

<p>The characteristics suggested for domains amenable to classification lean too heavily on one example, and misjudge the value of bringing like but not identical things together within a browsable frame.  It also misplaces the unit with which ontology is concernedi.e., not the elemental components, but the categories within which they fall. Its the logic of ideational collectives into which the elements fall that determines whether classification is useful, not the distinctions between the base level units being organized. </p>

<p>The point made about the loss of nuanced semantics when a preferred term is selected from several candidates is valid, to a point.  But it overreaches in implying that the differences between terms mean that there is no relationship of interest between them.  Someone looking for a conversation with like-minded others might appreciate the distinctions in LiveJournal, but someone researching the range of opinions on a topic would find the data gathering task under this system significantly more laborious.</p>

<p>The notion that classification/categorization is a one-time-for-all activity is false.  LCC adds new class numbers, LCSH adds new terms and revises old ones, DDC has its phoenix schedules.  The fact that these systems are dynamic rather than static does mean that they require more work than the nave might assume, but it does not disprove their utility.  (And Former Soviet Union is primarily a label for a part of the world.  If a better label were found, no massive reshelving at LC would be required to implement it.  Alternatively, if Shirky is suggesting that some other organization and sequencing of materials about the places in the former Soviet Union would be superior, is he not engaging in the dreaded fortune telling himself?)</p>

<p>URLs are also volatile, and can point to things that require payment or authentication to access.  For both of these reasons, user categorizing of links can be a wasted effort of no use to other users.</p>

<p>Many words, even words in natural language, contain an element of classification.  The fact that one can derive sets of similar objects from users natural language terminology does not mean that no classification has been employed.  It just means that the underlying classificatory functions of words have been invoked to derive a set of transpersonal categorizations.</p>

<p>The selectivity of tagged links in del.icio.us gets scant attention.  Given that it only includes URLs that someone has chosen to preserve, is it really a strategy for tagging all URLs? If it were to be extended that way, would it still be as useful?</p>

<p>Its also interesting that no comparison is made between del.icio.uss tagging of URLs by users of web pages vs the metadata tagging of URL-addressed pages by their creators.  Part of the utility of del.icio.us may derive from its aggregation of reviewers rather than creators tags. </p>

<p>Shirky also slights the role of language as a structured system that stands between individuals and the objects they describe.  The rules and conventions of language both enable and constrain both freedom and consensus in the use of terms.  Formal classificatory systems draw on the ability of language to represent logical relationships for shared understanding and use.  The informal ones that Shirky is advocating do the same, but with a lot less discipline and a higher risk of tagged items being unfindable because of idiosyncratic use of terms, or being findable only through a laborious series of searches for multiple terms.</p>

<p>For all the disagreement expressed above, I fully agree with Shirkys point that informal user-driven building of access paths and terminology is the only viable approach to making the bulk of web objects accessible.  I also agree that the consistencies and relationships that emerge when vast amounts of informally generated tagging data are aggregated have great utility.  I just dont think that the alternative methodologies he disparages or their appropriate applications have been treated fairly, or even well understood.  I also find the notion that classification is a kind of arrogant assault on freedom to be tiresome.  Formal classification is a game with rules.  Playing the game means accepting the rules, and thereby getting the benefits of a more disciplined approach to organizing information.  Shirkys boast that end users can apply terminology without having to agree with anyone else about how something should be tagged is analogous to arguing that the umpire has no right to insist on the number of bases that should be tagged before heading for home plate.  Its the attitude behind that sneering should that grates.<br />
</p>]]>
    </content>
  </entry>
  <entry>
    <title>WorldCat says we have this. How come I can&apos;t find it in MNCAT?</title>
    <link rel="alternate" type="text/html" href="http://blog.lib.umn.edu/s-hear/control/021565.html" />
    <modified>2006-04-05T22:39:49Z</modified>
    <issued>2005-05-11T09:06:45-06:00</issued>
    <id>tag:blog.lib.umn.edu,2005:/s-hear/control//1227.21565</id>
    <created>2005-05-11T15:06:45Z</created>
    <summary type="text/plain">MNCAT is our library of record, not WorldCat. For various reasons, WorldCat is an unreliable guide to what the University Libraries holds. For more detail, read on....</summary>
    <author>
      <name>s-hear</name>
      
      <email>Stephen.S.Hearn-1@tc.umn.edu</email>
    </author>
    <dc:subject>General</dc:subject>
    <content type="text/html" mode="escaped" xml:lang="en" xml:base="http://blog.lib.umn.edu/s-hear/control/">
      <![CDATA[<p>MNCAT is our library of record, not WorldCat.  For various reasons, WorldCat is an unreliable guide to what the University Libraries holds.  For more detail, read on.</p>]]>
      <![CDATA[<p>The libraries do all our original and copy cataloging and catalog maintenance in MNCAT, our local catalog.  We regularly export batches of records to OCLC and RLG, the two bibliographic utilities to which we belong.  However, this does not mean that OCLC's WorldCat and RLG's RLIN database accurately mirror what is in MNCAT.  </p>

<p>OCLC does not load all records immediately.  OCLC's matching algorithm is fairly complex, looking for multiple points of consistency.  When such things as title, year of publication, ISBN/ISSN, publisher, and system number all match up, OCLC promptly and reliably adds our holdings symbol to their WorldCat record.  However, if there are discrepancies, or if the record is new to OCLC, it is shunted off for manual review, which can be very slow.  Consequently, original University of Minnesota records (e.g., for theses) are often not found in WorldCat, or may be long delayed before finally appearing there.  Also, OCLC does not routinely remove our holdings symbol when we include "deleted" records in the batches we send, so many of the titles we appear in WorldCat to hold have actually been withdrawn or lost, as indicated in MNCAT.</p>

<p>Prior to our move to Aleph, RLIN was a more accurate mirror of MNCAT, because RLIN maintains a copy of each RLG member institution's record.  While in NOTIS, we would routinely batch update our bibliographic records in RLIN whenever changes were made to the bib record in MNCAT.  Since moving to Aleph, we have limited RLIN updates to cases where the MNCAT holdings record has been added or modified; so heading corrections no longer prompt an update of the RLIN copy of our record.  Still, RLIN and Eureka are currently a closer reflection of MNCAT than WorldCat.</p>

<p>MNCAT is our library of record.  MNCAT should be consulted to find out what we hold, and where it is shelved.  When WorldCat indicates that we have a record but MNCAT does not, WorldCat is wrong in the vast majority of cases; and many of the titles we do hold do not and may never appear in WorldCat.<br />
</p>]]>
    </content>
  </entry>
  <entry>
    <title>Why am I seeing double form subdivisions?</title>
    <link rel="alternate" type="text/html" href="http://blog.lib.umn.edu/s-hear/control/020138.html" />
    <modified>2005-11-28T19:10:15Z</modified>
    <issued>2005-04-20T11:32:12-06:00</issued>
    <id>tag:blog.lib.umn.edu,2005:/s-hear/control//1227.20138</id>
    <created>2005-04-20T17:32:12Z</created>
    <summary type="text/plain">Now that authority linking has been turned back on, please be aware of a temporary issue regarding the updating of headings with form subdivisions. Specifically, when you update a record with a form subdivision in $$v and the authority still...</summary>
    <author>
      <name>s-hear</name>
      
      <email>Stephen.S.Hearn-1@tc.umn.edu</email>
    </author>
    <dc:subject>Aleph</dc:subject>
    <content type="text/html" mode="escaped" xml:lang="en" xml:base="http://blog.lib.umn.edu/s-hear/control/">
      <![CDATA[<p>Now that authority linking has been turned back on, please be aware of a temporary issue regarding the updating of headings with form subdivisions. Specifically, when you update a record with a form subdivision in $$v and the authority still uses $$x for the same form subdivision, Aleph will double the subdivision, e.g., </p>

<p>Authority: 	Children $$x Prayer-books and devotions.<br />
Bibl hdg.: 	Children $$v Prayer-books and devotions.<br />
Upd bibl.:	Children $$x Prayer-books and devotions $$v Prayer-books and devotions.</p>

<p>The simplest solution is to follow the authorized form and use $$x. Eventually, these will be corrected to $$v; and in the meantime, $$v and $$x will interfile happily, so the coding difference will be invisible to users. For a more detailed explanation of the issue, read on.</p>]]>
      <![CDATA[<p>When Aleph checks and finds an authority which matches a bib heading, it replaces the bib heading with the authority heading. This serves to correct minor differences in capitalization, diacritics, punctuation, etc., and is generally a good thing. The problem arises because Aleph's sensitivity to coding variations when checking headings and when updating headings is different. When matching a bib heading to an authority, Aleph normalizes out coding differences (along with minor differences in capitalization, diacritics, punctuation, etc.). When Aleph updates a bib heading, it attends to coding differences; in this case, it sees the authority $$x and the bib $$v as different, and preserves both. This preservation of different subfields is good when the difference is that the bib heading includes subdivisions that aren't in the authority, such as $$e or $$4; but in cases like the one above, it can cause problems.</p>

<p>When Database Management and LEO staff were working through these issues in order to get flipped headings from the V14 to V16 conversion fixed and get automatic updating from authorities turned back on, we determined that in most cases, the doubling of form subdivisions was due to the presence of an updated authority containing $$v and older bib headings containing $$x. Most of our bib form subdivisions are still in $$x. To solve the doubling in these cases, LEO ran a job which identified all the authorities containing $$v and set the UPD=N flag in them. This means that Aleph will not update a bib record which appears to match one of these authorities, and will not mistakenly create doubled form subdivisions.</p>

<p>The other side of this coin is the authorities which still contain form subdivisions coded $$x. LC is gradually working through its authority files creating form subdivision authority records (using tag 185) and revising heading authority records which contain a form subdivision. Our authority file contains a mix of these updated and non-updated authorities. As LC completes its work and as we resume our batch updating of authorities with newer versions from LC, these instances of authorized headings using $$x for form subdivisions will go away. For now, though, we're in an interim phase. (NB: These authorities are exceptions. LC does not routinely create authorities for headings ending in form subdivisions, but it does do so when such a heading has been used as an example or a reference on another authority.)</p>

<p>One good thing about the indexing in Aleph version 16 is that it is NOT sensitive to nearly as many minor differences in headings as version 14 was. In version 14, minor differences in capitalization, diacritics, punctuation, etc., and in coding, would cause separate lines (i.e., split files) to appear in the index. Version 16 is much better at recognizing that such differences still belong to the "same" heading, and is able to represent them with a single merged index entry. Therefore, the urgency of standardizing the coding of form subfields has considerably lessened with version 16</p>

<p>So, back to solutions: the simplest correction is to follow the coding in the authority, even when it is obsolete. Doing so will avoid the subdivision doubling, and will not cause problems in the index. The more involved solution is to revise the coding in the authority to $$v, and add the UPD=N field to avoid causing doubling of older bib headings containing $$x form subdivisions. Database Management will revise any authorities like this that we find or have reported to us during this interim phase. This will allow use of the current $$v coding in the bib record. The long term solution will be to replace these authorities with updates from LC. Our batch updating process will include a step to set such authorities to UPD=N as we load them. Eventually we may update all the older $$x bib form subdivisions to $$v, but that is not a high priority given that they are no longer causing split files.</p>

<p>In any case, it is always a good idea to glance over a bib record after sending it to the server. In most cases any automatic heading updates will be good changes; in some cases, the updated record may reveal a case where an incorrectly coded subdivision (e.g., a place name in $$x rather than $$z) caused the kind of doubling that's been described, and should be corrected; and in some cases, there may be a doubling of something that was correctly coded in the bib record but incorrectly coded in the authority, and therefore has doubled.</p>

<p>If you have any questions about this, please let me know.</p>]]>
    </content>
  </entry>
  <entry>
    <title>Why are there two entries for my title in the index? Can we fix it?</title>
    <link rel="alternate" type="text/html" href="http://blog.lib.umn.edu/s-hear/control/017610.html" />
    <modified>2005-11-28T19:03:43Z</modified>
    <issued>2005-03-11T14:44:41-06:00</issued>
    <id>tag:blog.lib.umn.edu,2005:/s-hear/control//1227.17610</id>
    <created>2005-03-11T20:44:41Z</created>
    <summary type="text/plain">If the difference involves an initial article, this is an insoluble problem for now, but it may be solved in the future. For more information, read on....</summary>
    <author>
      <name>s-hear</name>
      
      <email>Stephen.S.Hearn-1@tc.umn.edu</email>
    </author>
    <dc:subject>Aleph</dc:subject>
    <content type="text/html" mode="escaped" xml:lang="en" xml:base="http://blog.lib.umn.edu/s-hear/control/">
      <![CDATA[<p>If the difference involves an initial article, this is an insoluble problem for now, but it may be solved in the future. For more information, read on.</p>]]>
      <![CDATA[<p>Aleph derives two versions of an access field for browse indexing. One is heavily normalized for rough sorting; the other is more delicately normalized. Aleph uses the latter form to display the heading in the index with punctuation, upper and lower case, etc., which makes the headings more legible. When there are different forms of the delicately normalized heading, Aleph has to decide what to do.</p>

<p>In version 14, Aleph usually decided that it had separate entries.  This meant that a host of small differences--capitalization, punctuation, spacing--could cause what should be a single index entry to split into two or more entries.  Version 16 improves on this significantly--Aleph is now able to accept that these kinds of differences are ignorable, and to display only one of them as representing all. But, in the case of initial articles, Aleph is still unable to see the two entries as the "same." If we could eliminate the initial articles from titles in the bib records, that would solve the problem; but that would also violate standard cataloging rules, which call for including the initial article in 245s, but excluding it in $t's and 130 uniform title entries.</p>

<p>What is needed is for Aleph to take the filing indicator into account when constructing the more delicate filing form.  If it were able to use the filing indicator to offset the initial article correctly, the entries would merge. Possible downsides: no more non-filing initial articles in the index (some like seeing them there); and the possibility that the form selected for display in the index might be one of the 245 forms, which often (and properly) do not uppercase the first filing word.  In any case, these would be smaller problems than the split in the headings we see now.  I expect we'll see a fix for this at some point; but there's nothing we can do now.  The underlying data is correct.</p>]]>
    </content>
  </entry>
  <entry>
    <title>Should I use LCSH terms in my metadata records?</title>
    <link rel="alternate" type="text/html" href="http://blog.lib.umn.edu/s-hear/control/015877.html" />
    <modified>2005-11-28T19:02:45Z</modified>
    <issued>2005-02-14T12:10:02-06:00</issued>
    <id>tag:blog.lib.umn.edu,2005:/s-hear/control//1227.15877</id>
    <created>2005-02-14T18:10:02Z</created>
    <summary type="text/plain">Librarians usually think of authority control in the context of the library catalog, where an elaborate set of rules and a long history of consensus building and professional discipline have resulted in a relatively high uniformity of practice and understanding....</summary>
    <author>
      <name>s-hear</name>
      
      <email>Stephen.S.Hearn-1@tc.umn.edu</email>
    </author>
    <dc:subject>General</dc:subject>
    <content type="text/html" mode="escaped" xml:lang="en" xml:base="http://blog.lib.umn.edu/s-hear/control/">
      <![CDATA[<p>Librarians usually think of authority control in the context of the library catalog, where an elaborate set of rules and a long history of consensus building and professional discipline have resulted in a relatively high uniformity of practice and understanding.  Other communities, perceiving the value ascribed to authorized vocabularies, have shown an interest in making use of these vocabularies and presumably of adding the value inherent in them to other resource discovery tools and databases.  But where exactly does the added value of the terms in a controlled vocabulary reside?  (What follows focuses on subject vocabularies, but could also be applied to other kinds of controlled vocabularies.)</p>]]>
      <![CDATA[<p>One simple notion is that the added value resides in the terms themselves.  There is some logic to this.  The terms in an authorized list like Library of Congress Subject Headings (LCSH) have been selected from a number of synonyms as being the best in some sense (e.g., most used by practitioners of a field).  Automatic validation of a databases terms against the source list can ensure formal consistency with the approved terminology.  Declaration of a databases terms adherence to a known authorized list can in principle enable searchers looking across databases to use a single controlled vocabulary.  It would appear that all of these good things can be achieved simply by picking terms from an authorized list.  This could be called formal adherence to a controlled vocabulary.</p>

<p>The problem with formal adherence is that it sidesteps the question of what terms mean.  Only the simplest controlled terms lists use terms that are fully and mutually exclusive of one another.  In most cases, the relationships between terms are more complex.  In LCSH, two main devices are used to provide logical hierarchies of terms.  Broad terms are assigned one or more subdivisions to narrow the meaning of a heading.  Related terms are linked by cross references in broader/narrower hierarchies, again to specify a range of broad to narrow meanings.   The specific meaning and correct application of a term in this kind of system depends on an understanding of the systems principle of using the most specific terminology appropriate for describing a particular resource, and of how each term relates to the larger list.  This could be called semantic adherence.  Semantic adherence to a controlled vocabulary goes beyond formal adherence in that it seeks consistency with the meaning of terms as defined in the authorizing source, as well as with their forms.  </p>

<p>Semantic adherence is more complicated that formal adherence.  The specific meanings of terms are not routinely expressed in the authority records which make up a controlled vocabulary, and are rarely conveyed in a simple list of the terms.   They are not typically available to automatic term matching algorithms, and cannot be typically be validated by machine tests.  They may not be understood by users of the list who are not informed about the rules and discipline of the community that prepared the list.  For all these reasons, it is doubtful that simple formal adherence to a controlled vocabulary will equate with semantic adherence to the same vocabulary.  </p>

<p>The question then becomes, how much does the added value of a controlled term depend simply on its form, and how much does it depend on both the terms form and its semantics?  Consider two examples:</p>

<p>The diary of an American Civil War Confederate soldier might be assigned any of the following controlled LCSH headings, all of which are formally valid:<br />
&#61623; United States<br />
&#61623; United StatesHistory<br />
&#61623; United StatesHistoryCivil War, 1861-1865<br />
&#61623; United StatesHistoryCivil War, 1861-1865Personal narratives<br />
&#61623; United StatesHistoryCivil War, 1861-1865Personal narratives, Confederate</p>

<p>A collection of photos of the B-1 bomber might be assigned any of the following controlled LCSH headings, all formally valid:<br />
&#61623; Airplanes<br />
&#61623; Government aircraft<br />
&#61623; Airplanes, Military<br />
&#61623; Bombers<br />
&#61623; Strategic bombers<br />
&#61623; Jet bombers<br />
&#61623; Supersonic bombers<br />
&#61623; B-1 bomber</p>

<p>In the context of a particular database, any of these levels of specificity might seem more or less appropriate for describing the specified content.  However, to be consistent with LCSH semantics and rules of application, only the last and most specific terms would be appropriate.  If semantic adherence is not undertaken by the creators of a set of databases, how much of the value of the controlled terms will be preserved for users attempting to search across those databases with the controlled vocabulary?  </p>

<p>User satisfaction with searching across databases which share only formal adherence to a particular controlled vocabulary  is a question that can only be properly settled by research.  However, in principle the value of controlled terms does not reside simply in their valid form.  It depends also and crucially on their semantics.  The added value of authority control thus depends on:<br />
&#61623; acknowledgment of an authoritative source for the <b>form and meaning </b>of terms<br />
&#61623; discipline in the application of terms<br />
As agencies and database creators seek to expand the utility of controlled vocabularies for users outside their source communities, there is an increased need to assign explicit definitions to authority records and to inform outside users of their rules of application.  This need for more explicitly specific semantics in authority records was recognized by Francoise Bourdon in her book International cooperation in the field of authority data (Saur, 1993) in relation to name authority files; it is just as true for subjects, and for other kinds of controlled vocabularies.</p>]]>
    </content>
  </entry>
  <entry>
    <title>What did Aleph v16 do to our headings???</title>
    <link rel="alternate" type="text/html" href="http://blog.lib.umn.edu/s-hear/control/015179.html" />
    <modified>2005-11-28T19:01:34Z</modified>
    <issued>2005-02-03T15:49:00-06:00</issued>
    <id>tag:blog.lib.umn.edu,2005:/s-hear/control//1227.15179</id>
    <created>2005-02-03T21:49:00Z</created>
    <summary type="text/plain">The index regeneration that has done so much good for MNCAT (authorizing many additional headings, correcting problems from the first index gen for v14, finally making thousands of romanized Arabic, Russian, Indic and Icelandic headings searchable) has also in a...</summary>
    <author>
      <name>s-hear</name>
      
      <email>Stephen.S.Hearn-1@tc.umn.edu</email>
    </author>
    <dc:subject>Aleph</dc:subject>
    <content type="text/html" mode="escaped" xml:lang="en" xml:base="http://blog.lib.umn.edu/s-hear/control/">
      <![CDATA[<p>The index regeneration  that has done so much good for MNCAT (authorizing many additional headings, correcting problems from the first index gen for v14, finally making thousands of romanized Arabic, Russian, Indic and Icelandic  headings searchable) has also in a few cases gone haywire. Aleph is not sensitive to tag differences; for example, a 400 or 430 reference can match to a 150 heading, resulting in the replacement of the initial subfield(s) of a topical heading with the 100 or 130 data from the mismatched authority. Not good; but fixable.</p>]]>
      <![CDATA[<p>Some of these bad heading flips were present but undetected in the v14 MNCAT.  Others may have newly occured because the updated authority file included new authority records, or because the internal sequence numbers of the authority records changed. Aleph's heading checker essentially accepts the first authority guidance it encounters. Hence, in v16 Aleph may have encountered the wrong authority first, where in v14 it happened to encounter the correct authority first.</p>

<p>Broadly speaking, these changes fall into two types. In some cases, a 4XX reference flips all instances of an unrelated heading to match its 1XX, and there are no correct uses of the 1XX heading in MNCAT. These are relatively easy to fix using either the Correct button or an Aleph batch job.  More problematic are cases where the flipped headings get merged into a larger batch of correct MNCAT headings.  For example, the LC children's subject authorities simplify some LCSH headings by replacing them with a single heading, which also happens to be an LCSH heading. The LC children's subject authority combines the LCSH headings "Evolution" and "Human evolution" into the single heading "Evolution." Heading flips of this kind can be very hard to untangle.</p>

<p>Fortunately, LEO has been able to run a job comparing the v16 headings with their v14 counterparts and listing all those which were flipped, including each bib record number. This list will be of great aid in identifying which headings need to be corrected, and which bib records were affected in those cases where headings were incorrectly merged.  LEO and TS are also working on measures to ensure that these kinds of flips will not continue to be a problem. </p>

<p>So, what should you do? For now, if you encounter an out-or-place heading, please report it to fixit@umn.edu . If you want to understand how things went wrong in more detail and fix a problem yourself, look for an authority record matching the base of the out-of-place heading. Check its 4XX fields--could one of them have matched the base of a more appropriate heading for the bib record? If so, that's the culprit.  </p>

<p>Two changes are usually necessary in the authority record before the bib records can be fixed. Call up the problem authority in the Cataloging Record display. Extend the 4XX with some reasonable text (e.g., add a parenthetical qualifier) to get past a mandatory block on updating an authority with a 4XX matching another authority's 1XX; and add a UPD field to block future use of the record for updating, e.g., "UPD __ $$a NO" Once the new update status is reflected in authority link text in the UMN01 index, the bib headings can be safely corrected.</p>]]>
    </content>
  </entry>
  <entry>
    <title>What does the Correct Display button do?</title>
    <link rel="alternate" type="text/html" href="http://blog.lib.umn.edu/s-hear/control/015096.html" />
    <modified>2005-11-28T19:01:27Z</modified>
    <issued>2005-02-02T11:09:07-06:00</issued>
    <id>tag:blog.lib.umn.edu,2005:/s-hear/control//1227.15096</id>
    <created>2005-02-02T17:09:07Z</created>
    <summary type="text/plain">The purpose of the on-screen &quot;Correct Display&quot; button, new in Aleph version 16, was a bit obscure to me, but now I think I understand it....</summary>
    <author>
      <name>s-hear</name>
      
      <email>Stephen.S.Hearn-1@tc.umn.edu</email>
    </author>
    <dc:subject>Aleph</dc:subject>
    <content type="text/html" mode="escaped" xml:lang="en" xml:base="http://blog.lib.umn.edu/s-hear/control/">
      <![CDATA[<p>The purpose of the on-screen "Correct Display" button, new in Aleph version 16, was a bit obscure to me, but now I think I understand it.</p>]]>
      <![CDATA[<p>In v14, Aleph was hypersensitive to small differences in the display form of its headings, to the point that minor differences in capitalization or punctuation could split a heading into two entries. In v16, this sensitivity is reduced, which is good on the whole and relieves us of lots of work to merge split headings. But a consequence of this reduction in sensitivity has been a break in the link between the heading form on the record and the display form in the index entry as regards these minor differences. This comes through most clearly when one is correcting a minor discrepancy in a single entry, for example:</p>

<p>"Biology--Polar Regions" should be "Biology--Polar regions."</p>

<p>There is only one hit for this heading. The Correct Heading button lets me change the bib heading to "regions", but the display form of the heading is not changed--it still shows "Regions." To complete the correction, I use the Correct Display button. It lets me doctor the display text in the index record to the form I want. Having taught Aleph not to regard differences like R/r as significant enough to result in a separate display, we also have to accept that when we make such a change, Aleph will ignore it unless we intervene with the Correct Display button.</p>

<p>So my guess is that we'll use the Correct Display button mainly to tidy up the index after using the Correct Heading button to correct the bib data. If we used only the Correct Display button, the entry would look OK, but the underlying bib data would be uncorrected. </p>

<p>Please note that only a few staff are authorized to use the Correct Heading and Correct Display buttons.  If you have corrections which would best be done with these buttons, please report them to fixit@umn.edu<br />
</p>]]>
    </content>
  </entry>
  <entry>
    <title>To BFM, or not to BFM?</title>
    <link rel="alternate" type="text/html" href="http://blog.lib.umn.edu/s-hear/control/014959.html" />
    <modified>2005-11-28T19:01:09Z</modified>
    <issued>2005-01-31T14:39:24-06:00</issued>
    <id>tag:blog.lib.umn.edu,2005:/s-hear/control//1227.14959</id>
    <created>2005-01-31T20:39:24Z</created>
    <summary type="text/plain">Participants in the cooperative NACO authority project are required to notify Library of Congress when the process of creating or modifying an authority record results in a need for bibliographic file maintenance (BFM) in LC&apos;s catalog, i.e., the correction of...</summary>
    <author>
      <name>s-hear</name>
      
      <email>Stephen.S.Hearn-1@tc.umn.edu</email>
    </author>
    <dc:subject>NACO</dc:subject>
    <content type="text/html" mode="escaped" xml:lang="en" xml:base="http://blog.lib.umn.edu/s-hear/control/">
      <![CDATA[<p>Participants in the cooperative NACO authority project are required to notify Library of Congress when the process of creating or modifying an authority record results in a need for bibliographic file maintenance (BFM) in LC's catalog, i.e., the correction of a heading on one or more LC bib records.  Recent changes have reduced the number of cases which require this kind of reporting. [More in the extended entry].</p>]]>
      <![CDATA[<p>The two major categories which no longer need to be reported are changes which effectively replace one version of a heading with another--e.g., a change which modifies a 100 heading; and changes which affect older records which LC does not distribute as MARC21 records.</p>

<p>The former category is discussed in detail in a FAQ posted to the NACO web site at: </p>

<p>http://www.loc.gov/catdir/pcc/naco/bfmguide.html  </p>

<p>The site offers examples of the kinds of changes that need to be reported (e.g., two authors whose titles have been confused under a single name heading), and those which don't.</p>

<p>The second category involves records outside the scope of the BFM rule. Some are marked with the legend [from old catalog] in LC's browse indexes (at catalog.loc.gov). Other older records when viewed in LC's web catalog as MARC format records show in the 906 field the words "OCLC replacement." The answer to question 4 in the NACO FAQ on older records at:</p>

<p>http://www.loc.gov/catdir/pcc/naco/bfmfaq.html#4</p>

<p>instructs us that neither of these are candidates for BFM because they haven't been distributed by LC. You may find versions of older catalog records in OCLC flagged "D" (i.e., based on LC cataloging), but that's because some other library did copy cataloging from an LC card and correctly put DLC in the 040 $a; they're not actually distributed by LC as MARC21 records.</p>

<p>Of course, there is also an element of frustration in seeing a mistake and being asked not to report it; but that's the rule for now.</p>]]>
    </content>
  </entry>
  <entry>
    <title>What is authority control?</title>
    <link rel="alternate" type="text/html" href="http://blog.lib.umn.edu/s-hear/control/014058.html" />
    <modified>2005-11-28T18:58:56Z</modified>
    <issued>2005-01-18T16:15:24-06:00</issued>
    <id>tag:blog.lib.umn.edu,2005:/s-hear/control//1227.14058</id>
    <created>2005-01-18T22:15:24Z</created>
    <summary type="text/plain">Authority control is a system librarians have devised to manage access to the records for their collections. When a number of items are held in a collection that have a common characteristic--same author, same topic, same series, versions of the...</summary>
    <author>
      <name>s-hear</name>
      
      <email>Stephen.S.Hearn-1@tc.umn.edu</email>
    </author>
    <dc:subject>General</dc:subject>
    <content type="text/html" mode="escaped" xml:lang="en" xml:base="http://blog.lib.umn.edu/s-hear/control/">
      <![CDATA[<p>Authority control is a system librarians have devised to manage access to the records for their collections.  When a number of items are held in a collection that have a common characteristic--same author, same topic, same series, versions of the same work--users of the collection may want to see the records for those items brought together.  In online catalogs, authority control uses a separate file of records called authority records (distinct from bibliographic records, which represent items in the collection).  The authority file defines  uniform headings for each of these common characteristics--names, subjects, series, and entries for works.  Also needed are rules for how to create and use these headings, and system functions to link the authority records to the bibliographic records and provide an integrated display of information from both.  Authority records also enable the library to provide entries for variants of the uniform heading, and to indicate relationships between headings.</p>]]>
      <![CDATA[<p>For example, if catalog users are searching for works by F. Scott Fitzgerald, they can enter a search for either "fitzgerald f scott" or "fitzgerald francis scott". Both searches will find the heading in the catalog where the author's works are listed: Fitzgerald, F. Scott (Francis Scott), 1896-1940.  Authority control ensures that the only one form of the heading is used, and that it can still be searched in a number of ways.  Similarly, a search on "fbi" will bring up a link to the heading for the body: United States. Federal Bureau of Investigation.</p>

<p>Authority records can also express relationships between headings.  The authority for the FBI also points to a heading for the body's earlier name: United States. Dept. of Justice. Division of Investigation.  The authority record for the topic Ostriches also points to broader and related topics: Birds, Ratites, and Cookery (Ostrich).</p>

<p>Often, authorized headings include more information than anyone would search for.  This enables the system to display a clear and distinct heading in response to a partial or truncated search (e.g., "fitzgerald f s" leads to the full name heading above), and to distinguish between two similar entries (e.g., Wagner, Richard, 1813-1883 (for the composer) and Wagner, Richard, 1939-1972 (a dancer and choreographer).</p>]]>
    </content>
  </entry>
  <entry>
    <title>Welcome</title>
    <link rel="alternate" type="text/html" href="http://blog.lib.umn.edu/s-hear/control/014053.html" />
    <modified>2005-11-28T18:58:56Z</modified>
    <issued>2005-01-18T15:38:27-06:00</issued>
    <id>tag:blog.lib.umn.edu,2005:/s-hear/control//1227.14053</id>
    <created>2005-01-18T21:38:27Z</created>
    <summary type="text/plain">Control freak is a blog devoted to library catalog authority control. It features answers to questions that I get about specific authority control issues, rules, and functions, and occasional comments on the state of the art. Additional questions and comments...</summary>
    <author>
      <name>s-hear</name>
      
      <email>Stephen.S.Hearn-1@tc.umn.edu</email>
    </author>
    <dc:subject>General</dc:subject>
    <content type="text/html" mode="escaped" xml:lang="en" xml:base="http://blog.lib.umn.edu/s-hear/control/">
      <![CDATA[<p><b>Control freak</b> is a blog devoted to library catalog authority control. It features answers to questions that I get about specific authority control issues, rules, and functions, and occasional comments on the state of the art. Additional questions and comments are always welcome.</p>

<p>Stephen</p>]]>
      
    </content>
  </entry>

</feed>