Monday, August 13, 2007
"Individual," "Cultural," and "Institutional" Categorization
Paul, Teenie, Larry and I pointed out that in today's world of ubiquitous computing and ubiquitous information resources, we interact daily as individuals and as participants in organizational processes with a bewildering variety of information types, and we constantly make choices about whether and how to categorize them. So we're proposing to broaden the scope of research on categorization to study the explicit activities by individuals to classify web resources (e.g., flickr, del.icio.us, …) and institutional efforts to define and deploy category systems to achieve business and organizational objectives.
Our fundamental claim is that these different kinds of categorization and classification activities or systems lie in a continuous multidimensional space where we can identify three important regions:
Cultural Categorization Systems
Individual Categorization (aka "Tagging")
Institutional Categorization (aka "Business Semantics")
CULTURAL categorization systems are the traditional subject matter for research. These are acquired implicitly through development via parent-child interactions, language, and experience. Formal education can build on this, but the non-formal cultural system can often dominate.
INDIVIDUAL categorization systems are developed by an individual for organizing a personal domain to aid memory, retrieval, or usage. These can serve social goals to convey information, develop a community, or manage reputation. Individual categorization systems have always existed, but they have exploded with the advent of cyberspace, especially in applications based on "tagging."
INSTITUTIONAL categorization systems involve the explicit construction of a semantic model of a domain to enable more control, robustness, and interoperability than is possible with just the cultural system. They are often the collaborative artifact of many individuals who represent different organizational or business perspectives, and they are usually developed via rigorous and formal processes (e.g., in standards organizations like OASIS, where I'm a member of the Board of Directors). Finally, institutional categorization systems require ongoing governance and maintenance because of continuous changes taking place in related cultural and individual systems.
We frankly admit that our thinking isn't fully developed, but it seems that there are many very interesting and important issues to study when you take this broader view of categorization. In particular, we see a number of dimensions or tradeoffs that define the space of categorization activities, such as:
Explicitness vs implicitness
Effort to acquire and use
Individual vs group goals
Amount of reuse of other categorization systems
Nature and rate of change over time
This fall in my Information Organization and Retrieval course at UC Berkeley, I'll be using this new framework, and I think it will help students understand better how tagging by individuals in flickr or del.icio.us compares to the "institutional tagging" of business information in standard product classification systems like the United Nations Standard Products and Services Code (UNSPSC) or business vocabularies like the Universal Business Language (UBL).
- Bob Glushko