Bibliomining is a term that combines the concepts of data mining , data storage and bibliometrics for analyzing library services. The term was introduced in 2003 by Scott Nicholson, associate professor of information research at Syracuse University, in order to distinguish data mining in a library from other types of intellectual data analysis.
Content
The principle of the work of bibliomining
First you need to create a data warehouse . This is done by compiling information about resources such as titles and authors, subject headings, and descriptions of collections. Then demographic surrogate information is organized. This can be information about the library (for example, about the librarian or about the location of the library, regardless of whether information came from the help desk).
Once this is organized, the data can be processed and analyzed. This can be done using several methods, such as online analytical processing , using a data mining program or data visualization .
Application of bibliomining
Bibliomining is used to identify what people read and research, and allows librarians to better orient their community. Bibliomining can also help library directors focus their budgets on the resources to be used. Another possible case is to determine when people more often use the library so that staff needs can be adequately met. Combining bibliomining with other research methods such as focus groups, surveys, and cost-benefit analysis will help librarians better understand their patrons and their needs.
Problems
There is concern that intelligent analysis violates the confidentiality of cartridges. But, removing the data, all personal data is deleted, and the data storage is clean. The original cartridge data can then be completely deleted and it will not be possible to associate new data with a specific cartridge. This can be done in several ways. The first one, used with information on access to the database, is designed to track the IP address, then it needs only to be replaced with a similar code that will allow identification without violating confidentiality. The second way is to track the item returned to the library and create a “demographic surrogate” for the patron. A demographic surrogate would not give any identifiable information, such as the names, numbers, or addresses of libraries.
Another problem with bibliomining is that it only provides data in a very remote form. Information is provided on how the cartridge uses the resources of the library, but there is no way to track whether it meets the user's needs completely. Someone may take a book on the topic, but not find the information they were looking for. Bibliomining helps determine which books are used, but not how useful they are. Bibliomining cannot provide information about how well the collection serves the patron. To counteract this, bibliomining should be used in accordance with other research methods.
Links
1. Nicholson, S. (2006). The Basics for Bibliomining: Usage-Based Data Mining and Bibliometrics through Digital Library Services. Information Processing and Management 42 (3), 785-804.
2. Nicholson, S. (2003). Decision-Making Information Technology and Libraries 22 (4), 146–151.
3. Jiann-Cherng, S. (2009). The Librarians' Integration System for Librarians , Asia-Pacific Conference on Library & Information Education & Practice.
4. Gunther, K. (2000). You can make better decisions. Computers in Libraries 20 (4), 60-63.