Glossary A-J

Acquisition

(Collection) The process of collecting electronic data and paper.

AIIM

The Enterprise Content Management Association has been a neutral and unbiased source for helping individuals and organizations understand the challenges associated with managing documents, content, records, and business processes.

Attachment

Also often called the ""child"" - An attachment is a file associated with another file for the purpose of storage or transfer. There can be multiple attachments associated with a single “parent” file. The parent and child(ren) attachments can be processed or categorized individually or as a set.

AVI

(Audio/Video Interleaved) - A common video file format (.avi). Video quality can be good at smaller resolutions, but files tend to be large.

Best Practice

A technique or methodology that, through experience and research, has proven to reliably lead to a desired result. A commitment to using the best practices in any field is a commitment to using all the knowledge and technology at one's disposal to ensure success.

Blowbacks

A printout onto paper from an image file.

Business Process Management

(BPM) is a systematic approach to improving an organization's business processes. BPM activities seek to make business processes more effective, more efficient, and more capable.

Change Management

A systematic approach to dealing with change, both from the individual's perspective and from that of an organization.  For an organization, change management means defining and implementing procedures and/or technologies to deal with changes in the business environment and to profit from changing opportunities.

Coding Manual

Document that defines the protocols for entering fielded data. The coding manual is the ""rule book"" for the project and generally includes the answers to potential questions.

Compression

Data files available for upload and download are often compressed in order to save space and reduce transfer times. Typical file extensions for compressed files include .zip (DOS/Windows) and .tar (UNIX).

Controlled Language

Also known as ""controlled vocabulary"" this refers to the use of standard terms throughout a RIM program.

Cost Center

A segment of an organization that does not produce direct profit and adds to the cost of running a company.

Data Mining

Sorting through data to identify patterns and establish relationships.

Data Preprocessing

(Prepping) Any action performed on raw data to prepare it for another processing procedure. Commonly used as a preliminary data mining practice, data preprocessing transforms the data into a format that will be more easily and effectively processed for the purpose of the user.

Data Quality

The overall reliability and effectiveness of data, particularly in a data warehouse.

Data Repository

A location or system where knowledge or data is developed, stored, organized, processed, and disseminated.

Data Set

A named collection of data that contains individual data units organized (formatted) in a specific, IBM-prescribed way and accessed by a specific access method that is based on the data set organization.

Data Structure

A specialized format for organizing and storing data. General data structure types include the array, the file, the record, the table, the tree, and so on. Any data structure is designed to organize data to suit a specific purpose so that it can be accessed and worked with in appropriate ways.

Database

A collection of information that is organized so that it can easily be accessed, managed, and updated. In one view, databases can be classified according to types of content: bibliographic, full-text, numeric, and images.

De-Duplication

The process of removing or suppressing duplicate files from a collection of documents prior to reviewing or moving to storage.  Prior to this process, a clear definition of what is a duplicate must be determined - same HASH value and/or other common fields of metadata.

Disaster Recovery Plan

(DRP) - A descriptive document that outlines how an organization would deal with potential disasters, usually consisting of the precautions taken so that the effects of a disaster would be minimized. The goal of the plan is for the organization to be able to either maintain or quickly resume mission-critical functions.

Document

Federal Rules of Civil Procedure 34(a) defines a document as “including writings, drawings, graphs, charts, photographs, phonorecords, and other data compilations".

ESI

Encompasses all electronically stored information.

ETL

In managing databases, extract, transform, load (ETL) refers to three separate functions combined into a single programming tool. First, the extract function reads data from a specified source database and extracts a desired subset of data. Next, the transform function works with the acquired data - using rules or lookup tables, or creating combinations with other data - to convert it to the desired state. Finally, the load function is used to write the resulting data (either all of the subset or just the changes) to a target database, which may or may not previously exist.

External Storage

All addressable data storage that is not currently in the computer's main storage or memory. Synonyms are auxiliary storage and secondary storage.

Failover

A backup operational mode in which the functions of a system component (such as a processor, server, network, or database, for example) are assumed by secondary system components when the primary component becomes unavailable through either failure or scheduled down time.

FTP

(File Transfer Protocol) - A common method of moving files between two Internet sites. Most FTP sites require a login name and password before files can be retrieved or sent.

GIF

(Graphics Interchange Format) - A graphics file format commonly used on the Internet to provide images on Web pages. GIF images are 8-bit (256-color) graphics.

Gigabyte

A thousand (technically 2^10 or 1024) Megabytes.

HTML

(HyperText Markup Language) - The language used to build hypertext documents on the World Wide Web. They are plain ASCII-text documents interpreted (or rendered) by a web browser to display formatted text and fonts, color, graphic images, and links.

Image Compression

The process of minimizing the size in bytes of a graphics file without degrading the quality of the image to an unaccceptable level. The reduction in file size allows more images to be stored in a given amount of disk or memory space. It also reduces the time required for images to be sent over the Internet or downloaded from Web pages.

ISO/IEC 17799

Code of Practice for Information Security Management is a generic set of best practices for the security of information systems. Considered the foremost security specification document in the world, the code of practice includes guidelines for all organizations, no matter what their size or purpose.

Java

A network-oriented programming language invented by Sun Microsystems specifically designed for creating programs that can be downloaded to your computer from a web page and immediately run. Using small Java programs (""applets""), Web pages can include features such as animations, calculators and other fancy or interactive tricks.

JPG

(Joint Photographic Experts Group) - The name of the committee that designed the photographic image-compression standard. The format (.jpg) is optimized for compressing full-color or grayscale photographic images, and does not work well for line drawings or black-and-white images. JPG images are 24-bit (16.7 million color) graphics.