Data file

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

A Data file is a computer file which stores data to be used by a computer application or system. It generally does not refer to files that contain instructions or code to be executed (typically called program files), or to files which define the operation or structure of an application or system (which include configuration files, etc.); but specifically to information used as input, or written as output by some other software program. This is especially helpful when debugging a program.

Most computer programs work with files. This is because files help in storing information permanently. Database programs create files of information. Compilers read source files and generate executable files. A file itself is an ordered collection of bytes stored on a storage device like tape, magnetic disk, optical disc etc.

Types of Data files[edit]

Data files can be stored in two ways:

  • Text files.
  • Binary files.

A text file (also called ASCII files) stores information in ASCII characters. A text file contains human-readable characters. A user can read the contents of a text file or edit it using a text editor. In text files, each line of text is terminated, (delimited) with a special character known as EOL (End of Line) character. In text files some internal translations take place when this EOL character is read or written.[1]

Examples of text files[edit]

  • A text document

A binary file is a file that contains information in the same format in which the information is held in memory i.e. in the binary form. In binary file, there is no delimiter for a line. Also no translations occur in binary files. As a result,binary files are faster and easier for a program to read and write than the text files. As long as the file doesn't need to be read or need to be ported to a different type of system, binary files are the best way to store program information.[1]

Examples of binary files[edit]

  • A JPEG image

Data file categories[edit]

Closed data file formats, frequently referred to as proprietary format files, have their metadata data elements hidden, obscured or unavailable to users of the file. Application developers do this to discourage users from tampering with or corrupting the data files or importing the data into a competitor's application.

Open data format files have their internal structures available to users of the file through a process of metadata publishing. Metadata publishing implies that the structure and semantics of all the possible data elements within a file are available to users.

Examples of open data files include CSV, XLS and XML formats such as HTML for storing web pages or SVG for storing scalable graphics.[2]

See also[edit]

References[edit]

  1. ^ a b Making sense of file transfers: City edition. (1994, ). Boston Globe (Pre-1997 Fulltext)
  2. ^ Research Data Management Service Group. (n.d.). File Formats. Cornell University. Retrieved from https://data.research.cornell.edu/content/file-formats