What is the Difference Between a Concatenated Data Format and a Related Data Format?
Archaeological data are hierarchical. Consider a ceramic sherd. The sherd has properties of it own: a single ware type, a single maximum thickness, and a single weight. But that sherd also may carry multiple decorative elements that are executed in a different design, color, and technique. Or consider the fill of a post hole. That fill has one length, width and depth. But it may also have multiple sediment inclusions whose size and frequency are important.
These are examples of one-to-many relationships. One sherd may have many decorative attributes. Posthole fill may have multiple sediment inclusions. Archaeological data, both artifactual and contextual, are filled with these one-to-many relationships. Relational databases were invented to handle just such hierarchical relationships among objects. They do so by recording variable values for objects at each level in the hierarchy in a separate table and by identifying which constituent objects belong to which higher-level objects by means of a unique identifier or “key”.
CMAP data structures take full advantage of the relational database model. This means that when data on both objects and their lower-level constituents are joined and extracted for analysis, the resulting data file may contain multiple records for a single object. Together, the multiple records contain the information on the constituents. However, they duplicate information on the primary object. The Artifact ID number is a good way to see whether the multiple records represent a single artifact.
This example shows how CMAP returns multiple records for a single artifact. Note the differences in Decorative Technique + Dech Tech Color:
Artifact ID | Count | Ware | Form | Sherd Thickness | Decorative Technique | Dec Tech Color |
---|---|---|---|---|---|---|
1000-177 | 1 | Slipware, North Midlands/Staffordshire | Unid: Utilitarian | 3 | Slip | Yellow-Red, Muted Dark |
1000-177 | 1 | Slipware, North Midlands/Staffordshire | Unid: Utilitarian | 3 | Slip | Yellow, Muted Medium |
1000-178 | 1 | Whieldon-type Ware | Unid: Utilitarian | 4 | Applied Powder/Crystals | Yellow, Muted Medium |
1000-178 | 1 | Whieldon-type Ware | Unid: Utilitarian | 4 | Applied Powder/Crystals | Green-Yellow, Muted Dark |
1000-178 | 1 | Whieldon-type Ware | Unid: Utilitarian | 4 | Applied Powder/Crystals | Yellow-Red, Muted Dark |
1000-178 | 1 | Whieldon-type Ware | Unid: Utilitarian | 4 | Molded | No Glaze/Color |
This result may present difficulties for users who traditionally expect a one-to-one relationship between observations or records in a data file and higher-level objects (e.g. one sherd generates one record). For other users, preserving patterns of covariation among variables that characterize lower level constituents is analytically important, and this requires multiple records per object.
We have thus provided users with two different options for viewing and downloading data: a Concatenated Data Format and a Related Data Format. If users choose to download data in the Related Data Format, multiple records will be returned for every artifact that has more than one constituent object or part. Below is an example showing two different glass sherds. Note that there is more than one record for each artifact since each unique glass bottle constituent body element is being recorded (note the different artifact ids, 1000-109 and 1000-110).
Artifact ID | Glass Form | Glass Bottle Element | Glass Bottle Manu Technique | Glass Bottle Shape | Glass Bottle Treatment |
---|---|---|---|---|---|
1000-109 | Bottle, Wine style | Body Horizontal | Mouth Blown | Circular/Round | Not Applicable |
1000-109 | Bottle, Wine style | Base | Mouth Blown | Dome | Not Applicable |
Artifact ID | Glass Form | Glass Bottle Element | Glass Bottle Manu Technique | Glass Bottle Shape | Glass Bottle Treatment |
---|---|---|---|---|---|
1000-110 | Bottle, Wine style | Lip | Finishing Tool | Rounded | Not Applicable |
1000-110 | Bottle, Wine style | String Rim | Finishing Tool | V-Shaped | Not Applicable |
1000-110 | Bottle, Wine style | Finish | Finishing Tool | 2-Part Finish, Unid | Not Applicable |
1000-110 | Bottle, Wine style | Neck | Mouth Blown | Cylindrical | Not Applicable |
Users who select to view and download data in the Concatenated Data Format will receive data sets that do not contain multiple records for a single record. Below is an example of the same artifacts returned in Concatenated Data Format. One record is returned for each artifact and the unique attributes are concatenated into one field, with each term separated by a comma and each record separated by a semi-colon.
Artifact ID | Glass Bottle Form | Glass Bottle Element Manufacturing Technique, Shape, Treatment |
---|---|---|
1000-109 | Bottle, Wine style | Body Horizontal, Mouth Blown, Circular/Round, Not Applicable; Base, Mouth Blown, Dome, Not Applicable. |
1000-110 | Bottle, Wine style | Lip, Finishing Tool, Rounded, Not Applicable; String Rim, Finishing Took, V-Shaped, Not Applicable; Finish, Finishing Tool, 2-Part, Not Applicable; Neck, Mouth Blown, Cylindrical, Not Applicable. |