Level 60 Level 62
67 words 0 ignored
Ready to learn Ready to review
Check the boxes below to ignore/unignore words, then click save at the bottom. Ignored words will never appear in any learning session.
Three basic characteristics of Big Data databases: volume, velocity, and variety.
American National Standards Institute (ANSI)
The group that accepted the DBTG recommendations and augmented database standards in 1975 through its SPARC committee.
A characteristic of an entity or object. An attribute has a name and a data type.
A movement to find new and better ways to manage large amounts of Web-generated data and derive business insight from it, while simultaneously providing high performance and scalability at a reasonable cost.
A description of a policy, procedure, or principle within an organization. For example, a pilot cannot be on duty for more than 10 hours during a 24-hour period, or a professor may teach u…
A class is the blueprint from which individual objects are created. It is the code template you write.
A diagram used to represent data and their relationships in UML object notation.
class diagram notation
The set of symbols used in the creation of class diagrams.
The organization of classes in a hierarchical tree in which each parent class is a superclass and each child class is a subclass.
One of three types of nodes used in the Hadoop Distributed File System. The client node acts as the interface between the user application and the HDFS.
The output of the conceptual design process. The conceptual model provides a global view of an entire database and describes the main data objects, avoiding details.
A representation of the conceptual model, usually expressed graphically.
The classification of the relationship between entities. Classifications include 1:1, 1:M, and M:N.
A restriction placed on data, usually expressed in the form of rules.
Crow's Foot notation
A representation of the entity relationship diagram that uses a three-pronged symbol to represent the many sides of the relationship.
data definition language (DDL)
The language that allows a database administrator to define the database structure, schema, and subschema.
data manipulation language (DML)
The set of commands that allows an end user to manipulate the data in the database. The commands include SELECT, INSERT, UPDATE, DELETE, COMMIT, and ROLLBACK.
A representation, usually graphic, of a complex real-world data structure. Data models are used in the database design phase of the Database Life Cycle.
The process of creating a specific data model for a determined problem domain.
A person, place, thing, concept, or event for which data can be stored.
In ER modeling, a specific table row.
entity relationship (ER) model (ERM)
A data model that describes relationships, 1:1, 1:M, and M:N, among entities at the conceptual level with the help of ER diagrams. The model was developed by P. Chen in 1975.
entity relationship diagram (ERD)
A diagram that depicts an entity relationship model's entities, attributes, and relations.
In a relational model, a grouping of related entities.
A model for database consistency in which updates to the database will propagate through the system so that all data copies will be consistent eventually.
extended relational data model (ERDM)
A model that includes the object-oriented model's best features in an inherently simpler relational database structural environment.
Extensible Markup Language (XML)
A metalanguage used to represent and manipulate data elements. Unlike other markup languages, XML permits the manipulation of a document's data elements. XML facilitates the exchange of structured documents such as orders and invoices over the Internet.
The application programmer's view of the data environment. Given its business focus, an external model works with a data subset of the global database schema.
The specific representation of an external view; the end user's view of the data environment.
A Java based, open source, high speed, fault-tolerant distributed storage and computational framework. Hadoop uses low-cost hardware to create clusters of thousands of computer nodes to store and process data.
Hadoop Distributed File System (HDFS)
A highly distributed, fault-tolerant file storage system designed to manage large amounts of data at high speeds.
A condition in which a model does not depend on the hardware used in the model's implementation. Therefore, changes in the hardware will have no effect on the database design at the conceptual level.
An early database model whose basic concepts and characteristics formed the basis for subsequent database development. This model is based on an upside-down tree structure in which each record is called a segment. The…
designer, project manager & programmer are all based on and extend a person object
In database modeling, a level of data abstraction that adapts the conceptual model to a specific DBMS model for implementation. The internal model is the representation of a database as seen by the DBMS…
A representation of an internal model using the database constructs supported by the chosen database.
A data model based on a structure composed of two data elements: a key and a value, in which every key has a corresponding value or set of values. The key-value data model is …
A stage in the design phase that matches the conceptual design to the requirements of the selected DBMS and is therefore software-dependent. Logical design is used to translate the conceptual design into the internal m…
A condition in which the internal model can be changed without affecting the conceptual model.
many-to-many (M:N or *..*) relationship
Associations among two or more entities in which one occurrence of an entity is associated with many occurrences of a related entity and one occurrence of the related entity is associated with many occurrences of the first entity.
An open-source application programming interface that provides fast data analytics services; one of the main Big Data technologies that allows organizations to process massive data stores.
A sequence of statements that has a name, may have parameter variables, and may return a value
One of three types of nodes used in the Hadoop Distributed File System. The name node stores all the metadata about the file system.
A data model standard created in the late 1960s that represented data as a collection of record types and relationships as predefined sets with an owner record type and a member record type in a 1:M relationship.
__________ is the term used to describe non-relational databases that support very high transaction rates processing relatively simple data structures.
A database item that contains data, as well as the actions that read or process the data.
object-oriented data model (OODM)
A data model whose basic modeling structure is an object.
object-oriented database management system (OODBMS)
Data management software used to manage data in an object-oriented database model.
object/relational database management system (O/R DBMS)
A DBMS based on the extended relational model. The ERDM, championed by many relational database researchers, constitutes the relational model's response to the OODM. This model includes many of the object-oriented model's best features wi…
one-to-many (1:M or 1..*) relationship
Associations among two or more entities that are used by data models. In a 1:M relationship, one entity instance is associated with many instances of the related entity.
one-to-one (1:1 or 1..1) relationship
Associations among two or more entities that are used by data models. In a 1:1 relationship, one entity instance is associated with only one instance of the related entity.
A condition in which the physical model can be changed without affecting the internal model.
A model in which physical characteristics such as location, path, and format are described for the data. The physical model is both hardware- and software-dependent.
a pairing that matches each element of the domain with one or more images in the range
relational database management system (RDBMS)
A collection of programs that manages a relational database. The RDBMS software translates a user's logical requests into commands that physically locate and retrieve the requested data. A good RDBMS also creates and maintai…
A graphical representation of a relational database's entities, the attributes within those entities, and the relationships among the entities.
Developed by E. F. Codd of IBM in 1970, it represented a major breakthrough for users and designers because of its conceptual simplicity. The relational model is based on mathematical set theory and represents …
A blank within the data in a relational database.
A logical grouping of database objects, such as tables, indexes, views, and queries, that are related to each other. Usually, a schema belongs to a single user or application.
In the hierarchical data model, the equivalent of a file system's record type.
semantic data model
The first of a series of data models that more closely represented the real world, modeling both data and their relationships in a single structure known as an object. The SDM, published in 1981,…
A property of any model or application that does not depend on the software used to implement it.
A case in which the number of table attributes is very large but the number of actual data instances is low.
In the network model, the portion of the database seen by the application programs that produce the desired information from the data in the database.
A database view of information arranged in a grid of rows and columns
Horizontal row in table, describes a data item, also called record.
Unified Modeling Language (UML)
A language based on object-oriented concepts that provides tools such as diagrams and symbols to graphically model a system.