WGU C175/D426 DATA MANAGEMENT FOUNDATIONS OA EXAM 2023-2024 ACTUAL EXAM 2 LATEST VERSIONS 300 QUESTIONS AND CORRECT DETAILED ANSWERS WITH RATIONALES (100% CORRECT AND VERIFIED ANSWERS)|ALREADY GRADED A+

D426 data management foundations oa exam questions and answers
D426 data management foundations oa exam questions
D426 data management foundations oa exam answers
data management – foundations – d426
data management – foundations – d426 reddit
data management – foundations – c175 quizlet
data management – foundations – c175 reddit
data management foundations wgu

Which requirement within large, complex databases ensures users have limited access to the database?

Recovery
Performance
Authorization
Confidentiality

Authorization

Which design type specifies database requirements without regard to a specific database system?

Physical design
Conceptual design
Logical design
Abstract design

Conceptual design

3. What characterizes the rules of relational databases?

-They are logical constraints that ensure the data is valid.
-They are based on business policy and specific databases.
-They represent data volumes and rapidly changing data structures.
-They embody the theoretical foundation of the SQL language.

They are logical constraints that ensure the data is valid.

4. Which event will result in an error code and error description when using MySQL Server?

When the server or updates are incorrectly installed
When an SQL statement is entered to locate errors in the database
When an SQL statement is syntactically incorrect
When a Mac OS shortcut code is used on a Windows OS

When an SQL statement is syntactically incorrect

5. Which data type should be used to store whole integer values, such as age?

NUM
VARCHAR
DATE
INT

INT

6. Which text-based interface is included in the MySQL Server Download?

MySQL Command-Line Client
MySQL Workbench
MySQL Enterprise
MySQL Community

MySQL Command-Line Client

7. Which database system includes the World database during its installation?

MySQL
MongoDB
Oracle
PostgreSQL

MySQL

8. What is the standardized language of relational database systems?

Contextual Query Language
Structured Query Language
Select Query Language
Object Query Language

Structured Query Language

9. Which SQL sublanguage is used to manage database access?

Data Definition Language (DDL)
Data Query Language (DQL)
Data Manipulation Language (DML)
Data Control Language (DCL)

Data Control Language (DCL)

10. Which SQL sublanguage is used to roll back database changes?

Data Transaction Language (DTL)
Data Manipulation Language (DML)
Data Query Language (DQL)
Data Control Language (DCL)

Data Transaction Language (DTL)

11. Which format is used by the TIME data type in MySQL?

YYYY-MM-DD
YYYY-MM-DD hh:mm:ss
hh:mm:ss
hh:mm:ss YYYY-MM-DD

hh:mm:ss

12. Which MySQL operator is the last in the order of operator precedence?


OR
NOT
=

OR

13. Which operator is used to compare columns from the left and right tables in MySQL?

<=
+
=
*

=

14. Which type of join combines two tables without comparing columns?

SELF
EQUIJOIN
OUTER
CROSS

CROSS

15. Which type of join compares columns using only the = operator?

Equijoin
Non-equijoin
Outer join
Inner join

Equijoin

16. Which type of join is demonstrated by the following query?
SELECT Dog.Nickname, Kennel.Address
FROM Dog, Kennel
WHERE Dog.KennelID = Kennel.ID

NON-EQUIJOIN
SELF JOIN
CROSS JOIN
EQUIJOIN

EQUIJOIN

17. What is another name for a subquery?

Outer query
Nested query
Correlated query
Search query

Nested query

18. Which operator is used to compare against a list of values when determining a match in a WHERE clause?

IN
BETWEEN
LIKE
OR

IN

19. Which wildcard character is used to represent zero or more characters when searching for a specified pattern using a LIKE operator?

%
_
<
>

%

20. Which function is considered an aggregating function in SQL?

TRIM
REPLACE
MIN
SUBSTRING

MIN

21. Which clause or function is used with aggregate functions to produce summary rows?

TRIM
REPLACE
ORDER BY
GROUP BY

GROUP BY

22. Which type of join returns only the matching values when selecting rows from two or more tables?

Full Join
Outer Join
Equijoin
Inner Join

Inner Join

23. Which join type selects only matching left and right table rows?

Full Join
Left Join
Right Join
Inner Join

Inner Join

24. Which phrase refers to the view in which data is persisted and is automatically changed as the underlying data is changed?

Virtual View
Snapshot View
Denormalized View
Materialized View

Materialized View

25. Which SQL command uses the correct syntax to update the salary of an employee in the Employee table?

CHANGE Employee Salary = 50000 WHERE ID = 1;
UPDATE Employee SET Salary = 50000 WHERE ID = 1;
ALTER Employee SET Salary = 50000 WHERE ID = 1;
UPDATE Employee (Salary) = (50000) WHERE ID = 1;

UPDATE Employee SET Salary = 50000 WHERE ID = 1;

26. What describes elements such as column name and data type?

Alter
Table
Metadata
CRUD

Metadata

27. Which term identifies an ordered collection of elements enclosed in parenthesis in a tablespace?

Cell
Field
Column
Tuple

Tuple

28. Which SQL command uses the correct syntax to select the name and salary columns from the Employee table where the salary is greater than 50000, and sort the results by salary in descending order?

-SELECT name, salary FROM Employee WHERE salary > 50000 ORDER + BY salary DESC;
-SELECT Employee.name, Employee.salary WHERE Employee.salary > 50000 SORT BY Employee.salary DESC;
-SELECT name, salary FROM Employee SORT BY salary DESC WHERE salary > 50000;
-SELECT name, salary FROM Employee WHERE salary > 50000 SORT BY salary ASC;

SELECT name, salary FROM Employee WHERE salary > 50000 ORDER + BY salary DESC;

29. Which INSERT statement demonstrates valid syntax in SQL?

INSERT INTO table_name (column1, column2) SET value1, value2;
INSERT INTO table_name VALUES (value1, value2);
INSERT INTO table_name (column1, column2) VALUES (value1, value2);
INSERT INTO table_name SET column1 = value1, column2 = value2;

INSERT INTO table_name (column1, column2) VALUES (value1, value2);

30. Which UPDATE statement uses valid syntax in SQL?

UPDATE table_name column_name= value1 WHERE condition
UPDATE table_name SET column1 = value1, column2 = value2 WHERE condition;
UPDATE SET column1 = value1, column2 = value2 WHERE condition IN table_name;
UPDATE table_name SET column_name = value1

UPDATE table_name SET column1 = value1, column2 = value2 WHERE condition;

31. Which statement used to remove data from temporary tables?

DROP
DELETE
DEFAULT
TRUNCATE

TRUNCATE

32. Which keyword is a DDL (data definition language) keyword in SQL?

INSERT
UPDATE
DELETE
CREATE

CREATE

33. Which property is enforced by a primary key field?

Not Null
Null
Duplicate
Numeric

Not Null

34. Which property or function allows SQL to insert a field value?

Composite
Simple
Auto-increment
Numeric

Auto-increment

35. Which property specifies an expression on one or more columns of a table?

CHECK
RESTRICT
CASCADE
UNIQUE

CHECK

36. Which type of key is a single column in a table used to identify a row?

Super key
Simple primary key
Foreign key
Composite primary key

Simple primary key

37. Which type of key is often represented by parentheses to show multiple columns?

Composite
Simple
Unique
Foreign

Composite

38. Which primary key characteristic means it includes information that is easy to type and store?

Non-null
Meaningless
Stable
Simple

Simple

39. Which part of a junction table in a relational database establishes a many-to-many relationship between two tables?

Primary key
Foreign key
Composite key
Candidate key

Foreign key

40. Which type of relationship is established by a foreign key in a relational database?

Aggregation
Composition
One-to-many
Many-to-many

One-to-many

41. Which statements may be rejected when a foreign key constraint is specified?

INSERT
CHECK
SELECT
ORDER BY

INSERT

42. Which key describes the unique columns in a table that do not contain a primary key?

Foreign key
Simple key
Candidate key
Composite key

Candidate key

43. Which index stores column values and row pointers in a hierarchy?

Dense index
Sparse index
Multi-level index
Secondary index

Multi-level index

44. Which type of index refers to entries that are assigned to buckets?

Bitmap index
Logical index
Hash index
Secondary index

Hash index

45. Which type of index is a grid of bits where each index row corresponds to a unique row in a table?

Multi-level index
Logical index
Hash index
Bitmap index

Bitmap index

46. Which type of development activity is used to convert an entity-relationship model (ERM) into tables, columns, and keys?

Execution
Physical design
Analysis
Logical design

Logical design

47. Which term addresses columns of A being a subset of the columns of B, with A always depending on B?

Third normal form
Candidate key
Trivial dependency
Non-trivial dependency

Trivial dependency

48. Which term describes the process of denormalization?

Merging tables
Candidate key
Trivial dependency
Third normal form

Merging tables

49. A database designer is working with a table that has a primary key and no duplicate rows. What can be concluded about the table?

It is a Boyce-Codd table.
It contains trivial dependency.
The table contains a non-key column.
It is a first normal form table.

It is a first normal form table.

50. Which process eliminates redundancy by decomposing a table into two or more tables in a higher normal form?

Merging
Creation of a candidate key
Evaluation of trivial dependency
Normalization

Normalization

51. Which term describes the optimal normal form for frequent inserts, updates, and deletes of data?

Boyce-Codd normal form
Candidate key
Trivial dependency
Third normal form

Boyce-Codd normal form

52. Which relationship is depicted in the following entity-relationship (ER) diagram?

Ternary
Many-to-many
One-to-one
One-to-many

One-to-many

Image: 52. Which relationship is depicted in the following entity-relationship (ER) diagram?

Ternary
Many-to-many
One-to-one
One-to-many

53. Which relationship or association exists between a supertype entity and its subtype entities?

IsA relationship
Weak entity
Associative entity
Unary relationship

IsA relationship

54. Which symbol is used in an entity-relationship (ER) diagram for an entity?

Diamond
Square
Circle
Line

Square

55. Which symbol is used in an entity-relationship (ER) diagram to indicate an attribute?

Diamond
Square
Circle
Line

Circle

56. What must be done before creating supertype and subtype entities?

Identify entities.
Document entities.
Determine attribute maxima.
Document cardinality

Identify entities.

57. Which real-world object can be distinctly identified and grouped, such as all employees in a company?

Entity-relationship (ER) diagram
Glossary
Entity
Attribute type

Entity

Which item in an entity-relationship (ER) diagram follows the attribute name and is placed outside of parentheses?

Weak entity
Attribute maximum
Attribute minimum
Physical design

Attribute maximum

59. Which strategy can be used to identify primary keys when investigating data relationships?

Trend analysis
Unique identifier analysis
Hypothesis testing
Visualization

Unique identifier analysis

60. How should an entity relationship phrase be read to correctly understand how one entity relates to another entity in an entity-relationship diagram (ER diagram)?

From top to bottom
In the direction the entity box is facing
In the direction the relationship verb is facing
From left to right

In the direction the relationship verb is facing

Image: 60. How should an entity relationship phrase be read to correctly understand how one entity relates to another entity in an entity-relationship diagram (ER diagram)?

From top to bottom
In the direction the entity box is facing
In the direction the relationship verb is facing
From left to right

Associative Entity
An associative entity is an element of the entity-relationship model. All relationships for the associative entity should be many.

Attribute
An attribute is a property or characteristic of an entity.

Binary relationship
A binary relationship is a relationship between two entity types.

Candidate Key
Is any column or a combination of columns that can qualify as unique key in database.

Cardinality
Cardinality represents the maximum number of entities that can be involved in a particular relationship.

Cartesian product
Usually the result of a missing join condition or a method of expanding the data of 1 table by the number of rows in the second table.

Cascade Delete
Will delete all records that reference the primary key

column subquery
Returns a single column of one or more values.

data encryption
When data is encrypted, it is changed, bit by bit or character by character, into a form that looks totally garbled. It can and must be reconverted, or decrypted, back to its original form to be of use.

Data normalization
Is a methodology for organizing attributes into tables so that redundancy among the non-key attributes is eliminated.

Data volatility
Describes how often stored data is updated.

Data Volume Assessment
Understanding of how much data will be in a database or a table within a database

Database
A database is a collection of information that is organized so that it can easily be accessed, managed, and updated.

DCL
Data control language is used to control access to data stored in a database.

DDL
Data definition language – involves instructing the DBMS software on what tables will be in the database, what attributes will be in the tables, which attributes will be indexed, and so forth.

Definer
Definer is a MySQL term where AuthID is the same for another DBMS

Denormalization
The act of duplicating data in a database for performance or the preservation of historical information

disaster recovery
Involves rebuilding an entire information system or significant parts of one after a catastrophic natural disaster such as a hurricane, tornado, earthquake, building collapse, or even a major fire.

DML
Data manipulation language refers to the four basic operations that can and must be performed on data stored in any DBMS: data retrieval, data update, insertion of new records, and deletion of existing records.

E-R model
An E-R model is a data model for describing a database in an abstract way.

embedded mode
The SELECT command is embedded within the lines of a higher-level language program and functions as an input or “read” statement for the program.

Entity
An entity is an object or event in our environment that we want to keep track of.

Foreign Key
Is a field (or collection of fields) in one table that uniquely identifies a row of another table.

Index
Used to help a DBMS find data quickly

Inner Join
Shows row that have matches in both tables

Intersection Data
Intersection Data associated with the concatenation of two segments.

Join
Joins 2 tables together

logical view
Is a mapping onto a physical table or tables that allows an end user to access only a specified portion of data.

Modality
Modality represents the minimum number of entity occurrences that can be involved in a relationship.

outer join
Shows rows in one table that have no match in the other table. Two kinds of outer joins are left and right joins.

Primary Key
Uniquely identifies each record in the table.

query mode
The command goes directly to the relational DBMS, which evaluates the query and processes it against the database.

Referential Integrity
Referential integrity is a database concept that ensures that relationships between tables remain consistent.

referential integrity
Enforces rules to guarantee that the foreign key relationship stays intact with no mismatches.

Response time
Is the delay from the time that the Enter Key is pressed to execute a query until the result appears on screen.

Restrict Delete
Will not allow deletes if the primary key is referenced

row subquery
Returns a single row of one or more values.

scalar subquery
is the most restrictive subquery because it produces only a single value

Set-to-Null on Delete
Will set values to null when primary key is deleted

SQL
Is a comprehensive database management language which incorporates DML and DDL

subquery
One SELECT statement is “nested” within another.

table subquery
Returns a table of one or more rows of one or more columns.

Ternary Relationship
A ternary relationship is when three entities participate in the relationship.

Unary Relationships
Unary relationships associate occurrences of an entity type with other occurrences of the same entity type.

One-to-One Binary Relationship
It means that a single occurrence of one entity type can be associated with a single occurrence of the other entity type and vice versa.

Throughput
Is the measure of how many queries from simultaneous users must be satisfied in a given period of time by the application set and the database that it supports.

UNION
To create a result set that combines the results from several queries

Unique Identifier
A unique identifier is any identifier which is guaranteed to be unique among all identifiers used for those objects and for a specific purpose.

Association Rules
Descriptive – discovers links or associations amongst data

Business Intelligence
The transformation of raw data into useful information.

Classification
Predictive – maps data into predefined groups or classes

Clustering
Descriptive – groups similar data together into clusters

Extraction/Transformation/Loading
ETL is the process of extracting raw data and then transforming and loading into a target to be used with Business intelligence.

Regression
Predictive – used to map a data item to a real valued prediction variable

Sequence Discovery
Descriptive – discovers sequential patterns

Summarization Rules
Descriptive – maps data into subsets with associated simple descriptions or generalizations.

Time Series Analysis
Predictive – analysis information over time to predict future data

  • in the SELECT clause
    It indicates that all attributes of the selected row are to be retrieved.

AND operator
It displays a record if more than one condition is true.

AVG() function
It returns the average value of a numeric column.

BETWEEN operator
It allows you to specify a range of numeric values in a search.

Data definition
It is operationalized with a data definition language (DDL), involves instructing the DBMS software on what tables will be in the database, what attributes will be in the tables, which attributes will be indexed, and so forth.

Data management
There are two aspects of data management: data definition and data manipulation.

DISTINCT operator
It is used to eliminate duplicate rows in a query result.

IN operator
It allows you to specify a list of character strings to be included in a search.

JOIN clause
It is used to combine rows from more than one table, based on a common field between them.

LIKE operator
It allows you to specify partial character strings in a “wildcard” sense.

OR operator
It displays a record if either the first condition OR the second condition is true.

ORDER BY clause
It simply takes the results of a SQL query and orders them by one or more specified attributes.

SELECT command
Data retrieval in SQL is accomplished with the SELECT command.

Structured Query Language
It is a standard language for data management in relational databases, known as Structured Query Language or SQL.

Subquery
When one SELECT statement is “nested” within another in a format, it is known as subquery.

Data
Facts gathered together for analysis

Flat Files
A file having no internal hierarchy

Hashed Files
A file that has been encrypted for security purposes

Heap Files
An unsorted set of records

Information
The transformation of raw data into useful facts

Punch Card
A card that is perforated and can hold commands or data

Structured Data
Information with a high degree of organization

Unstructured Data
Information that does not have structure (such as text)

ETL process
Get the data from the source location.
Map the data from its original form into a data model that is suitable for manipulation at the staging area.
Validate and clean the data.
Apply any transformations to the data that are required before the data sets are loaded into the repository.
Map the data from its staging area model to its loading model.
Move the data set to the repository.
Load the data into the warehouse.

Estimation
It is a process of assigning some continuously valued numeric value to an object.

Description
It is the process of trying to characterize what has been discovered or trying to explain the results of the data mining process.

Affinity grouping
It is a process of evaluating relationships or associations between data elements.

They can serve as a communication tool between the users and designers.
Which of the following is true of business rules?

entity
A(n) _ is anything about which data are to be collected and stored.

XML/Hybrid data model
Oracle 11g is an example of the _.

entity instance
Each row in the relational table is known as a(n)

relational data model
MySQL is an example of the _.

attribute
A(n) _ is the equivalent of a field in a file system.

entity relationship
The _ model was developed to allow designers to use a graphical tool to examine structures rather than describing them with text.

entity relationship
The _ model uses the term connectivity to label the relationship types.

1980s
The object-oriented data model was developed in the _ .

network
In the _ model, the user perceives the database as a collection of records in 1:M relationships, where each record can have more than one parent.

constraints
_ are important because they help to ensure data integrity.

Crow’s Foot notation
In __, a three pronged symbol represents the “many” side of the relationship.

data definition language (DDL)
A(n) _ enables a database administrator to describe schema components.

file system data model
VMS/VSAM is an example of the _.

1960s
The hierarchical data model was developed in the _.

hierarchical
One of the limitations of the _ model is that there is a lack of standards.

entity
A(n) _ represents a particular type of object in the real world.

data manipulation language (DML)
A(n) __ defines the environment in which data can be managed and is used to work with the data in the database.

constraint
A(n) _ is a restriction placed on the data.

They are not based on the relational model.
Which of the following is true of NoSQL databases?

provide fault tolerance.
NoSQL databases:

relationship
A verb associating two nouns in a business rule translates to a(n) _ in the data model.

business rule
A(n) _ is a brief, precise, and unambiguous description of a policy, procedure, or principle within a specific organization.

business rules
From a database point of view, the collection of data becomes meaningful only when it reflects properly defined _ .

It does not have standards
Which of the following is a disadvantage of the hierarchical data model?

relation
The relational model’s foundation is a mathematical concept known as a

relationship
A(n) _ is bidirectional.

1970s
The relational data model was developed in the _.

hierarchical
In the _ model, each parent can have many children, but each child has only one parent.

client node
Which of the following types of HDFS nodes acts as the interface between the user application and the HDFS?

attribute
Each column in a relation represents a(n)

method
In object oriented terms, a(n) __ defines an object’s behavior.

tuple
Each row in a relation is called a(n)

data model
A(n) __ is a relatively simple representation of more complex real-world data structures.

constraints
_ are normally expressed in the form of rules.

many-to-many
Students and classes have a _ relationship.

model
A(n) __‘s main function is to help one understand the complexities of the real world environment.

class
A(n) _ is a collection of similar objects with a shared structure and behavior.

entity
A noun in a business rule translates to a(n) _ in the data model.

schema
A(n) _ is the conceptual organization of an entire database as viewed by a database administrator.

name node
Which of the following types of HDFS nodes stores all the metadata about a file system?

hierarchical
In the _ model, the basic logical structure is represented as an upside-down tree.

object-oriented
The _ data model is said to be a semantic data model.

segment
A(n) _ in a hierarchical model is the equivalent of a record in a file system.

object-oriented
The _ data model uses the concept of inheritance.

unique
When you define a table’s primary key, the DBMS automatically creates a(n) _ index on the primary key column(s) you declared.

data dictionary
A(n) _ provides a detailed description of all tables found within the user/designer-created database.

1:M
The _ relationship is the relational database norm.

candidate
A _ key can be described as a minimal superkey, a superkey without any unnecessary attributes.

M:N
________relationships cannot be implemented as such in the relational model.

bridge
Another name for a composite entity is a(n) _ entity.

1:M
The _ relationship is the “relational model ideal.”

secondary
A _ key is defined as a key that is used strictly for data retrieval purposes.

entity
The CUSTOMER table’s primary key is CUS_CODE. The CUSTOMER primary key column has no null entries, and all entries are unique. This is an example of _ integrity.

system
The _ catalog can be described as a detailed system data dictionary that describes all objects within the database, including data about table names, the table’s creator and creation date, the number of columns in each table, the data type corresponding to each column, index filenames, index creators, authorized users, and access privileges.

M:N
_ relationships can be implemented by creating a new entity in 1:M relationships with the original entities.

predicate
_ logic, used extensively in mathematics, provides a framework in which an assertion (statement of fact) can be verified as either true or false.

index
A(n) _ is an orderly arrangement used to logically access rows in a table.

keys
In the relational model, _ are important because they are used to ensure that each row in a table is uniquely identifiable.

superkey
A _ is any key that uniquely identifies each row.

1:1
If one department chair “a professor “can chair only one department, and one department can have only one department chair. The entities PROFESSOR and DEPARTMENT exhibit a(n) _ relationship.

column
Each table _ represents an attribute.

attributes
An index key can have multiple _ (a composite index).

integrity
Referential _ dictates that the foreign key must contain values that match the primary key in the related table, or must contain null.

keys
In a relational model, _ are also used to establish relationships among tables and to ensure the integrity of the data

attribute
In a relational table, each column has a specific range of values known as the _ domain.

relation
A table is also called a(n) _ because the relational model’s creator, E. F. Codd, used the two terms as synonyms.

foreign key
A _ is the primary key of one table that has been placed into another table to create a common attribute.

candidate
A primary key is a(n) _ key chosen to be the primary means by which rows of a table are uniquely identified.

NOT NULL
The _ constraint can be placed on a column to ensure that every row in the table has a value for that column.

1:1
The _ relationship should be rare in any relational database design.

Julian
Date attributes contain calendar dates stored in a special format known as the _ date format.

unique
A(n) __ index is an index in which the index key can have only one pointer value (row) associated with it.

foreign
The proper use of _ keys is crucial to controlling data redundancy.

table
You can think of a _ as a persistent representation of a logical relation.

RDBMSs
_ enforce integrity rules automatically.

warehousing
Proper data __ design requires carefully defined and controlled data redundancies to function properly.

synonym
In a database context, a(n) _ indicates the use of different names to describe the same attribute.

flags
To avoid nulls, some designers use special codes, known as _ , to indicate the absence of some value.

1:1
One characteristic of generalization hierarchies is that they are implemented as __ relationships.

system catalog
The _ is actually a system-created database whose tables store the user/designer-created database characteristics and contents.

derived
The decision to store _ attributes in database tables depends on the processing requirements and the constraints placed on a particular application.

small circle
In Crow’s Foot notation, an optional relationship between entities is shown by drawing a(n) _ on the side of the optional entity.

one
Ideally, an entity identifier is composed of _ attribute(s).

entity
Knowing the minimum and maximum number of _ occurrences is very helpful at the application software level.

binary
To simplify the conceptual design, most higher-order relationships are decomposed into appropriate equivalent _ relationships whenever possible.

1:1
The Crow’s foot symbol with two vertical parallel lines indicates _ cardinality.

existence
A weak entity must be _
dependent.

dashed line
A derived attribute is indicated in the Chen notation by a _ that connects the attribute and an entity.

degree
A relationship
__ indicates the number of entities or participants associated with a relationship.

optional
Participation is _ if one entity occurrence does not require a corresponding entity occurrence in a particular relationship.

iterative
What process is based on repetition of processes and procedures.

symbols
When the specific cardinalities are not included on the diagram in Crow’s Foot notation, cardinality is implied by the use of _.

single-valued
A person’s Social Security number would be an example of a(n) _ attribute.

minimum
When indicating cardinality, the first value represents the _ number of associated entities.

unary
A _ relationship exists when an association is maintained within a single entity.

primary key
The concept of relationship strength is based on how the _ of a related entity is defined.

optional
The existence of a(n) _ entity indicates that its minimum cardinality is zero.

cardinality
_ expresses the minimum and maximum number of entity occurrences associated with one occurrence of the related entity.

rectangle
The Chen notation identifies a weak entity by using a double-walled entity __

ternary
A _ relationship exists when three entities are associated.

composite
A _ attribute can be further subdivided to yield additional attributes.

UML
The _ notation of entity-relationship modelling can be used for both conceptual and implementation modelling.

required
A(n) __ attribute is an attribute that must have a value.

solid
When using the Crow’s Foot notation, the associative entity is indicated by _ relationship lines between the parents and the associative entity.

information
Complex _ requirements may dictate data transformations, and they may expand the number of entities and attributes within the design.

relationships
Identifying the attributes of entities helps in the better understanding of _ among entities.

M:N
The entity relationship model uses the associative entity to represent a(n) _ relationship between two or more entities.

recursive
If an employee within an EMPLOYEE entity has a relationship with itself, that relationship is known as a _ relationship.

participation
Failure to understand the distinction between mandatory and optional __ in relationships might yield designs in which awkward (and unnecessary) temporary rows (entity instances) must be created just to accommodate the creation of required entities.

derived
A(n) _ attribute need not be physically stored within the database.

attributes
_ are characteristics of entities.

creating a detailed narrative of the organization’s description of operations
The first step in building an entity-relationship diagram (ERD) is _ .

classification
A relationship _ is difficult to establish if only one side of the relationship is known.

strong
If an entity can exist apart from all of its related entities, then it is existence-independent, and it is referred to as a(n) _ entity.

high processing speeds
In organizations that generate large number of transactions, _ are often a top priority in database design.

weak
A _ entity has a primary key that is partially or totally derived from the parent entity in the relationship.

solid
The Crow’s Foot notation depicts the strong relationship with a(n) __ line between the entities.

documentation
_ not only helps database designers to stay on track during the design process, it also enables them to pick up the design thread when the time comes to modify the design.

conceptual
The entity relationship diagram (ERD) represents the _ database as viewed by the end user.

domain
A(n) _ is the set of possible values for a given attribute.

optional to
If Tiny College has some departments that are classified as “research only” and do not offer courses, the COURSE entity of the college database would be _ the DEPARTMENT entity.

strong
A(n) _ relationship is also known as an identifying relationship.

M:N
The conceptual model can handle _ relationships and multivalued attributes.

simple
A _ attribute is one that cannot be subdivided.

identifiers
_ are underlined in an ER diagram.

existence
An entity is said to be _ -dependent if it can exist in the database only when it is associated with another related entity occurrence.

entities
A relationship is an association between _.

primary
Because a partial dependency can exist only when a table’s primary key is composed of several attributes, a table whose __ key consists of only a single attribute is automatically in 2NF once it is in 1NF.

partial dependency
A dependency based on only a part of a composite primary key is called a

normal forms
Normalization works through a series of stages called

prime
An attribute that is part of a key is known as a(n) _ attribute.

first normal form
All relational tables satisfy the _ requirements.

partial dependency
A dependency based on only a part of a composite primary key is known as a _

anomalies
A table that displays data redundancies yields _.

4NF
Some very specialized applications may require normalization beyond the _.

3NF
A table that is in 2NF and contains no transitive dependencies is said to be in _.

repeating group
A relational table must not contain a(n) _.

partial
Dependencies based on only a part of a composite primary key are known as _ dependencies.

single entity
Repeating groups must be eliminated by ensuring that each row defines a

granularity
refers to the level of detail represented by the values stored in a table’s row.

Crow Foot notation ERDs
When designing a new database structure based on the business requirements of the end users, the database designer will construct a data model using a technique such as __.

3NF
For most business transactional databases, we should normalize relations into _.

transitive dependency
A(n) _ exists when there are functional dependencies such that Y is functionally dependent on X, Z is functionally dependent on Y, and X is the primary key.

anomalies
The problem with transitive dependencies is that they still yield data

4NF
In the __ , no row may contain two or more multivalued facts about an entity.

three
If you have three different transitive dependencies, _ different determinant(s) exist.

2NF
A table that is in 1NF and includes no partial dependencies is said to be in _.

1NF
A table that has all key attributes defined, has no repeating groups, and all its attributes are dependent on the primary key is said to be in _.

partial dependency
A(n) _ exists when there are functional dependencies such that XY is functionally dependent on WZ, X is functionally dependent on W, and XY is the primary key.

the level of detail represented by the values in a table’s row
Granularity refers to _.

2NF
Before converting a table into 3NF, it is imperative that the table already be in _.

normalization
is a process to help reduce the likelihood of data anomalies.

DKNF
Of the following normal forms, _ is mostly of theoretical interest.

three
Normalization works through a series of stages called normal forms. For most purposes in business database design, _ stages are as high as you need to go in the normalization process.

prime attribute
Any attribute that is at least part of a key is known as a __.

2NF
It is possible for a table in __ to exhibit transitive dependency, where the primary key may rely on one or more nonprime attributes to functionally determine other nonprime attributes.

atomicity
An attribute that cannot be further subdivided is said to display __.

determinant
Any attribute whose value determines other values within a row is known as a __.

normalization stages
1NF, 2NF, and 3NF are _.

dependency
In a(n) _ diagram, the arrows above the attributes indicate all desirable dependencies.

derived
From a system functionality point of view, _ attribute values can be calculated when they are needed to write reports or invoices.

normalization
The objective of __ is to ensure that each table conforms to the concept of well-formed relations.

3NF
Most designers consider the BCNF as a special case of the _.

make sure that entities are in normal form before table structures are created
When designing a database, you should _.

surrogate
In a real-world environment, changing granularity requirements might dictate changes in primary key selection, and those changes might ultimately require the use of _ keys.

data integrity problems
Data redundancy produces _ .

cannot be further subdivided
An atomic attribute _ .

3NF and the BCNF
When a table contains only one candidate key, _ are considered to be equivalent.

relational
All __ tables satisfy the 1NF requirements.

atomicity
Improving _ leads to more flexible queries.

natural
A(n) _ join links tables by selecting only the rows with common values in their common attribute(s).

diagram
Dependencies can be identified with the help of a dependency _.

repeating group
A _ derives its name from the fact that a collection of multiple entries of the same type can exist for any single key attribute occurrence.

candidate
BCNF can be violated only if the table contains more than one _ key.

determines
Attribute A _ attribute B if all of the rows in the table that agree in value for attribute A also agree in value for attribute B.

BCNF
A table is in __ if every determinant in the table is a candidate key.

1NF
From a structural point of view, 2NF is better than _.

concept of keys
The _ is central to a discussion of normalization.

2NF
From a structural point of view, 3NF is better than _.

partial dependencies
If a table has multiple candidate keys and one of those candidate keys is a composite key, the table can have _ based on this composite candidate key even when the primary key chosen is a single attribute.

LIKE
The special operator used to check whether an attribute value matches a given string pattern is _.

SELECT
To make the output more readable, the SQL standard permits the use of aliases for any column in a __ statement.

WHERE
ANSI-standard SQL allows the use of special operators in conjunction with the __ clause.

restrictions
You can select partial table contents by naming the desired fields and by placing __ on the rows to be included in the output.

logical
SQL allows the use of __ restrictions on its inquiries such as OR, AND, and NOT.

Performing operations within parentheses
According to the rules of precedence, which of the following computations should be completed first?

SELECT
The SQL command that allows a user to list the contents of a table is _.

queries
A database language enables the user to perform complex __ designed to transform the raw data into useful information.

HAVING
The SQL data manipulation command __ restricts the selection of grouped rows based on a condition:

ISO
The ANSI SQL standards are also accepted by the _.

IS NULL
The _ special operator is used to check whether an attribute value is null.

COMMIT
The SQL command that allows a user to permanently save data changes is _.

cascading
A(n) __ order sequence is a multilevel ordered sequence that can be created easily by listing several attributes, separated by commas, after the ORDER BY clause.

SELECT
The _ command, coupled with appropriate search conditions, is an incredibly powerful tool that enables a user to transform data into information.

EXISTS
The special operator used to check whether a subquery returns any rows is _.

alias
A(n) _ is an alternate name given to a column or table in any SQL statement.

Microsoft Access
Some RDBMSs, such as _ , automatically make the necessary conversions to eliminate case sensitivity.

one hundred
The basic SQL vocabulary has fewer than __ words.

INSERT
The SQL command that allows a user to insert rows into a table is _.

query
In the SQL environment, the word _ covers both questions and actions.

BETWEEN
The special operator used to check whether an attribute value is within a range of values is _.

Boolean
A specialty field in mathematics, known as __ algebra, is dedicated to the use of logical operators.

conditional
in SQL, all __ expressions evaluate to true or false.

Business intelligence
_ tools focus on the strategic and tactical use of information.

data warehouse
Business intelligence tools use the _ data as the raw materials for data analytics to generate business knowledge.

Business intelligence
__ functionality ranges from simple data gathering and transformation to very complex data analysis and presentation.

framework
Business intelligence is a __ that allows a business to transform data into information, information into knowledge, and knowledge into wisdom

architecture
Business intelligence (BI) __ is composed of data, people, processes, technology, and the management of such components.

Business intelligence
_ is a term used to describe a comprehensive, cohesive, and integrated set of tools and processes used to capture, collect, integrate, store, and analyze data with the purpose of generating and presenting information used to support business decision making.

DBMS
A(n) __ is a collection of programs that manages the database structure and controls access to the data stored in the database.

single-user
A desktop database is a _ database.

structured data
_ are the result of formatting disorganized data in order to facilitate storage, use and generation of information.

query result set
The response of the DBMS to a query is the __.

unstructured
_ data exist in the format in which they were collected.

DBMSs
_ serve as the intermediary between the user and the database.

enterprise
A(n) __ database is used by an organization and supports many users across many departments.

independence
Data __ exists when it is possible to make changes in the data storage characteristics without affecting an application program’s ability to access data.

unstructured
_ data exist in a format that does not lend itself to processing that yields information.

context
To reveal meaning, information requires _.

metadata
__ describe the data characteristics and the set of relationships that links the data found within the database.

record
A _ is a logically connected set of one or more fields that describes a person, place, or thing.

ad hoc query
A(n) __ is a spur-of-the-moment question.

database system
The term _ refers to an organization of components that define and regulate the collection, storage, management and use of data within a database environment.

data redundancy
exists when the same data are stored unnecessarily at different places.

They enhance the user’s ability to understand the data.
Which of the following is true of spreadsheet applications?

its expected use
The organization of data within folders in a manual file system is determined by _.

operational databases
Data warehouse contains historical data obtained from the_____.

multiuser
A workgroup database is a(n) _ database.

good decision making
Accurate, relevant, and timely information is the key to _.

field
A _ is a character or group of characters that has a specific meaning.

application
A(n) _ might be written by a programmer or it might be created through a DBMS utility program.

performance tuning
_ relates to the activities that make the database execute transactions more efficiently in terms of storage and access speed.

the data always yields consistent results
Data is said to be verifiable if:

raw facts of interest to the end user
End-user data is __.

workgroup
A(n) _ database supports a relatively small number of users (usually fewer than 50) or a specific department within an organization.

data dictionary
The DBMS uses the __ to look up the required data component structures and relationships, thus relieving programmers from having to code such complex relationships in each program.

data inconsistency
_ exists when different and conflicting versions of the same data appear in different places.

In-memory databases
_ refer to a type of database that stores most of its data in RAM rather than in hard disks.

metadata
__ provide a description of the data characteristics and the set of relationships that link the data found within the database.

anomaly
A(n) _ develops when all required changes in the redundant data are not made successfully.

information
_ is the result of revealing the meaning of raw facts.

data
Information is produced by processing _.

analytical
__ databases focus primarily on storing data used to generate information required to make tactical or strategic decisions.

data dictionary
A _ contains at least all of the attribute names and characteristics for each table in the system.

information
__ is the result of processing raw data to reveal its meaning.

knowledge
_ is the body of information and facts about a specific subject.

data redundancy
Which of the following refers to the situation where the same data is stored unnecessarily at different places?

data inconsistency
_ exists when different versions of the same data appear in different places.

data integrity
_ is defined as the condition in which all of the data in the database are consistent with the real-world events and conditions.

collection of files
The database structure in a DBMS is stored as a __.

file
A _ is a collection of related records.

semistructured
An XML database supports the storage and management of _ XML data.

XML
__ is a special language used to represent and manipulate data elements in a textual format.

formatted
Raw data must be properly __ for storage, processing and presentation.

query
A(n) __ is a specific request issued to the DBMS for data manipulation

semistructured
Most data that can be encountered are best classified as _ .

islands of information
The term __ refers to scattered locations storing the same basic data.

metadata
__ is data about data through which the end-user data are integrated and managed

performance tuning
_ relates to activities that make a database operate more efficiently in terms of storage and access speed.

ad hoc query
A “spur-of-the-moment” question

analytical database
A database focused primarily on storing historical data and business metrics used for tactical or strategic decision making

availability
In the context of data security, it refers to the accessibility of data whenever required by authorized users and for authorized purposes

centralized database
A database located at a single site

cloud database
A database that is created and maintained using cloud services, such as Microsoft Azure or Amazon AWS

data
Raw facts, or facts that have not yet been processed to reveal their meaning to the end user

data anomaly
An abnormality of the data in which inconsistent changes have been made to a database (For example, an employee moves, but the address changes is not corrected in all files in the database)

data dependence
A data condition in which data representation and manipulation are dependent on the physical data storage characteristics

data dictionary
A DBMS component that stores metadata. Contains the data definition, as well as their characteristics and relationships

data inconsistency
A condition in which different versions of the same data yield different results

data independence
A condition in which data access is unaffected by changes in the physical data storage characteristics

data integrity
In a relational database, a condition in which the data in the database complies with all entity and referential integrity constraints

data management
A process that focuses on data collection, storage, and retrieval.

data processing (DP) specialist
The person responsible for developing and managing a computerized file processing system

data redundancy
Exists when the same data is stored unnecessarily at different places

database
A shared, integrated computer structure that houses a collection of related data including end-user data (raw facts) and metadata (data about the data)

database design
The process that yields the description of the database structure and determines the database components. The second phase of the database life cycle

database management system (DBMS)
The collection of programs that manages the database structure and controls access to the data stored in the database

database system
An organization of components that defines and regulates the collection, storage, management, and use of data in a database environment

desktop database
A single-user database that runs on a personal computer

discipline-specific database
A database that contains data focused on specific subject areas

enterprise database
The overall company data representation, which provides support for present and expected future needs

field
An alphabetic or numeric character or group of characters that defines a characteristic of a person, place, or thing. (for example, a person’s SSN, address, phone number, and bank balance all constitute fields)

file
A named collection of related records

general-purpose database
A database that contains a wide variety of data used in multiple disciplines

hub
A warehouse of data packets housed in a central location on a local area network. It contains multiple ports that copy the data in the data packets to make it accessible to selected or all segments of the network

information
The result of processing raw data to reveal its meaning. It consists of transformed data and facilitates decision making

islands of information
In the old file system environment, pools of independent, often duplicated, and inconsistent data created and managed by different departments

knowledge
The body of information and facts about a specific subject. It implies familiarity, awareness, and understanding of information as it applies to an environment

logical data format
The way a person views data within the context of a problem domain

metadata
Data about data

multiuser database
A database that supports multiple concurrent users

NoSQL
A new generation of database management systems that is not based on the traditional relational database model. It stores structured and unstructured data in efficient ways.

online analytical processing (OLAP)
Decision support system (DSS) tools that use multidimensional data analysis techniques. It creates an advanced data analysis environment that supports decision making, business modeling, and operations research

operational database
A database designed primarily to support a company’s day-to-day operations. (Also known as a transactional database, OLTP database, or production database)

performance tuning
Activities that make a database perform more efficiently in terms of storage and access speed

physical data format
The way a computer “sees” (stores) data

query
A question or task asked by an end user of a database in the form of SQL code. A specific request for data manipulation issued by the end user or the application to the DBMS

query language
A nonprocedural language that is used by a DBMS to manipulate its data (such as SQL)

query result set
The collection of data rows returned by a query

record
A collection of related (logically connected) fields

role
In Oracle, a named collection of database access privileges that authorize a user to connect to a database and use its system resources

router
1) An intelligent devise used to connect dissimilar networks
2) Hardware/software equipment that connects multiple and diverse networks

semistructured data
Data that has already been processed to some extent

single-user database
A database that supports only one user at a time

social media
Web and mobile technologies that enable “anywhere, anytime, always on” human interactions

structural dependence
A data characteristic in which a change in the database schema affects data access, thus requiring changes in all access programs

structural independence
A data characteristic in which changes in the database schema do not affect data access

structured data
Data that has been formatted to facilitate storage, use, and information generation

structured query language (SQL)
A powerful and flexible relational database language composed of commands that enable users to create database and table structures, perform various types of data manipulation and data administration, and query the database to extract useful information

unstructured data
Data that exists in its original, raw state; that is, in the format in which it was collected

website
Refers to the web server and the collection of web pages stored on the local hard disk of the server computer

workgroup database
A multiuser database that usually supports fewer than 50 users or is used for a specific department in an organization

world wide web (WWW or the web)
Worldwide network collection of specially formatted and interconnected documents known as web pages

XML database
A database system that stores and manages semistructured XML data

file structure
The format that data is arranged and stored

flat files
Files having no internal hierarchy

heap files
Files containing an unsorted set of records that are uniquely identified by a record ID which allows them to be inserted or deleted using that ID

index files
Files that store a list of lookup field values from a data file

hashed files
Files that are encrypted using hash functions that convert data consisting of various formats into numeric values

data management
The discipline that focuses on the proper generation, storage, and retrieval of data

distributed database
Supports data across multiple sites

general purpose database
Contains a wide variety of data used in multiple disciplines

business intelligence
A comprehensive approach to capture and process business data with the purpose of generating information to support business decision making

semi-structured data
Data that is already processed to some extent

extensible markup language (XML)
Used to represent and manipulate data elements in a textual format

data dictionary management
Removes structural and data dependency and provides the user with data abstraction. The DBMS uses this function to look up the required data component structures and relationships

data transformation and presentation
Exists to transform any data entered into required data structures. By using this function, the DBMS can determine the difference between logical and physical data formats

security management
Sets rules that determine specific users that are allowed to access the database. This function also sets restraints on what specific data any user can see or manage

data modeling
The process of creating a specific data model for a determined problem domain

problem domain
A clearly defined area within the real-world environment, with a well-defined scope and boundaries that will be systematically addressed

data model
A representation, usually graphic, of a complex “real-world” data structure. Used in the database design phase of the database life cycle. It represents data structures and their characteristics, relations, constraints, transformations, and other constructs with the purpose of supporting a specific problem domain. A blueprint with all the instructions to build a database that will meet all end-user requirements.

basic building blocks of all data models
Constraints, attributes, relationships, entities (data models CARE!)

entity
A person, place, thing, concept, or event for which data can be sored

attribute
A characteristic of an entity or object. It has a name and a data type.

relationship
An association between entities

constraint
A restriction placed on data, usually expressed in the form of rules

one-to-one (1:) relationship
Exists between a pair of tables when a single record in the first table is related to only one record in the second table, and a single record in the second table is related to only one record in the first table

one-to-many (1:M) relationship
Exists between a pair of tables when a single record in the first single record in the first table can be related to one or more records in the second table, but a single record in the second table can be related to only one record in the first table.

many-to-many (M:M) relationship
Exists between a pair of tables when a single record in the first table can be related to one or more records in the second table and a single record in the second table can be related to one or more records in the first table.

business rule
A description of a policy, procedure, or principle within an organization.

hierarchical model
An early database model whose basic concepts and characteristics formed the basis for subsequent database development. It is based on an upside-down tree structure in which each record is called a segment. The top record is the root segment. Each segment has a 1:M relationship to the segment directly below it.

segment
In the hierarchical data model, the equivalent of a file system’s record type.

network model
An early data model that represented data as a collection of record types in 1:M relationships.

schema
A logical grouping of database objects, such as tables, indexes, views, and queries that are related to each other.

subschema
The portion of the database that interacts with application programs

data manipulation language (DML)
The set of commands that allows an end user to manipulate the data in the database, such as SELECT, INSERT, UPDATE, DELETE etc.

data definition language (DDL)
The language that allows a database administrator to define the database structure, schema, and subschema

relational model
Based on mathematical set theory and represents data as independent relations. Each relation (table) is related to each other through the sharing of common entity characteristics. Developed by E. F. Codd of IBM in 1970

relation or table
A two-dimensional structure composed of intersecting rows (entities) and columns (attributes).

tuple
A row in the relational model

relational database management system (RDBMS)
A collection of programs that manages a relational database

SQL-based relational database parts

  1. End-user interface
  2. A collection of tables stored in the database
  3. SQL engine

entity relationship (ER) model (ERM)
A data model that describes relationships (1:1, 1:M, and M:N) among entities at the conceptual level with the help of ER diagrams.

entity relationship diagram (ERD)
A diagram that depicts an entity relationship model’s entities, attributes, and relations

entity instance or entity occurance
A row in a relational table

entity set
A collection of like entities

object-oriented data model (OODM)
A data model whose basic modeling structure is an object

object
An abstract representation of a real world entity that has a unique identity, embedded properties, and the ability to interact with other objects and itself

object-oriented database management system (OODBMS)
Data management software used to manage data in an object-oriented database model

relational database
A collection of relations that contain the data describing a particular business environment.

class
A collection of similar objects with shared structure (attributes) and behavior (methods).

method
In the object-oriented data model, a named set of instructions to perform an action.

class hierarchy
The organization of classes in a hierarchical tree in which each parent class is a superclass and each child class is a subclass.

inheritance
In the object-oriented data model, the ability of an object to inherit the data structure and methods of the classes above it in the class hierarchy

unified modeling language (UML)
A language based on object-oriented concepts that provides tools such as diagrams and symbols to graphically model a system.

extended relational data model (ERDM)
A model that includes the object-oriented model’s best features in an inherently simpler relational database structural environment

object/relational database management system (O/R DBMS)
A DBMS based on the extended relational model (ERDM).

big data
A movement to find new and better ways to manage large amounts of web-generated data and derive business insight from it, while simultaneously providing high performance and scalability at a reasonable cost

volume, velocity, and variety (3 Vs)
Three basic characteristics of big data databases.

volume
In big data, this refers to the amounts of data being stored

velocity
In big data, this refers to the speed in which data grows and the need to process this data quickly in order to generate information and insight.

variety
In big data, this refers to the fact that the data being collected comes in multiple different data formats.

Hadoop
A Java based, open source, high speed, fault-tolerant distributed storage and computational framework for big data.

Hadoop Distributed File System (HDFS)
A highly distributed, fault-tolerant file storage system designed to manage large amounts of data a high speeds.

name node
One of three types of nodes used in the Hadoop Distributed File System (HDFS). It stores all the metadata about the file system.

data node
One of the three types of nodes used in the Hadoop Distributed File System (HDFS). It stores fixed-size data blocks.

client node
One of the three types of nodes used in the Hadoop Distributed File System (HDFS). It acts as the interface between the user application and the HDFS.

MapReduce
An open-source application programming interface (API) that provides fast data analytics services used for big data.

key-value
A data model based on a structure composed of two data elements: a key and a value, in which every key has a corresponding value or set of values. Also called the associative or attribute-value data model

sparse data
A case in which the number of table attributes is very large but the number of actual data instances is low

eventual consistency
A model for database consistency in which updates to the database will propagate through the system so that all data copies will be consistent eventually.

3 vs
Three basic characteristics of big data databases: volume, velocity, and variety

abstract data type (ADT)
Data type that describes a set of similar objects with shared and encapsulated data representation and methods. It is generally used to describe complex objects

American National Standards Institute (ANSI)
The group that accepted the DBTG recommendations and augmented database standards in 17=975 through its SPARC committee

application programming interface (API)
Software through which programmers interact with middleware. It allows the use of generic SQL code, thereby allowing client processes to be database server-independent

balancing
Ensuring that the processing load is distributed evenly among servers

Chen notation
Same as entity relationship (ER) model

class diagram notation
The set of symbols used in the creation of class diagrams in UML object mdeling

class diagrams
A diagram used to represent data and their relationships in UML object notation

client
Any process that requests specific services from server processes in a client/server environment

complex object
An object formed by several different objects in complex relationships

conceptual model
The output of the conceptual design process. It provides a global view of an entire database and describes the main data objects, avoiding details

conceptual schema
A representation of the conceptual model, usually expressed graphically

connectivity
The type of relationship between entities. Classifications include 1:1, 1:M, and M:N

Crow’s Foot notation
A representation of the entity relationship diagram that uses a three-pronged symbol to represent the “many” sides of the relationship

entity instance
A row in a relational table. Also known as entity occurrence

entity occurrence
A row in a relational table. Also known as entity instance

entity relationship model (ERM)
A data model that describes relationships (1:1, 1:M, and M:N) among entities at the conceptual level with the help of ER diagrams. Developed by Peter Chen

external model
The application programmer’s view of the data environment. Given its business focus, this works with a data subset of the global database schema

external schema
The specific representation of an external view; the end user’s view of the data environment

Hadoop distributed file system (HDFS)
A highly distributed, fault-tolerant file storage system designed to manage large amounts of data at high speeds

hardware independence
A condition in which a model does not depend on the hardware used in the model’s implementation. Changes in the hardware will have no effect on the database design at the conceptual level

internal model
In database modeling, a level of data abstraction that adapts the conceptual model to a specific DBMS model for implementation. It is the representation of a database as “seen” by the DBMS

internal schema
A representation of an internal model using the database constructs supported by the chosen database

internet
A global network of computers connected together through a standard network protocol known as Transmission Control Protocol/Internet Protocol (TCP/IP)

relation
A logical construct perceived to be a two dimensional structure composed of intersecting rows (entities) and columns (attributes) that represents an entity set in the relational model

relational diagram
A graphical representation of a relational database’s entities, the attributes within those entities, and the relationships among the entities

superclass
In a class hierarchy, this is the more general classification from which the subclasses inherit data structures and behaviors

table
A logical construct perceived to be a two dimensional structure composed of intersecting rows (entities) and columns (attributes) that represents an entity set in the relational model

versioning
A property of an OODBMS that allows the database to keep track of the different transformations performed on an object

candidate key
A minimal superkey; a key that does not contain a subset of attributes that is itself a superkey

composite entity
An entity designed to transform a M:N relationship into two 1:M relationships. It’s primary key comprises at least the primary key of the entities that it connects.

composite key
A multiple-attribute key

dependent
An attribute whose value is determined by another attribute

determination
The role of a key. In the context of a database table, the statement “A determines B” indicates that knowing the value of attribute A means that the value of B can be looked up

domain
In data modeling,the construct used to organize and describe an attribute’s set of possible values

entity integrity
The property of a relational table that guarantees each entity has a unique value in a primary key and that the key has no null values

flags
Special codes implemented by designers to trigger a required response, alert end users to specified conditions, or encode values. They may be used to prevent nulls by bringing attention to the absence of a value in a table

foreign key (FK)
An attribute or attributes in one table whose values must match the primary key in another table or whose values must be null

full functional dependence
A condition in which an attribute is functionally dependent on a composite key but not on any subset of the key

functional dependence
One attribute determines another

index
An ordered array of values and row IDs generally used to speed up and facilitate data retrieval

key
One or more attributes that determine other attributes

key attribute
The attributes that form a primary key

linking table
In the relational model, a table that implements a M:N relationship

null
The absence of an attribute value

predicate logic
Used extensively in mathematics to provide a framework in which a fact can be verified as true or false

primary key
In the relational model,an identifier composed of one or more attributes that uniquely identifies a row

prime attribute
A key attribute

referential integrity
A condition by which a dependent table’s foreign key must have either a null entry or a matching entry in the related table

secondary key
A key used to strictly for data retrieval purposes

set theory
A part of a mathematical science that deals with groups of things and is used as the basis for data manipulation in the relational model

superkey
An attribute or attributes that uniquely identify each entity in a table

unique index
An index in which the index key can have only one associated pointer value or row

binary relationship
An er term for an association (relationship) between two entities

cardinality
A property that assigns a specific value to connectivity and expresses the range of allowed entity occurrences associated with a single occurrence of the related entity

composite attribute
An attribute that can be further subdivided to yield additional attributes. For example, a phone number may be divided into an area code (630) and exchange number (421), and a four-digit code (9009)

composite identifier
In ER modeling, a key composed of more than one attribute

derived attribute
An attribute that does not physically exist within the entity and is derived via an algorithm. For example, the age attribute might be derived by subtracting the birth date from the current date

existence-dependent
A property of an entity whose existence depends on one or more other entities.

existence-independent
A property of an entity that can exist apart from one or more related entities

identifiers
One or more attributes that uniquely identify each entity instance

iterative process
A process based on repetition of steps and procedures

mandatory participation
A relationship in which one entity occurrence must have a corresponding occurrence in another entity

multivalued attributes
An attribute that can have many values for a single entity occurrence

non-identifying relationship
A relationship in which the primary key of the related entity does not contain a primary key component of the parent entity

optional attribute
In ER modeling, an attribute that does not require a value; therefore it can be left empty

optional participation
In ER modeling, a condition in which one entity occurrence does not require a corresponding entity occurrence in a particular relationship

participants
An ER term for entities that participate in a relationship

recursive relationshp
A relationship found within a single entity type

relational schema
The organization of a relational database as described by the database administrator

relationship degree
The number of entities or participants associated with a relationship It can be unary, binary, ternary, or higher

required attribute
In ER modeling, an attribute that must have a value. It cannot be left empty

simple attribute
An attribute that cannot be subdivided into meaningful components

single-valued attribute
An attribute that can have only one value

strong (identifying) relationship
A relationship that occurs when two entities are existence-dependent; from a database design perspective, this relationship exists whenever the primary key of the related entity contains the primary key of the parent entity

strong entity
An entity that is existence-independent, that is, it can exist apart from all of its related entities

ternary relationship
An ER term used to describe an association (relationship) between three entities

unary relationship
An ER term used to describe an association within an entity

weak entity
An entity that displays existence dependence and inherits the primary key of its parent eneity

weak relationship
A relationship in which the primary key of the related entity does not contain a primary key component of the parent entity

master key

atomic attribute
An attribute that cannot be further subdivided to produce meaningful components

atomicity
The transaction property that requires all parts of a transaction to be treated as a single, indivisible, logical unit of work. All parts of a transaction must be completed or the entire transaction is aborted

Boyce-Codd normal form (BCNF)
A special type of third normal form (3NF) in which every determinant is a candidate key

denormilization
A process by which a table is changed from a higher-level normal form to a lower-level normal form, usually to increase processing speed

dependency diagram
A representation of all data dependencies (primary key, partial, or transitive) within a table

determinant
Any attribute in a specific row whose value directly determines other values in that row

first normal form (1NF)
The first stage in the normalization process. It describes a relation depicted in tabular format,, with no repeating groups and a primary key identified. All non-key attributes in the relation are dependent on the primary key

fourth normal form (4NF)
A table that is in 3NF and contains no multiple independent sets of multivalued dependencies

granularity
The level of detail represented by the values stored in a table’s row. Data stored at its lowest level is said to be atomic data

nonprime (or nonkey) attribute
An attribute that is not part of a key

normalization
A process that assigns attributes to entities so that data redundancies are reduced or eliminated

partial dependency
A condition in which an attribute is dependent on only a portion (subset) of the primary key

repeating group
In a relation, a characteristic describing a group of multiple entries of the same type for a single key attribute occurrence

second normal form (2NF)
The second stage in the normalization process, in which a relation is in 1NF and there are no partial dependencies

third normal form (3NF)
A table in 2NF and no nonkey attribute is functionally dependent on another nonkey attribute

transitive dependency
A condition in which an attribute is dependent on another attribute that is not part of the primary key

alias
An alternative name for a column or table in a SQL statement

ALTER TABLE
The SQL command used to make changes to table structure. When the command is followed by a keyword, it adds a column or changes column characteristics

AND
The SQL logical operator used to link multiple conditional expressions in a WHERE or HAVING clause. It requires that all conditional expressions evaluate to true

AVG
A SQL aggregate function that outputs the mean for a specified column or expression

BETWEEN
In SQL, a special comparison operator used to check whether a value is within a range of specified values

Boolean algebra
A branch of mathematics that uses the logical operators OR, AND, and NOT

cascading order sequence
A nested ordering sequence for a set of rows, such as a list in which all last names are alphabetically ordered and, within the last names, all first names are ordered

COMMIT
The SQL command that permanently writes data changes to a database

COUNT
A SQL aggregate function that outputs the number of rows containing not null values for a given column or expression, sometimes used in conjunction with the DISTINCT clause

CREATE INDEX
A SQL command that creates indexes on the basis of a selected attribute or attributes

CREATE TABLE
A SQL command that creates a table’s structures using the characteristics and attributes given

DELETE
A SQL command that allows data rows to be permanently removed from a table

DISTINCT
A SQL clause that produces only a list of values that are different from one another

DROP INDEX
A SQL command used to delete database objects such as tables, views, indexes, and users

DROP TABLE
A SQL command used to delete database objects such as tables, views, indexes, and users

EXISTS
In SQL, a comparison operator that checks whether a subquery returns any rows

FROM
A SQL clause that specifies the table or tables from which data is to be retrieved

GROUP BY
A SQL clause used to create frequency distributions when combined with any of the aggregate functions in a SELECT statement

HAVING
A clause applied to the output of a GROUP BY operation to restrict selected rows

IN
In SQL, a comparison operator used to check whether a value is among a list of specified values

inner query
A query that is embedded or nested inside another query

INSERT
A SQL command that allows the addition of one or more data rows into a table

International Organization for Standardization (ISO)
An organization formed to develop standards for diverse network systems

IS NULL
In SQL, a comparison operator used to check whether an attribute’s text value matches a specified string pattern

MAX
A SQL aggregate funciton that yields the maximum attribute value in a given column

MIN
A SQL aggregate function that yields the minimum attribute value in a given column

nested query
In SQL, a query that is embedded in another query

NOT
A SQL logical operator that negates a given predicate

OR
The SQL logical operator used to link multiple conditional expressions in a where or HAVING clause. It requires only one of the conditional expressions to be true

ORDER BY
A SQL clause that is useful for sorting the output of a SELECT query

ROLLBACK
A SQL command that restores the database table contents to the condition that existed after the last COMMIT statement

rules of precedence
Basic algebraic rules that specify the order in which operations are performed

subquery
A query that is embedded (or nested) inside another query

SUM
A SQL aggregate function that yields the total of all values for given column or expression

UPDATE
A SQL command that allows attribute values to be changed in one or more rows of a table

WHERE
A SQL clause that adds conditional restrictions to a SELECT statement that limit the rows returned by the query

wildcard character
A symbol that can be used as a general substitute

business intelligence (BI)
A comprehensive, cohesive, and integrated set of tools and processes used to capture, collect, integrate, store, and analyze data with the purpose of generating and presenting information to support business decision making

cube cache
In multidimensional OLAP, the shared, reserved memory area where data cubes are held

dashboards
In business intelligence, a web-based system that presents key business performance indicators or information in a single, integrated view with clear and concise graphics

data cube
The multidimensional data structure used to store and manipulate data in a multidimensional DBMS. The location of each data value is based on its x y and z axes. They are static, meaning they must be created before they are used, so they cannot be created by an ad hoc query

data mart
A small, single-subject data warehouse subset that provides decision support to a small group of people

data visualization
Abstracting data to provide information in a visual format that enhances the user’s ability to effective comprehend the meaning of the data

data warehouse
An integrated, subject-oriented, time-variant, nonvolatile collection of data in a specialized database that stores historical and aggregate data in a format that provides support for decision making

decision support system (DSS)
An arrangement of computerized tools used to assist managerial decision making within a business

dimension tables
In a data warehouse, tables used to search, filter, or classify facts within a star schema

drill down
To decompose data into more atomic components – that is, data at lower levels of aggregation

extraction, transformation, and loading (ETL)
In a data warehousing environment, the integrated processes of getting data from original sources into the data warehouse. It includes retrieving data from original data sources, manipulating the data into an appropriate form, and storing the data in the data warehouse

governance
In business intelligence, the methods for controlling and monitoring business health and promoting consistent decision making

key performance indicators (KPIs)
In business intelligence, quantifiable numeric or scale-based measurements that assess a company’s effectiveness or success in reaching strategic and operational goals.

master data management (MDM)
In business intelligence, a collection of concepts, techniques, and processes for the proper identification, definition, and management of data elements within an organization

multidimensional database management systems (MDBMSs)
A database management system that uses proprietary techniques to store data in matrix-like arrays of n dimensions known as cubes

multidimensional online analytical processing (MOLAP)
An extension of online analytical processing to multidimensional database management systems

portals
In terms of business intelligence, a unified, single point of entry for information distribution

relational online analytical processing (ROLAP)
Analytical processing functions that use relational databases and familiar relational query tools to store and analyze multidimensional data

roll up
1) To aggregate data into summarized components, that is, higher levels of aggregation; 2) In SQL, an OLAP extension used with the GROUP BY clause to aggregate data by different dimensions; 3) opposite of drill down

slice and dice
The ability to focus on slices of a data cube (drill down or roll up) to perform a more detailed analysis

sparsity
In multidimensional data analysis, a measurement of the data density held in the data cube

very large databases (VLDBs)
Database that contains huge amounts of data – gigabyte, terabyte, and petabyte ranges are not unusual

Leave a Comment

Scroll to Top