D426 data management foundations oa exam questions and answers
D426 data management foundations oa exam questions
D426 data management foundations oa exam answers
data management – foundations – d426
data management – foundations – d426 reddit
data management – foundations – c175 quizlet
data management – foundations – c175 reddit
data management foundations wgu
3. What characterizes the rules of relational databases?
-They are logical constraints that ensure the data is valid.
-They are based on business policy and specific databases.
-They represent data volumes and rapidly changing data structures.
-They embody the theoretical foundation of the SQL language.
They are logical constraints that ensure the data is valid.
4. Which event will result in an error code and error description when using MySQL Server?
When the server or updates are incorrectly installed
When an SQL statement is entered to locate errors in the database
When an SQL statement is syntactically incorrect
When a Mac OS shortcut code is used on a Windows OS
When an SQL statement is syntactically incorrect
5. Which data type should be used to store whole integer values, such as age?
NUM
VARCHAR
DATE
INT
Data Transaction Language (DTL)
12. Which MySQL operator is the last in the order of operator precedence?
–
OR
NOT
=
13. Which operator is used to compare columns from the left and right tables in MySQL?
<=
+
=
*
14. Which type of join combines two tables without comparing columns?
SELF
EQUIJOIN
OUTER
CROSS
17. What is another name for a subquery?
Outer query
Nested query
Correlated query
Search query
20. Which function is considered an aggregating function in SQL?
TRIM
REPLACE
MIN
SUBSTRING
UPDATE Employee SET Salary = 50000 WHERE ID = 1;
26. What describes elements such as column name and data type?
Alter
Table
Metadata
CRUD
28. Which SQL command uses the correct syntax to select the name and salary columns from the Employee table where the salary is greater than 50000, and sort the results by salary in descending order?
-SELECT name, salary FROM Employee WHERE salary > 50000 ORDER + BY salary DESC;
-SELECT Employee.name, Employee.salary WHERE Employee.salary > 50000 SORT BY Employee.salary DESC;
-SELECT name, salary FROM Employee SORT BY salary DESC WHERE salary > 50000;
-SELECT name, salary FROM Employee WHERE salary > 50000 SORT BY salary ASC;
SELECT name, salary FROM Employee WHERE salary > 50000 ORDER + BY salary DESC;
INSERT INTO table_name (column1, column2) VALUES (value1, value2);
30. Which UPDATE statement uses valid syntax in SQL?
UPDATE table_name column_name= value1 WHERE condition
UPDATE table_name SET column1 = value1, column2 = value2 WHERE condition;
UPDATE SET column1 = value1, column2 = value2 WHERE condition IN table_name;
UPDATE table_name SET column_name = value1
UPDATE table_name SET column1 = value1, column2 = value2 WHERE condition;
31. Which statement used to remove data from temporary tables?
DROP
DELETE
DEFAULT
TRUNCATE
32. Which keyword is a DDL (data definition language) keyword in SQL?
INSERT
UPDATE
DELETE
CREATE
33. Which property is enforced by a primary key field?
Not Null
Null
Duplicate
Numeric
It is a first normal form table.

In the direction the relationship verb is facing

Associative Entity
An associative entity is an element of the entity-relationship model. All relationships for the associative entity should be many.
Attribute
An attribute is a property or characteristic of an entity.
Binary relationship
A binary relationship is a relationship between two entity types.
Candidate Key
Is any column or a combination of columns that can qualify as unique key in database.
Cardinality
Cardinality represents the maximum number of entities that can be involved in a particular relationship.
Cartesian product
Usually the result of a missing join condition or a method of expanding the data of 1 table by the number of rows in the second table.
Cascade Delete
Will delete all records that reference the primary key
column subquery
Returns a single column of one or more values.
data encryption
When data is encrypted, it is changed, bit by bit or character by character, into a form that looks totally garbled. It can and must be reconverted, or decrypted, back to its original form to be of use.
Data normalization
Is a methodology for organizing attributes into tables so that redundancy among the non-key attributes is eliminated.
Data volatility
Describes how often stored data is updated.
Data Volume Assessment
Understanding of how much data will be in a database or a table within a database
Database
A database is a collection of information that is organized so that it can easily be accessed, managed, and updated.
DCL
Data control language is used to control access to data stored in a database.
DDL
Data definition language – involves instructing the DBMS software on what tables will be in the database, what attributes will be in the tables, which attributes will be indexed, and so forth.
Definer
Definer is a MySQL term where AuthID is the same for another DBMS
Denormalization
The act of duplicating data in a database for performance or the preservation of historical information
disaster recovery
Involves rebuilding an entire information system or significant parts of one after a catastrophic natural disaster such as a hurricane, tornado, earthquake, building collapse, or even a major fire.
DML
Data manipulation language refers to the four basic operations that can and must be performed on data stored in any DBMS: data retrieval, data update, insertion of new records, and deletion of existing records.
E-R model
An E-R model is a data model for describing a database in an abstract way.
embedded mode
The SELECT command is embedded within the lines of a higher-level language program and functions as an input or “read” statement for the program.
Entity
An entity is an object or event in our environment that we want to keep track of.
Foreign Key
Is a field (or collection of fields) in one table that uniquely identifies a row of another table.
Index
Used to help a DBMS find data quickly
Inner Join
Shows row that have matches in both tables
Intersection Data
Intersection Data associated with the concatenation of two segments.
Join
Joins 2 tables together
logical view
Is a mapping onto a physical table or tables that allows an end user to access only a specified portion of data.
Modality
Modality represents the minimum number of entity occurrences that can be involved in a relationship.
outer join
Shows rows in one table that have no match in the other table. Two kinds of outer joins are left and right joins.
Primary Key
Uniquely identifies each record in the table.
query mode
The command goes directly to the relational DBMS, which evaluates the query and processes it against the database.
Referential Integrity
Referential integrity is a database concept that ensures that relationships between tables remain consistent.
referential integrity
Enforces rules to guarantee that the foreign key relationship stays intact with no mismatches.
Response time
Is the delay from the time that the Enter Key is pressed to execute a query until the result appears on screen.
Restrict Delete
Will not allow deletes if the primary key is referenced
row subquery
Returns a single row of one or more values.
scalar subquery
is the most restrictive subquery because it produces only a single value
Set-to-Null on Delete
Will set values to null when primary key is deleted
SQL
Is a comprehensive database management language which incorporates DML and DDL
subquery
One SELECT statement is “nested” within another.
table subquery
Returns a table of one or more rows of one or more columns.
Ternary Relationship
A ternary relationship is when three entities participate in the relationship.
Unary Relationships
Unary relationships associate occurrences of an entity type with other occurrences of the same entity type.
One-to-One Binary Relationship
It means that a single occurrence of one entity type can be associated with a single occurrence of the other entity type and vice versa.
Throughput
Is the measure of how many queries from simultaneous users must be satisfied in a given period of time by the application set and the database that it supports.
UNION
To create a result set that combines the results from several queries
Unique Identifier
A unique identifier is any identifier which is guaranteed to be unique among all identifiers used for those objects and for a specific purpose.
Association Rules
Descriptive – discovers links or associations amongst data
Business Intelligence
The transformation of raw data into useful information.
Classification
Predictive – maps data into predefined groups or classes
Clustering
Descriptive – groups similar data together into clusters
Extraction/Transformation/Loading
ETL is the process of extracting raw data and then transforming and loading into a target to be used with Business intelligence.
Regression
Predictive – used to map a data item to a real valued prediction variable
Sequence Discovery
Descriptive – discovers sequential patterns
Summarization Rules
Descriptive – maps data into subsets with associated simple descriptions or generalizations.
Time Series Analysis
Predictive – analysis information over time to predict future data
- in the SELECT clause
It indicates that all attributes of the selected row are to be retrieved.
AND operator
It displays a record if more than one condition is true.
AVG() function
It returns the average value of a numeric column.
BETWEEN operator
It allows you to specify a range of numeric values in a search.
Data definition
It is operationalized with a data definition language (DDL), involves instructing the DBMS software on what tables will be in the database, what attributes will be in the tables, which attributes will be indexed, and so forth.
Data management
There are two aspects of data management: data definition and data manipulation.
DISTINCT operator
It is used to eliminate duplicate rows in a query result.
IN operator
It allows you to specify a list of character strings to be included in a search.
JOIN clause
It is used to combine rows from more than one table, based on a common field between them.
LIKE operator
It allows you to specify partial character strings in a “wildcard” sense.
OR operator
It displays a record if either the first condition OR the second condition is true.
ORDER BY clause
It simply takes the results of a SQL query and orders them by one or more specified attributes.
SELECT command
Data retrieval in SQL is accomplished with the SELECT command.
Structured Query Language
It is a standard language for data management in relational databases, known as Structured Query Language or SQL.
Subquery
When one SELECT statement is “nested” within another in a format, it is known as subquery.
Data
Facts gathered together for analysis
Flat Files
A file having no internal hierarchy
Hashed Files
A file that has been encrypted for security purposes
Heap Files
An unsorted set of records
Information
The transformation of raw data into useful facts
Punch Card
A card that is perforated and can hold commands or data
Structured Data
Information with a high degree of organization
Unstructured Data
Information that does not have structure (such as text)
ETL process
Get the data from the source location.
Map the data from its original form into a data model that is suitable for manipulation at the staging area.
Validate and clean the data.
Apply any transformations to the data that are required before the data sets are loaded into the repository.
Map the data from its staging area model to its loading model.
Move the data set to the repository.
Load the data into the warehouse.
Estimation
It is a process of assigning some continuously valued numeric value to an object.
Description
It is the process of trying to characterize what has been discovered or trying to explain the results of the data mining process.
Affinity grouping
It is a process of evaluating relationships or associations between data elements.
They can serve as a communication tool between the users and designers.
Which of the following is true of business rules?
entity
A(n) _ is anything about which data are to be collected and stored.
XML/Hybrid data model
Oracle 11g is an example of the _.
entity instance
Each row in the relational table is known as a(n)
relational data model
MySQL is an example of the _.
attribute
A(n) _ is the equivalent of a field in a file system.
entity relationship
The _ model was developed to allow designers to use a graphical tool to examine structures rather than describing them with text.
entity relationship
The _ model uses the term connectivity to label the relationship types.
1980s
The object-oriented data model was developed in the _ .
network
In the _ model, the user perceives the database as a collection of records in 1:M relationships, where each record can have more than one parent.
constraints
_ are important because they help to ensure data integrity.
Crow’s Foot notation
In __, a three pronged symbol represents the “many” side of the relationship.
data definition language (DDL)
A(n) _ enables a database administrator to describe schema components.
file system data model
VMS/VSAM is an example of the _.
1960s
The hierarchical data model was developed in the _.
hierarchical
One of the limitations of the _ model is that there is a lack of standards.
entity
A(n) _ represents a particular type of object in the real world.
data manipulation language (DML)
A(n) __ defines the environment in which data can be managed and is used to work with the data in the database.
constraint
A(n) _ is a restriction placed on the data.
They are not based on the relational model.
Which of the following is true of NoSQL databases?
provide fault tolerance.
NoSQL databases:
relationship
A verb associating two nouns in a business rule translates to a(n) _ in the data model.
business rule
A(n) _ is a brief, precise, and unambiguous description of a policy, procedure, or principle within a specific organization.
business rules
From a database point of view, the collection of data becomes meaningful only when it reflects properly defined _ .
It does not have standards
Which of the following is a disadvantage of the hierarchical data model?
relation
The relational model’s foundation is a mathematical concept known as a
relationship
A(n) _ is bidirectional.
1970s
The relational data model was developed in the _.
hierarchical
In the _ model, each parent can have many children, but each child has only one parent.
client node
Which of the following types of HDFS nodes acts as the interface between the user application and the HDFS?
attribute
Each column in a relation represents a(n)
method
In object oriented terms, a(n) __ defines an object’s behavior.
tuple
Each row in a relation is called a(n)
data model
A(n) __ is a relatively simple representation of more complex real-world data structures.
constraints
_ are normally expressed in the form of rules.
many-to-many
Students and classes have a _ relationship.
model
A(n) __‘s main function is to help one understand the complexities of the real world environment.
class
A(n) _ is a collection of similar objects with a shared structure and behavior.
entity
A noun in a business rule translates to a(n) _ in the data model.
schema
A(n) _ is the conceptual organization of an entire database as viewed by a database administrator.
name node
Which of the following types of HDFS nodes stores all the metadata about a file system?
hierarchical
In the _ model, the basic logical structure is represented as an upside-down tree.
object-oriented
The _ data model is said to be a semantic data model.
segment
A(n) _ in a hierarchical model is the equivalent of a record in a file system.
object-oriented
The _ data model uses the concept of inheritance.
unique
When you define a table’s primary key, the DBMS automatically creates a(n) _ index on the primary key column(s) you declared.
data dictionary
A(n) _ provides a detailed description of all tables found within the user/designer-created database.
1:M
The _ relationship is the relational database norm.
candidate
A _ key can be described as a minimal superkey, a superkey without any unnecessary attributes.
M:N
________relationships cannot be implemented as such in the relational model.
bridge
Another name for a composite entity is a(n) _ entity.
1:M
The _ relationship is the “relational model ideal.”
secondary
A _ key is defined as a key that is used strictly for data retrieval purposes.
entity
The CUSTOMER table’s primary key is CUS_CODE. The CUSTOMER primary key column has no null entries, and all entries are unique. This is an example of _ integrity.
system
The _ catalog can be described as a detailed system data dictionary that describes all objects within the database, including data about table names, the table’s creator and creation date, the number of columns in each table, the data type corresponding to each column, index filenames, index creators, authorized users, and access privileges.
M:N
_ relationships can be implemented by creating a new entity in 1:M relationships with the original entities.
predicate
_ logic, used extensively in mathematics, provides a framework in which an assertion (statement of fact) can be verified as either true or false.
index
A(n) _ is an orderly arrangement used to logically access rows in a table.
keys
In the relational model, _ are important because they are used to ensure that each row in a table is uniquely identifiable.
superkey
A _ is any key that uniquely identifies each row.
1:1
If one department chair “a professor “can chair only one department, and one department can have only one department chair. The entities PROFESSOR and DEPARTMENT exhibit a(n) _ relationship.
column
Each table _ represents an attribute.
attributes
An index key can have multiple _ (a composite index).
integrity
Referential _ dictates that the foreign key must contain values that match the primary key in the related table, or must contain null.
keys
In a relational model, _ are also used to establish relationships among tables and to ensure the integrity of the data
attribute
In a relational table, each column has a specific range of values known as the _ domain.
relation
A table is also called a(n) _ because the relational model’s creator, E. F. Codd, used the two terms as synonyms.
foreign key
A _ is the primary key of one table that has been placed into another table to create a common attribute.
candidate
A primary key is a(n) _ key chosen to be the primary means by which rows of a table are uniquely identified.
NOT NULL
The _ constraint can be placed on a column to ensure that every row in the table has a value for that column.
1:1
The _ relationship should be rare in any relational database design.
Julian
Date attributes contain calendar dates stored in a special format known as the _ date format.
unique
A(n) __ index is an index in which the index key can have only one pointer value (row) associated with it.
foreign
The proper use of _ keys is crucial to controlling data redundancy.
table
You can think of a _ as a persistent representation of a logical relation.
RDBMSs
_ enforce integrity rules automatically.
warehousing
Proper data __ design requires carefully defined and controlled data redundancies to function properly.
synonym
In a database context, a(n) _ indicates the use of different names to describe the same attribute.
flags
To avoid nulls, some designers use special codes, known as _ , to indicate the absence of some value.
1:1
One characteristic of generalization hierarchies is that they are implemented as __ relationships.
system catalog
The _ is actually a system-created database whose tables store the user/designer-created database characteristics and contents.
derived
The decision to store _ attributes in database tables depends on the processing requirements and the constraints placed on a particular application.
small circle
In Crow’s Foot notation, an optional relationship between entities is shown by drawing a(n) _ on the side of the optional entity.
one
Ideally, an entity identifier is composed of _ attribute(s).
entity
Knowing the minimum and maximum number of _ occurrences is very helpful at the application software level.
binary
To simplify the conceptual design, most higher-order relationships are decomposed into appropriate equivalent _ relationships whenever possible.
1:1
The Crow’s foot symbol with two vertical parallel lines indicates _ cardinality.
existence
A weak entity must be _
dependent.
dashed line
A derived attribute is indicated in the Chen notation by a _ that connects the attribute and an entity.
degree
A relationship
__ indicates the number of entities or participants associated with a relationship.
optional
Participation is _ if one entity occurrence does not require a corresponding entity occurrence in a particular relationship.
iterative
What process is based on repetition of processes and procedures.
symbols
When the specific cardinalities are not included on the diagram in Crow’s Foot notation, cardinality is implied by the use of _.
single-valued
A person’s Social Security number would be an example of a(n) _ attribute.
minimum
When indicating cardinality, the first value represents the _ number of associated entities.
unary
A _ relationship exists when an association is maintained within a single entity.
primary key
The concept of relationship strength is based on how the _ of a related entity is defined.
optional
The existence of a(n) _ entity indicates that its minimum cardinality is zero.
cardinality
_ expresses the minimum and maximum number of entity occurrences associated with one occurrence of the related entity.
rectangle
The Chen notation identifies a weak entity by using a double-walled entity __
ternary
A _ relationship exists when three entities are associated.
composite
A _ attribute can be further subdivided to yield additional attributes.
UML
The _ notation of entity-relationship modelling can be used for both conceptual and implementation modelling.
required
A(n) __ attribute is an attribute that must have a value.
solid
When using the Crow’s Foot notation, the associative entity is indicated by _ relationship lines between the parents and the associative entity.
information
Complex _ requirements may dictate data transformations, and they may expand the number of entities and attributes within the design.
relationships
Identifying the attributes of entities helps in the better understanding of _ among entities.
M:N
The entity relationship model uses the associative entity to represent a(n) _ relationship between two or more entities.
recursive
If an employee within an EMPLOYEE entity has a relationship with itself, that relationship is known as a _ relationship.
participation
Failure to understand the distinction between mandatory and optional __ in relationships might yield designs in which awkward (and unnecessary) temporary rows (entity instances) must be created just to accommodate the creation of required entities.
derived
A(n) _ attribute need not be physically stored within the database.
attributes
_ are characteristics of entities.
creating a detailed narrative of the organization’s description of operations
The first step in building an entity-relationship diagram (ERD) is _ .
classification
A relationship _ is difficult to establish if only one side of the relationship is known.
strong
If an entity can exist apart from all of its related entities, then it is existence-independent, and it is referred to as a(n) _ entity.
high processing speeds
In organizations that generate large number of transactions, _ are often a top priority in database design.
weak
A _ entity has a primary key that is partially or totally derived from the parent entity in the relationship.
solid
The Crow’s Foot notation depicts the strong relationship with a(n) __ line between the entities.
documentation
_ not only helps database designers to stay on track during the design process, it also enables them to pick up the design thread when the time comes to modify the design.
conceptual
The entity relationship diagram (ERD) represents the _ database as viewed by the end user.
domain
A(n) _ is the set of possible values for a given attribute.
optional to
If Tiny College has some departments that are classified as “research only” and do not offer courses, the COURSE entity of the college database would be _ the DEPARTMENT entity.
strong
A(n) _ relationship is also known as an identifying relationship.
M:N
The conceptual model can handle _ relationships and multivalued attributes.
simple
A _ attribute is one that cannot be subdivided.
identifiers
_ are underlined in an ER diagram.
existence
An entity is said to be _ -dependent if it can exist in the database only when it is associated with another related entity occurrence.
entities
A relationship is an association between _.
primary
Because a partial dependency can exist only when a table’s primary key is composed of several attributes, a table whose __ key consists of only a single attribute is automatically in 2NF once it is in 1NF.
partial dependency
A dependency based on only a part of a composite primary key is called a
normal forms
Normalization works through a series of stages called
prime
An attribute that is part of a key is known as a(n) _ attribute.
first normal form
All relational tables satisfy the _ requirements.
partial dependency
A dependency based on only a part of a composite primary key is known as a _
anomalies
A table that displays data redundancies yields _.
4NF
Some very specialized applications may require normalization beyond the _.
3NF
A table that is in 2NF and contains no transitive dependencies is said to be in _.
repeating group
A relational table must not contain a(n) _.
partial
Dependencies based on only a part of a composite primary key are known as _ dependencies.
single entity
Repeating groups must be eliminated by ensuring that each row defines a
granularity
refers to the level of detail represented by the values stored in a table’s row.
Crow Foot notation ERDs
When designing a new database structure based on the business requirements of the end users, the database designer will construct a data model using a technique such as __.
3NF
For most business transactional databases, we should normalize relations into _.
transitive dependency
A(n) _ exists when there are functional dependencies such that Y is functionally dependent on X, Z is functionally dependent on Y, and X is the primary key.
anomalies
The problem with transitive dependencies is that they still yield data
4NF
In the __ , no row may contain two or more multivalued facts about an entity.
three
If you have three different transitive dependencies, _ different determinant(s) exist.
2NF
A table that is in 1NF and includes no partial dependencies is said to be in _.
1NF
A table that has all key attributes defined, has no repeating groups, and all its attributes are dependent on the primary key is said to be in _.
partial dependency
A(n) _ exists when there are functional dependencies such that XY is functionally dependent on WZ, X is functionally dependent on W, and XY is the primary key.
the level of detail represented by the values in a table’s row
Granularity refers to _.
2NF
Before converting a table into 3NF, it is imperative that the table already be in _.
normalization
is a process to help reduce the likelihood of data anomalies.
DKNF
Of the following normal forms, _ is mostly of theoretical interest.
three
Normalization works through a series of stages called normal forms. For most purposes in business database design, _ stages are as high as you need to go in the normalization process.
prime attribute
Any attribute that is at least part of a key is known as a __.
2NF
It is possible for a table in __ to exhibit transitive dependency, where the primary key may rely on one or more nonprime attributes to functionally determine other nonprime attributes.
atomicity
An attribute that cannot be further subdivided is said to display __.
determinant
Any attribute whose value determines other values within a row is known as a __.
normalization stages
1NF, 2NF, and 3NF are _.
dependency
In a(n) _ diagram, the arrows above the attributes indicate all desirable dependencies.
derived
From a system functionality point of view, _ attribute values can be calculated when they are needed to write reports or invoices.
normalization
The objective of __ is to ensure that each table conforms to the concept of well-formed relations.
3NF
Most designers consider the BCNF as a special case of the _.
make sure that entities are in normal form before table structures are created
When designing a database, you should _.
surrogate
In a real-world environment, changing granularity requirements might dictate changes in primary key selection, and those changes might ultimately require the use of _ keys.
data integrity problems
Data redundancy produces _ .
cannot be further subdivided
An atomic attribute _ .
3NF and the BCNF
When a table contains only one candidate key, _ are considered to be equivalent.
relational
All __ tables satisfy the 1NF requirements.
atomicity
Improving _ leads to more flexible queries.
natural
A(n) _ join links tables by selecting only the rows with common values in their common attribute(s).
diagram
Dependencies can be identified with the help of a dependency _.
repeating group
A _ derives its name from the fact that a collection of multiple entries of the same type can exist for any single key attribute occurrence.
candidate
BCNF can be violated only if the table contains more than one _ key.
determines
Attribute A _ attribute B if all of the rows in the table that agree in value for attribute A also agree in value for attribute B.
BCNF
A table is in __ if every determinant in the table is a candidate key.
1NF
From a structural point of view, 2NF is better than _.
concept of keys
The _ is central to a discussion of normalization.
2NF
From a structural point of view, 3NF is better than _.
partial dependencies
If a table has multiple candidate keys and one of those candidate keys is a composite key, the table can have _ based on this composite candidate key even when the primary key chosen is a single attribute.
LIKE
The special operator used to check whether an attribute value matches a given string pattern is _.
SELECT
To make the output more readable, the SQL standard permits the use of aliases for any column in a __ statement.
WHERE
ANSI-standard SQL allows the use of special operators in conjunction with the __ clause.
restrictions
You can select partial table contents by naming the desired fields and by placing __ on the rows to be included in the output.
logical
SQL allows the use of __ restrictions on its inquiries such as OR, AND, and NOT.
Performing operations within parentheses
According to the rules of precedence, which of the following computations should be completed first?
SELECT
The SQL command that allows a user to list the contents of a table is _.
queries
A database language enables the user to perform complex __ designed to transform the raw data into useful information.
HAVING
The SQL data manipulation command __ restricts the selection of grouped rows based on a condition:
ISO
The ANSI SQL standards are also accepted by the _.
IS NULL
The _ special operator is used to check whether an attribute value is null.
COMMIT
The SQL command that allows a user to permanently save data changes is _.
cascading
A(n) __ order sequence is a multilevel ordered sequence that can be created easily by listing several attributes, separated by commas, after the ORDER BY clause.
SELECT
The _ command, coupled with appropriate search conditions, is an incredibly powerful tool that enables a user to transform data into information.
EXISTS
The special operator used to check whether a subquery returns any rows is _.
alias
A(n) _ is an alternate name given to a column or table in any SQL statement.
Microsoft Access
Some RDBMSs, such as _ , automatically make the necessary conversions to eliminate case sensitivity.
one hundred
The basic SQL vocabulary has fewer than __ words.
INSERT
The SQL command that allows a user to insert rows into a table is _.
query
In the SQL environment, the word _ covers both questions and actions.
BETWEEN
The special operator used to check whether an attribute value is within a range of values is _.
Boolean
A specialty field in mathematics, known as __ algebra, is dedicated to the use of logical operators.
conditional
in SQL, all __ expressions evaluate to true or false.
Business intelligence
_ tools focus on the strategic and tactical use of information.
data warehouse
Business intelligence tools use the _ data as the raw materials for data analytics to generate business knowledge.
Business intelligence
__ functionality ranges from simple data gathering and transformation to very complex data analysis and presentation.
framework
Business intelligence is a __ that allows a business to transform data into information, information into knowledge, and knowledge into wisdom
architecture
Business intelligence (BI) __ is composed of data, people, processes, technology, and the management of such components.
Business intelligence
_ is a term used to describe a comprehensive, cohesive, and integrated set of tools and processes used to capture, collect, integrate, store, and analyze data with the purpose of generating and presenting information used to support business decision making.
DBMS
A(n) __ is a collection of programs that manages the database structure and controls access to the data stored in the database.
single-user
A desktop database is a _ database.
structured data
_ are the result of formatting disorganized data in order to facilitate storage, use and generation of information.
query result set
The response of the DBMS to a query is the __.
unstructured
_ data exist in the format in which they were collected.
DBMSs
_ serve as the intermediary between the user and the database.
enterprise
A(n) __ database is used by an organization and supports many users across many departments.
independence
Data __ exists when it is possible to make changes in the data storage characteristics without affecting an application program’s ability to access data.
unstructured
_ data exist in a format that does not lend itself to processing that yields information.
context
To reveal meaning, information requires _.
metadata
__ describe the data characteristics and the set of relationships that links the data found within the database.
record
A _ is a logically connected set of one or more fields that describes a person, place, or thing.
ad hoc query
A(n) __ is a spur-of-the-moment question.
database system
The term _ refers to an organization of components that define and regulate the collection, storage, management and use of data within a database environment.
data redundancy
exists when the same data are stored unnecessarily at different places.
They enhance the user’s ability to understand the data.
Which of the following is true of spreadsheet applications?
its expected use
The organization of data within folders in a manual file system is determined by _.
operational databases
Data warehouse contains historical data obtained from the_____.
multiuser
A workgroup database is a(n) _ database.
good decision making
Accurate, relevant, and timely information is the key to _.
field
A _ is a character or group of characters that has a specific meaning.
application
A(n) _ might be written by a programmer or it might be created through a DBMS utility program.
performance tuning
_ relates to the activities that make the database execute transactions more efficiently in terms of storage and access speed.
the data always yields consistent results
Data is said to be verifiable if:
raw facts of interest to the end user
End-user data is __.
workgroup
A(n) _ database supports a relatively small number of users (usually fewer than 50) or a specific department within an organization.
data dictionary
The DBMS uses the __ to look up the required data component structures and relationships, thus relieving programmers from having to code such complex relationships in each program.
data inconsistency
_ exists when different and conflicting versions of the same data appear in different places.
In-memory databases
_ refer to a type of database that stores most of its data in RAM rather than in hard disks.
metadata
__ provide a description of the data characteristics and the set of relationships that link the data found within the database.
anomaly
A(n) _ develops when all required changes in the redundant data are not made successfully.
information
_ is the result of revealing the meaning of raw facts.
data
Information is produced by processing _.
analytical
__ databases focus primarily on storing data used to generate information required to make tactical or strategic decisions.
data dictionary
A _ contains at least all of the attribute names and characteristics for each table in the system.
information
__ is the result of processing raw data to reveal its meaning.
knowledge
_ is the body of information and facts about a specific subject.
data redundancy
Which of the following refers to the situation where the same data is stored unnecessarily at different places?
data inconsistency
_ exists when different versions of the same data appear in different places.
data integrity
_ is defined as the condition in which all of the data in the database are consistent with the real-world events and conditions.
collection of files
The database structure in a DBMS is stored as a __.
file
A _ is a collection of related records.
semistructured
An XML database supports the storage and management of _ XML data.
XML
__ is a special language used to represent and manipulate data elements in a textual format.
formatted
Raw data must be properly __ for storage, processing and presentation.
query
A(n) __ is a specific request issued to the DBMS for data manipulation
semistructured
Most data that can be encountered are best classified as _ .
islands of information
The term __ refers to scattered locations storing the same basic data.
metadata
__ is data about data through which the end-user data are integrated and managed
performance tuning
_ relates to activities that make a database operate more efficiently in terms of storage and access speed.
ad hoc query
A “spur-of-the-moment” question
analytical database
A database focused primarily on storing historical data and business metrics used for tactical or strategic decision making
availability
In the context of data security, it refers to the accessibility of data whenever required by authorized users and for authorized purposes
centralized database
A database located at a single site
cloud database
A database that is created and maintained using cloud services, such as Microsoft Azure or Amazon AWS
data
Raw facts, or facts that have not yet been processed to reveal their meaning to the end user
data anomaly
An abnormality of the data in which inconsistent changes have been made to a database (For example, an employee moves, but the address changes is not corrected in all files in the database)
data dependence
A data condition in which data representation and manipulation are dependent on the physical data storage characteristics
data dictionary
A DBMS component that stores metadata. Contains the data definition, as well as their characteristics and relationships
data inconsistency
A condition in which different versions of the same data yield different results
data independence
A condition in which data access is unaffected by changes in the physical data storage characteristics
data integrity
In a relational database, a condition in which the data in the database complies with all entity and referential integrity constraints
data management
A process that focuses on data collection, storage, and retrieval.
data processing (DP) specialist
The person responsible for developing and managing a computerized file processing system
data redundancy
Exists when the same data is stored unnecessarily at different places
database
A shared, integrated computer structure that houses a collection of related data including end-user data (raw facts) and metadata (data about the data)
database design
The process that yields the description of the database structure and determines the database components. The second phase of the database life cycle
database management system (DBMS)
The collection of programs that manages the database structure and controls access to the data stored in the database
database system
An organization of components that defines and regulates the collection, storage, management, and use of data in a database environment
desktop database
A single-user database that runs on a personal computer
discipline-specific database
A database that contains data focused on specific subject areas
enterprise database
The overall company data representation, which provides support for present and expected future needs
field
An alphabetic or numeric character or group of characters that defines a characteristic of a person, place, or thing. (for example, a person’s SSN, address, phone number, and bank balance all constitute fields)
file
A named collection of related records
general-purpose database
A database that contains a wide variety of data used in multiple disciplines
hub
A warehouse of data packets housed in a central location on a local area network. It contains multiple ports that copy the data in the data packets to make it accessible to selected or all segments of the network
information
The result of processing raw data to reveal its meaning. It consists of transformed data and facilitates decision making
islands of information
In the old file system environment, pools of independent, often duplicated, and inconsistent data created and managed by different departments
knowledge
The body of information and facts about a specific subject. It implies familiarity, awareness, and understanding of information as it applies to an environment
logical data format
The way a person views data within the context of a problem domain
metadata
Data about data
multiuser database
A database that supports multiple concurrent users
NoSQL
A new generation of database management systems that is not based on the traditional relational database model. It stores structured and unstructured data in efficient ways.
online analytical processing (OLAP)
Decision support system (DSS) tools that use multidimensional data analysis techniques. It creates an advanced data analysis environment that supports decision making, business modeling, and operations research
operational database
A database designed primarily to support a company’s day-to-day operations. (Also known as a transactional database, OLTP database, or production database)
performance tuning
Activities that make a database perform more efficiently in terms of storage and access speed
physical data format
The way a computer “sees” (stores) data
query
A question or task asked by an end user of a database in the form of SQL code. A specific request for data manipulation issued by the end user or the application to the DBMS
query language
A nonprocedural language that is used by a DBMS to manipulate its data (such as SQL)
query result set
The collection of data rows returned by a query
record
A collection of related (logically connected) fields
role
In Oracle, a named collection of database access privileges that authorize a user to connect to a database and use its system resources
router
1) An intelligent devise used to connect dissimilar networks
2) Hardware/software equipment that connects multiple and diverse networks
semistructured data
Data that has already been processed to some extent
single-user database
A database that supports only one user at a time
social media
Web and mobile technologies that enable “anywhere, anytime, always on” human interactions
structural dependence
A data characteristic in which a change in the database schema affects data access, thus requiring changes in all access programs
structural independence
A data characteristic in which changes in the database schema do not affect data access
structured data
Data that has been formatted to facilitate storage, use, and information generation
structured query language (SQL)
A powerful and flexible relational database language composed of commands that enable users to create database and table structures, perform various types of data manipulation and data administration, and query the database to extract useful information
unstructured data
Data that exists in its original, raw state; that is, in the format in which it was collected
website
Refers to the web server and the collection of web pages stored on the local hard disk of the server computer
workgroup database
A multiuser database that usually supports fewer than 50 users or is used for a specific department in an organization
world wide web (WWW or the web)
Worldwide network collection of specially formatted and interconnected documents known as web pages
XML database
A database system that stores and manages semistructured XML data
file structure
The format that data is arranged and stored
flat files
Files having no internal hierarchy
heap files
Files containing an unsorted set of records that are uniquely identified by a record ID which allows them to be inserted or deleted using that ID
index files
Files that store a list of lookup field values from a data file
hashed files
Files that are encrypted using hash functions that convert data consisting of various formats into numeric values
data management
The discipline that focuses on the proper generation, storage, and retrieval of data
distributed database
Supports data across multiple sites
general purpose database
Contains a wide variety of data used in multiple disciplines
business intelligence
A comprehensive approach to capture and process business data with the purpose of generating information to support business decision making
semi-structured data
Data that is already processed to some extent
extensible markup language (XML)
Used to represent and manipulate data elements in a textual format
data dictionary management
Removes structural and data dependency and provides the user with data abstraction. The DBMS uses this function to look up the required data component structures and relationships
data transformation and presentation
Exists to transform any data entered into required data structures. By using this function, the DBMS can determine the difference between logical and physical data formats
security management
Sets rules that determine specific users that are allowed to access the database. This function also sets restraints on what specific data any user can see or manage
data modeling
The process of creating a specific data model for a determined problem domain
problem domain
A clearly defined area within the real-world environment, with a well-defined scope and boundaries that will be systematically addressed
data model
A representation, usually graphic, of a complex “real-world” data structure. Used in the database design phase of the database life cycle. It represents data structures and their characteristics, relations, constraints, transformations, and other constructs with the purpose of supporting a specific problem domain. A blueprint with all the instructions to build a database that will meet all end-user requirements.
basic building blocks of all data models
Constraints, attributes, relationships, entities (data models CARE!)
entity
A person, place, thing, concept, or event for which data can be sored
attribute
A characteristic of an entity or object. It has a name and a data type.
relationship
An association between entities
constraint
A restriction placed on data, usually expressed in the form of rules
one-to-one (1:) relationship
Exists between a pair of tables when a single record in the first table is related to only one record in the second table, and a single record in the second table is related to only one record in the first table
one-to-many (1:M) relationship
Exists between a pair of tables when a single record in the first single record in the first table can be related to one or more records in the second table, but a single record in the second table can be related to only one record in the first table.
many-to-many (M:M) relationship
Exists between a pair of tables when a single record in the first table can be related to one or more records in the second table and a single record in the second table can be related to one or more records in the first table.
business rule
A description of a policy, procedure, or principle within an organization.
hierarchical model
An early database model whose basic concepts and characteristics formed the basis for subsequent database development. It is based on an upside-down tree structure in which each record is called a segment. The top record is the root segment. Each segment has a 1:M relationship to the segment directly below it.
segment
In the hierarchical data model, the equivalent of a file system’s record type.
network model
An early data model that represented data as a collection of record types in 1:M relationships.
schema
A logical grouping of database objects, such as tables, indexes, views, and queries that are related to each other.
subschema
The portion of the database that interacts with application programs
data manipulation language (DML)
The set of commands that allows an end user to manipulate the data in the database, such as SELECT, INSERT, UPDATE, DELETE etc.
data definition language (DDL)
The language that allows a database administrator to define the database structure, schema, and subschema
relational model
Based on mathematical set theory and represents data as independent relations. Each relation (table) is related to each other through the sharing of common entity characteristics. Developed by E. F. Codd of IBM in 1970
relation or table
A two-dimensional structure composed of intersecting rows (entities) and columns (attributes).
tuple
A row in the relational model
relational database management system (RDBMS)
A collection of programs that manages a relational database
SQL-based relational database parts
- End-user interface
- A collection of tables stored in the database
- SQL engine
entity relationship (ER) model (ERM)
A data model that describes relationships (1:1, 1:M, and M:N) among entities at the conceptual level with the help of ER diagrams.
entity relationship diagram (ERD)
A diagram that depicts an entity relationship model’s entities, attributes, and relations
entity instance or entity occurance
A row in a relational table
entity set
A collection of like entities
object-oriented data model (OODM)
A data model whose basic modeling structure is an object
object
An abstract representation of a real world entity that has a unique identity, embedded properties, and the ability to interact with other objects and itself
object-oriented database management system (OODBMS)
Data management software used to manage data in an object-oriented database model
relational database
A collection of relations that contain the data describing a particular business environment.
class
A collection of similar objects with shared structure (attributes) and behavior (methods).
method
In the object-oriented data model, a named set of instructions to perform an action.
class hierarchy
The organization of classes in a hierarchical tree in which each parent class is a superclass and each child class is a subclass.
inheritance
In the object-oriented data model, the ability of an object to inherit the data structure and methods of the classes above it in the class hierarchy
unified modeling language (UML)
A language based on object-oriented concepts that provides tools such as diagrams and symbols to graphically model a system.
extended relational data model (ERDM)
A model that includes the object-oriented model’s best features in an inherently simpler relational database structural environment
object/relational database management system (O/R DBMS)
A DBMS based on the extended relational model (ERDM).
big data
A movement to find new and better ways to manage large amounts of web-generated data and derive business insight from it, while simultaneously providing high performance and scalability at a reasonable cost
volume, velocity, and variety (3 Vs)
Three basic characteristics of big data databases.
volume
In big data, this refers to the amounts of data being stored
velocity
In big data, this refers to the speed in which data grows and the need to process this data quickly in order to generate information and insight.
variety
In big data, this refers to the fact that the data being collected comes in multiple different data formats.
Hadoop
A Java based, open source, high speed, fault-tolerant distributed storage and computational framework for big data.
Hadoop Distributed File System (HDFS)
A highly distributed, fault-tolerant file storage system designed to manage large amounts of data a high speeds.
name node
One of three types of nodes used in the Hadoop Distributed File System (HDFS). It stores all the metadata about the file system.
data node
One of the three types of nodes used in the Hadoop Distributed File System (HDFS). It stores fixed-size data blocks.
client node
One of the three types of nodes used in the Hadoop Distributed File System (HDFS). It acts as the interface between the user application and the HDFS.
MapReduce
An open-source application programming interface (API) that provides fast data analytics services used for big data.
key-value
A data model based on a structure composed of two data elements: a key and a value, in which every key has a corresponding value or set of values. Also called the associative or attribute-value data model
sparse data
A case in which the number of table attributes is very large but the number of actual data instances is low
eventual consistency
A model for database consistency in which updates to the database will propagate through the system so that all data copies will be consistent eventually.
3 vs
Three basic characteristics of big data databases: volume, velocity, and variety
abstract data type (ADT)
Data type that describes a set of similar objects with shared and encapsulated data representation and methods. It is generally used to describe complex objects
American National Standards Institute (ANSI)
The group that accepted the DBTG recommendations and augmented database standards in 17=975 through its SPARC committee
application programming interface (API)
Software through which programmers interact with middleware. It allows the use of generic SQL code, thereby allowing client processes to be database server-independent
balancing
Ensuring that the processing load is distributed evenly among servers
Chen notation
Same as entity relationship (ER) model
class diagram notation
The set of symbols used in the creation of class diagrams in UML object mdeling
class diagrams
A diagram used to represent data and their relationships in UML object notation
client
Any process that requests specific services from server processes in a client/server environment
complex object
An object formed by several different objects in complex relationships
conceptual model
The output of the conceptual design process. It provides a global view of an entire database and describes the main data objects, avoiding details
conceptual schema
A representation of the conceptual model, usually expressed graphically
connectivity
The type of relationship between entities. Classifications include 1:1, 1:M, and M:N
Crow’s Foot notation
A representation of the entity relationship diagram that uses a three-pronged symbol to represent the “many” sides of the relationship
entity instance
A row in a relational table. Also known as entity occurrence
entity occurrence
A row in a relational table. Also known as entity instance
entity relationship model (ERM)
A data model that describes relationships (1:1, 1:M, and M:N) among entities at the conceptual level with the help of ER diagrams. Developed by Peter Chen
external model
The application programmer’s view of the data environment. Given its business focus, this works with a data subset of the global database schema
external schema
The specific representation of an external view; the end user’s view of the data environment
Hadoop distributed file system (HDFS)
A highly distributed, fault-tolerant file storage system designed to manage large amounts of data at high speeds
hardware independence
A condition in which a model does not depend on the hardware used in the model’s implementation. Changes in the hardware will have no effect on the database design at the conceptual level
internal model
In database modeling, a level of data abstraction that adapts the conceptual model to a specific DBMS model for implementation. It is the representation of a database as “seen” by the DBMS
internal schema
A representation of an internal model using the database constructs supported by the chosen database
internet
A global network of computers connected together through a standard network protocol known as Transmission Control Protocol/Internet Protocol (TCP/IP)
relation
A logical construct perceived to be a two dimensional structure composed of intersecting rows (entities) and columns (attributes) that represents an entity set in the relational model
relational diagram
A graphical representation of a relational database’s entities, the attributes within those entities, and the relationships among the entities
superclass
In a class hierarchy, this is the more general classification from which the subclasses inherit data structures and behaviors
table
A logical construct perceived to be a two dimensional structure composed of intersecting rows (entities) and columns (attributes) that represents an entity set in the relational model
versioning
A property of an OODBMS that allows the database to keep track of the different transformations performed on an object
candidate key
A minimal superkey; a key that does not contain a subset of attributes that is itself a superkey
composite entity
An entity designed to transform a M:N relationship into two 1:M relationships. It’s primary key comprises at least the primary key of the entities that it connects.
composite key
A multiple-attribute key
dependent
An attribute whose value is determined by another attribute
determination
The role of a key. In the context of a database table, the statement “A determines B” indicates that knowing the value of attribute A means that the value of B can be looked up
domain
In data modeling,the construct used to organize and describe an attribute’s set of possible values
entity integrity
The property of a relational table that guarantees each entity has a unique value in a primary key and that the key has no null values
flags
Special codes implemented by designers to trigger a required response, alert end users to specified conditions, or encode values. They may be used to prevent nulls by bringing attention to the absence of a value in a table
foreign key (FK)
An attribute or attributes in one table whose values must match the primary key in another table or whose values must be null
full functional dependence
A condition in which an attribute is functionally dependent on a composite key but not on any subset of the key
functional dependence
One attribute determines another
index
An ordered array of values and row IDs generally used to speed up and facilitate data retrieval
key
One or more attributes that determine other attributes
key attribute
The attributes that form a primary key
linking table
In the relational model, a table that implements a M:N relationship
null
The absence of an attribute value
predicate logic
Used extensively in mathematics to provide a framework in which a fact can be verified as true or false
primary key
In the relational model,an identifier composed of one or more attributes that uniquely identifies a row
prime attribute
A key attribute
referential integrity
A condition by which a dependent table’s foreign key must have either a null entry or a matching entry in the related table
secondary key
A key used to strictly for data retrieval purposes
set theory
A part of a mathematical science that deals with groups of things and is used as the basis for data manipulation in the relational model
superkey
An attribute or attributes that uniquely identify each entity in a table
unique index
An index in which the index key can have only one associated pointer value or row
binary relationship
An er term for an association (relationship) between two entities
cardinality
A property that assigns a specific value to connectivity and expresses the range of allowed entity occurrences associated with a single occurrence of the related entity
composite attribute
An attribute that can be further subdivided to yield additional attributes. For example, a phone number may be divided into an area code (630) and exchange number (421), and a four-digit code (9009)
composite identifier
In ER modeling, a key composed of more than one attribute
derived attribute
An attribute that does not physically exist within the entity and is derived via an algorithm. For example, the age attribute might be derived by subtracting the birth date from the current date
existence-dependent
A property of an entity whose existence depends on one or more other entities.
existence-independent
A property of an entity that can exist apart from one or more related entities
identifiers
One or more attributes that uniquely identify each entity instance
iterative process
A process based on repetition of steps and procedures
mandatory participation
A relationship in which one entity occurrence must have a corresponding occurrence in another entity
multivalued attributes
An attribute that can have many values for a single entity occurrence
non-identifying relationship
A relationship in which the primary key of the related entity does not contain a primary key component of the parent entity
optional attribute
In ER modeling, an attribute that does not require a value; therefore it can be left empty
optional participation
In ER modeling, a condition in which one entity occurrence does not require a corresponding entity occurrence in a particular relationship
participants
An ER term for entities that participate in a relationship
recursive relationshp
A relationship found within a single entity type
relational schema
The organization of a relational database as described by the database administrator
relationship degree
The number of entities or participants associated with a relationship It can be unary, binary, ternary, or higher
required attribute
In ER modeling, an attribute that must have a value. It cannot be left empty
simple attribute
An attribute that cannot be subdivided into meaningful components
single-valued attribute
An attribute that can have only one value
strong (identifying) relationship
A relationship that occurs when two entities are existence-dependent; from a database design perspective, this relationship exists whenever the primary key of the related entity contains the primary key of the parent entity
strong entity
An entity that is existence-independent, that is, it can exist apart from all of its related entities
ternary relationship
An ER term used to describe an association (relationship) between three entities
unary relationship
An ER term used to describe an association within an entity
weak entity
An entity that displays existence dependence and inherits the primary key of its parent eneity
weak relationship
A relationship in which the primary key of the related entity does not contain a primary key component of the parent entity
master key
atomic attribute
An attribute that cannot be further subdivided to produce meaningful components
atomicity
The transaction property that requires all parts of a transaction to be treated as a single, indivisible, logical unit of work. All parts of a transaction must be completed or the entire transaction is aborted
Boyce-Codd normal form (BCNF)
A special type of third normal form (3NF) in which every determinant is a candidate key
denormilization
A process by which a table is changed from a higher-level normal form to a lower-level normal form, usually to increase processing speed
dependency diagram
A representation of all data dependencies (primary key, partial, or transitive) within a table
determinant
Any attribute in a specific row whose value directly determines other values in that row
first normal form (1NF)
The first stage in the normalization process. It describes a relation depicted in tabular format,, with no repeating groups and a primary key identified. All non-key attributes in the relation are dependent on the primary key
fourth normal form (4NF)
A table that is in 3NF and contains no multiple independent sets of multivalued dependencies
granularity
The level of detail represented by the values stored in a table’s row. Data stored at its lowest level is said to be atomic data
nonprime (or nonkey) attribute
An attribute that is not part of a key
normalization
A process that assigns attributes to entities so that data redundancies are reduced or eliminated
partial dependency
A condition in which an attribute is dependent on only a portion (subset) of the primary key
repeating group
In a relation, a characteristic describing a group of multiple entries of the same type for a single key attribute occurrence
second normal form (2NF)
The second stage in the normalization process, in which a relation is in 1NF and there are no partial dependencies
third normal form (3NF)
A table in 2NF and no nonkey attribute is functionally dependent on another nonkey attribute
transitive dependency
A condition in which an attribute is dependent on another attribute that is not part of the primary key
alias
An alternative name for a column or table in a SQL statement
ALTER TABLE
The SQL command used to make changes to table structure. When the command is followed by a keyword, it adds a column or changes column characteristics
AND
The SQL logical operator used to link multiple conditional expressions in a WHERE or HAVING clause. It requires that all conditional expressions evaluate to true
AVG
A SQL aggregate function that outputs the mean for a specified column or expression
BETWEEN
In SQL, a special comparison operator used to check whether a value is within a range of specified values
Boolean algebra
A branch of mathematics that uses the logical operators OR, AND, and NOT
cascading order sequence
A nested ordering sequence for a set of rows, such as a list in which all last names are alphabetically ordered and, within the last names, all first names are ordered
COMMIT
The SQL command that permanently writes data changes to a database
COUNT
A SQL aggregate function that outputs the number of rows containing not null values for a given column or expression, sometimes used in conjunction with the DISTINCT clause
CREATE INDEX
A SQL command that creates indexes on the basis of a selected attribute or attributes
CREATE TABLE
A SQL command that creates a table’s structures using the characteristics and attributes given
DELETE
A SQL command that allows data rows to be permanently removed from a table
DISTINCT
A SQL clause that produces only a list of values that are different from one another
DROP INDEX
A SQL command used to delete database objects such as tables, views, indexes, and users
DROP TABLE
A SQL command used to delete database objects such as tables, views, indexes, and users
EXISTS
In SQL, a comparison operator that checks whether a subquery returns any rows
FROM
A SQL clause that specifies the table or tables from which data is to be retrieved
GROUP BY
A SQL clause used to create frequency distributions when combined with any of the aggregate functions in a SELECT statement
HAVING
A clause applied to the output of a GROUP BY operation to restrict selected rows
IN
In SQL, a comparison operator used to check whether a value is among a list of specified values
inner query
A query that is embedded or nested inside another query
INSERT
A SQL command that allows the addition of one or more data rows into a table
International Organization for Standardization (ISO)
An organization formed to develop standards for diverse network systems
IS NULL
In SQL, a comparison operator used to check whether an attribute’s text value matches a specified string pattern
MAX
A SQL aggregate funciton that yields the maximum attribute value in a given column
MIN
A SQL aggregate function that yields the minimum attribute value in a given column
nested query
In SQL, a query that is embedded in another query
NOT
A SQL logical operator that negates a given predicate
OR
The SQL logical operator used to link multiple conditional expressions in a where or HAVING clause. It requires only one of the conditional expressions to be true
ORDER BY
A SQL clause that is useful for sorting the output of a SELECT query
ROLLBACK
A SQL command that restores the database table contents to the condition that existed after the last COMMIT statement
rules of precedence
Basic algebraic rules that specify the order in which operations are performed
subquery
A query that is embedded (or nested) inside another query
SUM
A SQL aggregate function that yields the total of all values for given column or expression
UPDATE
A SQL command that allows attribute values to be changed in one or more rows of a table
WHERE
A SQL clause that adds conditional restrictions to a SELECT statement that limit the rows returned by the query
wildcard character
A symbol that can be used as a general substitute
business intelligence (BI)
A comprehensive, cohesive, and integrated set of tools and processes used to capture, collect, integrate, store, and analyze data with the purpose of generating and presenting information to support business decision making
cube cache
In multidimensional OLAP, the shared, reserved memory area where data cubes are held
dashboards
In business intelligence, a web-based system that presents key business performance indicators or information in a single, integrated view with clear and concise graphics
data cube
The multidimensional data structure used to store and manipulate data in a multidimensional DBMS. The location of each data value is based on its x y and z axes. They are static, meaning they must be created before they are used, so they cannot be created by an ad hoc query
data mart
A small, single-subject data warehouse subset that provides decision support to a small group of people
data visualization
Abstracting data to provide information in a visual format that enhances the user’s ability to effective comprehend the meaning of the data
data warehouse
An integrated, subject-oriented, time-variant, nonvolatile collection of data in a specialized database that stores historical and aggregate data in a format that provides support for decision making
decision support system (DSS)
An arrangement of computerized tools used to assist managerial decision making within a business
dimension tables
In a data warehouse, tables used to search, filter, or classify facts within a star schema
drill down
To decompose data into more atomic components – that is, data at lower levels of aggregation
extraction, transformation, and loading (ETL)
In a data warehousing environment, the integrated processes of getting data from original sources into the data warehouse. It includes retrieving data from original data sources, manipulating the data into an appropriate form, and storing the data in the data warehouse
governance
In business intelligence, the methods for controlling and monitoring business health and promoting consistent decision making
key performance indicators (KPIs)
In business intelligence, quantifiable numeric or scale-based measurements that assess a company’s effectiveness or success in reaching strategic and operational goals.
master data management (MDM)
In business intelligence, a collection of concepts, techniques, and processes for the proper identification, definition, and management of data elements within an organization
multidimensional database management systems (MDBMSs)
A database management system that uses proprietary techniques to store data in matrix-like arrays of n dimensions known as cubes
multidimensional online analytical processing (MOLAP)
An extension of online analytical processing to multidimensional database management systems
portals
In terms of business intelligence, a unified, single point of entry for information distribution
relational online analytical processing (ROLAP)
Analytical processing functions that use relational databases and familiar relational query tools to store and analyze multidimensional data
roll up
1) To aggregate data into summarized components, that is, higher levels of aggregation; 2) In SQL, an OLAP extension used with the GROUP BY clause to aggregate data by different dimensions; 3) opposite of drill down
slice and dice
The ability to focus on slices of a data cube (drill down or roll up) to perform a more detailed analysis
sparsity
In multidimensional data analysis, a measurement of the data density held in the data cube
very large databases (VLDBs)
Database that contains huge amounts of data – gigabyte, terabyte, and petabyte ranges are not unusual