{"id":109894,"date":"2023-07-25T10:34:18","date_gmt":"2023-07-25T10:34:18","guid":{"rendered":"https:\/\/learnexams.com\/blog\/?p=109894"},"modified":"2023-07-25T10:34:21","modified_gmt":"2023-07-25T10:34:21","slug":"hcad-750-exam-5-louisiana-state-university-questions-answers","status":"publish","type":"post","link":"https:\/\/www.learnexams.com\/blog\/2023\/07\/25\/hcad-750-exam-5-louisiana-state-university-questions-answers\/","title":{"rendered":"HCAD 750 EXAM 5; Louisiana State University\/Questions &amp; Answers"},"content":{"rendered":"\n<p>the process of finding correlations or patterns among the data<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>facilitates data exploration<\/li>\n\n\n\n<li>extract useful knowledge hidden in data<br>data mining<\/li>\n<\/ul>\n\n\n\n<p>using patient data for any purpose beyond providing care for the individual patient brings with it some tricky issues regarding privacy, and keeping the information from falling into the wrong hands. There are significant legal issues related to the use of patient data in data mining efforts, specifically related to the de-identification, aggregation, and storage of the data. Failing to take the appropriate steps when using personal health data as a tool for population health could lead to serious consequences<br>HIPPA in relation to Data mining<\/p>\n\n\n\n<p>-perform induction on the current data in order to make predictions.<br>Predictive Data Mining<\/p>\n\n\n\n<p>-ability for a device, machine, etc. to be able to take in numerous types of data and learn from the data in order to produce knowledge.<br>Meta-learning<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>investigates how computers can learn based on data<\/li>\n\n\n\n<li>automatically learn to recognize complex patterns and make intelligent decisions on their own based on the data<br>Machine Learning<\/li>\n<\/ul>\n\n\n\n<p>refers to the process of reducing the inputs for processing and analysis, or finding the most meaningful inputs.<br>Feature Selection<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>be applied to obtain a compressed representation of the data set that is much smaller in volume, yet maintains the integrity of the original data.<\/li>\n\n\n\n<li>used when the data selected is too complex or huge<br>Data reduction<\/li>\n<\/ul>\n\n\n\n<p>to request or seek out additional information on a specific subject. Makes the data more detailed.<br>drill down<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>is an ensemble of models combined sequentially.<\/li>\n\n\n\n<li>can be used to classify data<\/li>\n\n\n\n<li>get a meta-learning device, stack the data in the device, the base learner is combined and produces the data information needed.<br>Stacking<\/li>\n\n\n\n<li>each of the data classifications are weighted.<\/li>\n\n\n\n<li>once the system learns, it is able to continuously update and learn which ones are incorrect, and the weight shifts to reflect the accuracy<br>Boosting<\/li>\n\n\n\n<li>method used to increase accuracy with data mining<\/li>\n\n\n\n<li>majority vote; more times a classification is picked, the more reliable the data.<\/li>\n\n\n\n<li>algorithm creates an ensemble of models for learning scheme where each model gives an equally weighted prediction<br>Bagging (Bootstrap Aggregating)<\/li>\n<\/ul>\n\n\n\n<p>DMAIC steps: define, measure, analyze, improve, and control<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>can explain why data behaves a certain way<\/li>\n\n\n\n<li>not necessarily a data mining technique, but a model used to give more of answer to &#8220;why&#8221; and &#8220;how&#8221; in regard to data information.<\/li>\n\n\n\n<li>adds additional steps to mining that yields better results<br>Six Sigma<\/li>\n<\/ul>\n\n\n\n<p>is a term that describes the large volume of data &#8211; both structured and unstructured &#8211; that inundates a business on a day-to-day basis. But it&#8217;s not the amount of data that&#8217;s important. It&#8217;s what organizations do with the data that matters. Big data can be analyzed for insights that lead to better decisions and strategic business moves.<br>Big Data<\/p>\n\n\n\n<p>how we make sense of the data by converting them from their raw form to a more informative one<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>sometimes known as model building or pattern id<\/li>\n\n\n\n<li>yields a highly predictive, consistent pattern identifying model<br>-pattern discovery is a complex phase of data mining<br>Exploratory data analysis (EDA)<\/li>\n<\/ul>\n\n\n\n<p>due to a need for standardized data mining techniques, this concept and tool was developed.<br>Sample &#8211; selecting the data<br>Explore &#8211; looking for the relationship between variables in data<br>Modify &#8211; methods to select, create, and transform variables in preparation for data modeling<br>Model &#8211; applying various modeling techniques to gain the desired outcome<br>Assess &#8211; looks for reliability and usefulness<br>SEMMA<\/p>\n\n\n\n<p>Cross Industry Standard Process for Data Mining<br>six steps: business understanding, data understanding, data preparation, modeling, evaluation, and deployment<br>most projects move back and forth between steps as necessary<br>\u00b7CRISP-DM<\/p>\n\n\n\n<p>&#8220;this data comes from everywhere: sensors used to gather climate information, posts to social media sites, digital pictures and videos, purchase transaction records, and cell phone GPS signals.&#8221;<br>big data<\/p>\n\n\n\n<p>producing a solution that generates useful forecasting:<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li>problem identification<\/li>\n\n\n\n<li>exploration of the data<\/li>\n\n\n\n<li>pattern discovery<\/li>\n\n\n\n<li>knowledge deployment &#8211; application to new data to forecast predictions<br>4 phases of data mining<\/li>\n<\/ol>\n\n\n\n<p>transform the repositories of big data into comprehensible knowledge that is useful for guiding their practice and facilitating interdisciplinary research<br>Knowledge Discovery and Research<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>data mining method for analyzing outcomes and service use<\/li>\n\n\n\n<li>used to classify and predict an outcome<br>Classification and Regression Trees (CART)<\/li>\n<\/ul>\n\n\n\n<ol class=\"wp-block-list\">\n<li>enhance business aspects<\/li>\n\n\n\n<li>help to improve patient care<br>Benifits of KDD<\/li>\n\n\n\n<li>dependent on the use of private health information<\/li>\n\n\n\n<li>insure data is de-identified and confidentiality maintained<\/li>\n\n\n\n<li>follow changes and specific requirements for compliance with HIPPA laws<br>ethics of data mining<\/li>\n<\/ol>\n\n\n\n<p>thoughtful, planned activity that expands or refines knowledge. the purpose of research is to create generalized knowledge.<br>research<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li>manipulation of treatment<\/li>\n\n\n\n<li>random assignment to the group<br>difference between quasi-experimental research and experimental research<\/li>\n<\/ol>\n\n\n\n<ul class=\"wp-block-list\">\n<li>the statistical analysis of a large collection of results from individual studies for the purpose of integrating findings<\/li>\n\n\n\n<li>the integrative analysis of findings from many studies that examined the same question<br>meta-analysis<\/li>\n\n\n\n<li>set of connected input\/output units i which each connection has a weight associated with it<br>AKA connectionist learning &#8211; connection between units<br>neural networks<\/li>\n<\/ul>\n\n\n\n<p>a flowchart-like structure and a decision support tool that uses a model of decisions and their possible consequences, including chance event outcomes, resource costs, and utility.<\/p>\n\n\n\n<p>Consists of three types of nodes: decision nodes, chance nodes, end nodes<br>decision trees<\/p>\n\n\n\n<p>identifies patterns from if\/then statements. Statistical significance tests are used on the data<br>Rule Induction<\/p>\n\n\n\n<p>a process or set of rules to be followed in calculations or other problem-solving operations, especially by a computer.<br>Algorithm<\/p>\n\n\n\n<p>classifiers that use distance based comparisons that intrinsically assign equal weight to each attribute<br>nearest neighbor<\/p>\n\n\n\n<p>discovery by computer of new, previously unknown information, by automatically extracting information from written resources<br>text mining<\/p>\n\n\n\n<p>A method of querying and reporting that takes data from standard relational databases, calculates and summarizes the data, and then stores the data in a special database called a data cube.<br>Online Analytical Processing (OLAP)<\/p>\n\n\n\n<p>select on-screen specific data points and identify their characteristics or to examine their effects on relations between variables<br>-used during EDA<br>brushing<\/p>\n\n\n\n<p>analysis of original research data by the researchers who collected them<br>primary analysis<\/p>\n\n\n\n<p>the process of detecting, diagnosing, and editing faulty data<br>data cleansing<\/p>\n\n\n\n<p>refers to the ability to access and extract data from any data source<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>access to data depends on the type of data and their location<\/li>\n\n\n\n<li>can range from totally uncontrolled to highly protected<br>Data Access<\/li>\n<\/ul>\n\n\n\n<p>pertains to the handling and maintenance of data so that the data are not divulged to others without the research participants permission<br>data confidentiality<\/p>\n\n\n\n<p>the analysis of the original work of another person or organization<br>secondary analysis<\/p>\n\n\n\n<p>pertains to data that have no identifiers linked to them and cannot be traced back to the research participant<br>data anonymity<\/p>\n\n\n\n<p>US federal policy that specifies ethics regulations for human subjects research<br>Common Rule<\/p>\n\n\n\n<p>data mining<br>the process of analyzing data to extract information not offered by the raw data alone<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>facilitates data exploration<\/li>\n\n\n\n<li>looks at the data from different vantage points<\/li>\n\n\n\n<li>brings new insights to the data set<\/li>\n<\/ul>\n\n\n\n<p>HIPPA in relation to Data mining<br>using patient data for any purpose beyond providing care for the individual patient brings with it some tricky issues regarding privacy, and keeping the information from falling into the wrong hands. There are significant legal issues related to the use of patient data in data mining efforts, specifically related to the de-identification, aggregation, and storage of the data. Failing to take the appropriate steps when using personal health data as a tool for population health could lead to serious consequences<\/p>\n\n\n\n<p>Predictive Data Mining<br>is data mining that is done for the purpose of using business intelligence or other data to forecast or predict trends. This type of data mining can help business leaders make better decisions and can add value to the efforts of the analytics team.<\/p>\n\n\n\n<p>Meta-learning<br>A subfield of machine learning where automatic learning algorithms are applied to Metadata about machine learning experiments. As of 2017, the term had not found a standard interpretation, however, the main goal is to use such metadata to understand how automatic learning can become flexible in solving learning problems<\/p>\n\n\n\n<p>Machine Learning<br>is a branch of artificial intelligence devoted to guiding robots in their understanding of human behavior. Scientists and engineers hope machine learning will eventually help machines make unguided choices by independently interpreting input from the world around them.<\/p>\n\n\n\n<p>Feature Selection<br>refers to the process of reducing the inputs for processing and analysis, or of finding the most meaningful inputs. A related term, feature engineering (or feature extraction), refers to the process of extracting useful information or features from existing data.<\/p>\n\n\n\n<p>Data reduction<br>be applied to obtain a compressed representation of the data set that is much smaller in volume, yet maintains the integrity of the original data.<\/p>\n\n\n\n<p>drill down<br>to request \u2014 or seek out \u2014 additional information on a specific subject. In a GUI-environment, drilling down may involve clicking on a link or other representation to reveal more detail<\/p>\n\n\n\n<p>Stacking<br>is an ensemble of models combined sequentially. It uses a &#8220;meta learner&#8221; (not voting) to combine the predictions of &#8220;base learners.&#8221; The base learners (the expert) are not combined by voting but by using a meta-learner, another learner scheme that combines the output of the base learners.<\/p>\n\n\n\n<p>Boosting<br>refers to a family of algorithms which converts weak learner to strong learners. It is an ensemble method for improving the model predictions of any given learning algorithm. The idea of boosting is to train weak learners sequentially, each trying to correct its predecessor<\/p>\n\n\n\n<p>Bagging (Bootstrap Aggregating)<br>is a machine learning ensemble meta-algorithm designed to improve the stability and accuracy of machine learning algorithms used in statistical classification and regression. It also reduces variance and helps to avoid overfitting.<\/p>\n\n\n\n<p>Six Sigma<br>A disciplined, data-driven approach and methodology for eliminating defects (driving toward six standard deviations between the mean and the nearest specification limit) in any process &#8211; from manufacturing to transactional and from product to service.<\/p>\n\n\n\n<p>Big Data<br>is a term that describes the large volume of data &#8211; both structured and unstructured &#8211; that inundates a business on a day-to-day basis. But it&#8217;s not the amount of data that&#8217;s important. It&#8217;s what organizations do with the data that matters. Big data can be analyzed for insights that lead to better decisions and strategic business moves.<\/p>\n\n\n\n<p>Exploratory data analysis (EDA)<br>how we make sense of the data by converting them from their raw form to a more informative one<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>sometimes known as model building or pattern id<\/li>\n\n\n\n<li>yields a highly predictive, consistent pattern identifying model<\/li>\n<\/ul>\n\n\n\n<p>SEMMA<br>An alternative process for data mining projects proposed by the SAS Institute. stands for &#8220;sample, explore, modify, model, and assess.&#8221;<\/p>\n\n\n\n<p>\u00b7CRISP-DM<br>Cross Industry Standard Process for Data Mining<br>most comprehensive, common, and standardized data mining process<\/p>\n\n\n\n<p>big data<br>&#8220;this data comes from everywhere: sensors used to gather climate information, posts to social media sites, digital pictures and videos, purchase transaction records, and cell phone GPS signals.&#8221;<\/p>\n\n\n\n<p>4 phases of data mining<br>producing a solution that generates useful forecasting:<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li>problem identification<\/li>\n\n\n\n<li>exploration of the data<\/li>\n\n\n\n<li>pattern discovery<\/li>\n\n\n\n<li>knowledge deployment &#8211; application to new data to forecast predictions<\/li>\n<\/ol>\n\n\n\n<p>Knowledge Discovery and Research<br>transform the repositories of big data into comprehensible knowledge that is useful for guiding their practice and facilitating interdisciplinary research<\/p>\n\n\n\n<p>Classification and Regression Trees (CART)<br>data mining method for analyzing outcomes and service use<\/p>\n\n\n\n<p>Benifits of KDD<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li>enhance business aspects<\/li>\n\n\n\n<li>help to improve patient care<\/li>\n<\/ol>\n\n\n\n<p>ethics of data mining<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li>dependent on the use of private health information<\/li>\n\n\n\n<li>insure data is de-identified and confidentiality maintained<\/li>\n\n\n\n<li>follow changes and specific requirements for compliance with HIPPA laws<\/li>\n<\/ol>\n\n\n\n<p>research<br>thoughtful, planned activity that expands or refines knowledge. the purpose of research is to create generalized knowledge.<\/p>\n\n\n\n<p>characteristics of theory<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li>simplify the situation<\/li>\n\n\n\n<li>explain the most facts in the broadest range of circumstances<\/li>\n\n\n\n<li>accurately predict behavior<\/li>\n<\/ol>\n\n\n\n<p>advantages of models<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li>portray theories with objects, smaller scaled versions, or graphic representations.<\/li>\n\n\n\n<li>aid in comprehension of a theory<\/li>\n\n\n\n<li>includes all of a theory&#8217;s known properties<\/li>\n<\/ol>\n\n\n\n<p>inductive reasoning<br>involves drawing conclusions based on a limited number of observations<\/p>\n\n\n\n<p>deductive reasoning<br>involves drawing conclusions based on generalizations<\/p>\n\n\n\n<p>7 basic steps of research<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li>defining the problem<\/li>\n\n\n\n<li>performing a literature review<\/li>\n\n\n\n<li>determining a research method<\/li>\n\n\n\n<li>selecting an instrument<\/li>\n\n\n\n<li>gathering data<\/li>\n\n\n\n<li>analyzing the data<\/li>\n\n\n\n<li>presenting results<\/li>\n<\/ol>\n\n\n\n<p>5 characteristics of a well developed research question<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li>clearly and exactly stated<\/li>\n\n\n\n<li>has theoretical significance<\/li>\n\n\n\n<li>has obvious links to a larger body of knowledge<\/li>\n\n\n\n<li>results advance knowledge in a definable way<\/li>\n\n\n\n<li>answer to the question is worthwhile<\/li>\n<\/ol>\n\n\n\n<p>3 sources of meaningful research questions<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li>research models-show all the factors and relationships in a theory<\/li>\n\n\n\n<li>recommendations of earlier researchers<\/li>\n\n\n\n<li>gaps in the body of knowledge<\/li>\n<\/ol>\n\n\n\n<p>historical research<br>understand past events<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>case study<\/li>\n\n\n\n<li>bibliography<\/li>\n<\/ul>\n\n\n\n<p>descriptive research<br>describe current status<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>survey<\/li>\n\n\n\n<li>observation<\/li>\n<\/ul>\n\n\n\n<p>correlational research<br>determine existence and degree of relationship<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>survey<\/li>\n\n\n\n<li>secondary analysis<\/li>\n<\/ul>\n\n\n\n<p>evaluation research<br>evaluate effectiveness<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>case study<\/li>\n<\/ul>\n\n\n\n<p>experimental research<br>establish cause and effect. key defining characteristic is control.<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>clinical trial<\/li>\n\n\n\n<li>pre test &amp; post test control group method<\/li>\n<\/ul>\n\n\n\n<p>casual comparative research<br>detect casual relationship<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>one shot case study<\/li>\n\n\n\n<li>static group comparison<\/li>\n\n\n\n<li>nonparticipant observation<\/li>\n<\/ul>\n\n\n\n<p>what determines a researchers choice of research design?<br>depends on the purpose of the research<\/p>\n\n\n\n<p>independent variable<br>factors that researchers manipulate directly<\/p>\n\n\n\n<p>dependent variable<br>factors that are measured variables which depend on independent variables<\/p>\n\n\n\n<p>difference between quasi-experimental research and experimental research<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li>manipulation of treatment<\/li>\n\n\n\n<li>random assignment to the group<\/li>\n<\/ol>\n","protected":false},"excerpt":{"rendered":"<p>the process of finding correlations or patterns among the data using patient data for any purpose beyond providing care for the individual patient brings with it some tricky issues regarding privacy, and keeping the information from falling into the wrong hands. There are significant legal issues related to the use of patient data in data [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"site-sidebar-layout":"default","site-content-layout":"","ast-site-content-layout":"default","site-content-style":"default","site-sidebar-style":"default","ast-global-header-display":"","ast-banner-title-visibility":"","ast-main-header-display":"","ast-hfb-above-header-display":"","ast-hfb-below-header-display":"","ast-hfb-mobile-header-display":"","site-post-title":"","ast-breadcrumbs-content":"","ast-featured-img":"","footer-sml-layout":"","ast-disable-related-posts":"","theme-transparent-header-meta":"","adv-header-id-meta":"","stick-header-meta":"","header-above-stick-meta":"","header-main-stick-meta":"","header-below-stick-meta":"","astra-migrate-meta-layouts":"default","ast-page-background-enabled":"default","ast-page-background-meta":{"desktop":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"ast-content-background-meta":{"desktop":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"footnotes":""},"categories":[],"tags":[],"class_list":["post-109894","post","type-post","status-publish","format-standard","hentry"],"_links":{"self":[{"href":"https:\/\/www.learnexams.com\/blog\/wp-json\/wp\/v2\/posts\/109894","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.learnexams.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.learnexams.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.learnexams.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.learnexams.com\/blog\/wp-json\/wp\/v2\/comments?post=109894"}],"version-history":[{"count":0,"href":"https:\/\/www.learnexams.com\/blog\/wp-json\/wp\/v2\/posts\/109894\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.learnexams.com\/blog\/wp-json\/wp\/v2\/media?parent=109894"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.learnexams.com\/blog\/wp-json\/wp\/v2\/categories?post=109894"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.learnexams.com\/blog\/wp-json\/wp\/v2\/tags?post=109894"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}