<?xml version="1.0"?>
<rss version="2.0">
   <channel>
      <title>Data Mining  by KP</title>
      <link>https://padlet.com/kpherng/pouehjp444iq</link>
      <description></description>
      <language>en-us</language>
      <pubDate>2018-04-12 12:34:13 UTC</pubDate>
      <lastBuildDate>2023-02-28 10:23:20 UTC</lastBuildDate>
      <webMaster>hello@padlet.com</webMaster>
      <image>
         <url>https://padlet-assets.s3.amazonaws.com/icons/Balance.png</url>
      </image>
      <item>
         <title>What data mining function does a department store need to assist with its target marketing mail campaign? </title>
         <author></author>
         <link>https://padlet.com/kpherng/pouehjp444iq/wish/251106621</link>
         <description><![CDATA[<div>- Association function may be used in data mining to determine what product that customer are likely to purchase next when they have already previously bought another product. Using this info, we may email customers that are more likely to buy the products.<br>- clustering function also can be used to determine the customer personal detail such as gender / age group / married status to sent them suitable email to them. For example, for female customer, the company can sent them email that related to female product<br>- description method<br>- Constrain - based method could be a good approach. A constraint refers to the user expectation or the properties of desired clustering results.<br><br></div>]]></description>
         <enclosure url="" />
         <pubDate>2018-04-12 12:35:33 UTC</pubDate>
         <guid>https://padlet.com/kpherng/pouehjp444iq/wish/251106621</guid>
      </item>
      <item>
         <title>Can they be performed alternatively by data query processing or simple statistical analysis?</title>
         <author></author>
         <link>https://padlet.com/kpherng/pouehjp444iq/wish/251106713</link>
         <description><![CDATA[<div>- Statistics is only about quantifying data. While it uses tools to find relevant properties of data, it is a lot like math. It provides the tools necessary for data mining.<br><br>- Query Process, on the other hand, builds models to detect patterns and relationships in data, particularly from large data bases.<br><br>-  Statics is very useful and effective for small group of data , however, query process which is data mining will be more effective for large amount of data. It will provde more comprehensive anlaysis of data and faster the data process. </div>]]></description>
         <enclosure url="" />
         <pubDate>2018-04-12 12:35:49 UTC</pubDate>
         <guid>https://padlet.com/kpherng/pouehjp444iq/wish/251106713</guid>
      </item>
      <item>
         <title>Question </title>
         <author>kpherng</author>
         <link>https://padlet.com/kpherng/pouehjp444iq/wish/251107691</link>
         <description><![CDATA[<div> What <em>data mining function</em> does a department store need to assist with its target marketing mail campaign? Can they be performed alternatively by data query processing or simple statistical analysis? </div>]]></description>
         <enclosure url="" />
         <pubDate>2018-04-12 12:38:16 UTC</pubDate>
         <guid>https://padlet.com/kpherng/pouehjp444iq/wish/251107691</guid>
      </item>
      <item>
         <title></title>
         <author></author>
         <link>https://padlet.com/kpherng/pouehjp444iq/wish/251109187</link>
         <description><![CDATA[]]></description>
         <enclosure url="https://padlet-uploads.storage.googleapis.com/281165630/262379e43b5ba959a0defd9af1c5634e/tut1_group5.docx" />
         <pubDate>2018-04-12 12:41:24 UTC</pubDate>
         <guid>https://padlet.com/kpherng/pouehjp444iq/wish/251109187</guid>
      </item>
      <item>
         <title>&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;&gt;</title>
         <author>kpherng</author>
         <link>https://padlet.com/kpherng/pouehjp444iq/wish/262108943</link>
         <description><![CDATA[]]></description>
         <enclosure url="" />
         <pubDate>2018-05-19 14:58:37 UTC</pubDate>
         <guid>https://padlet.com/kpherng/pouehjp444iq/wish/262108943</guid>
      </item>
      <item>
         <title>Chapter 1 - ALVIN</title>
         <author></author>
         <link>https://padlet.com/kpherng/pouehjp444iq/wish/262109679</link>
         <description><![CDATA[<div>Data Mining overview<br><br>Front part all logic<br>no need read<br><br>DM - extract information from a data set and transform it into an understandable structure for further use<br>- turn raw data into useful info<br><br>moores law<br>- computer speed double every 18 months<br><br>storage law<br>- total storage double every 9 months<br><br>6 steps<br>- Busniess understanding<br>- Data Understading/preparation/collection<br>- pre-procesing<br>-Modelling<br>-Mining<br>-Evaluation</div>]]></description>
         <enclosure url="" />
         <pubDate>2018-05-19 15:09:04 UTC</pubDate>
         <guid>https://padlet.com/kpherng/pouehjp444iq/wish/262109679</guid>
      </item>
      <item>
         <title>Chapter 3 - KP</title>
         <author>kpherng</author>
         <link>https://padlet.com/kpherng/pouehjp444iq/wish/262153524</link>
         <description><![CDATA[<div>Discrete attribute<br>- finite<br>- countable eg zip codes, counts<br>- integer<br><br>Continuous attribute<br>- real numbers<br>- floating point<br><br>Nominal<br>- ID, zip codes, profession, eye color<br><br>Ordinal<br>- in order, rankings, grades<br><br>Binary<br>- true false, yes no<br><br>Interval<br>- dates<br><br>Ratio<br>- temperature in C vs F<br>- distance in CM vs Inch<br><br>Data &amp; Data Pre-processing&nbsp;<br>Type of data sets<br>- Record (relational records) (a set of item like ppl who buy a cpu from computer have 90% will buy a mobo / 80% will buy ram / 50% case)<br>- Graph (From WWW, Social network)<br>- Ordered (maps / times-series)<br>why need prepocess data&nbsp;<br>- incomplete<br>- inconsistent (birth in 1996 and age is 50)&nbsp;<br>- noise (error)<br>all is no quality data which will make the mining results no quality&nbsp;<br><br>binning to handle noisy data<br>- smooth data by partition<br><br>Task in Data Preprocessing&nbsp;<br>- Data Cleaning clean dirty data (inconsistent, incomplete, noise)<br>- Data integration&nbsp;<br>- Data transformation&nbsp;<br>-&nbsp; Data reduction&nbsp;or data compression<br><br><br></div>]]></description>
         <enclosure url="" />
         <pubDate>2018-05-20 07:32:57 UTC</pubDate>
         <guid>https://padlet.com/kpherng/pouehjp444iq/wish/262153524</guid>
      </item>
      <item>
         <title>Chapter 2 data mining overview</title>
         <author>eddion717</author>
         <link>https://padlet.com/kpherng/pouehjp444iq/wish/263908003</link>
         <description><![CDATA[<div>Prediction Method<br>- use variables and values to predict future values<br>- classification<br>- Anomaly Detection<br>- Regresson<br><br><br>Description Method<br>- human interpretable patterns<br>- Clustering<br>- Sequential Pattern discovery<br>- Association rule discovery<br><br>Classification<br>- supervised learning<br>- assigns item in a collection to target categories or classes<br>- to accurately predict target class for each case in the data<br><br>Regression<br>- predict numbers<br><br>Clustering<br>- find groups of closely related observtions<br>- euclidean distance&nbsp;<br><br>Sequential pattern<br>- predict strong sequential dependencies aming different events<br>-&nbsp;(A B) (C) -&gt; (D E)<br><br><br><br></div>]]></description>
         <enclosure url="" />
         <pubDate>2018-05-27 23:22:36 UTC</pubDate>
         <guid>https://padlet.com/kpherng/pouehjp444iq/wish/263908003</guid>
      </item>
      <item>
         <title></title>
         <author>kpherng</author>
         <link>https://padlet.com/kpherng/pouehjp444iq/wish/263909996</link>
         <description><![CDATA[]]></description>
         <enclosure url="https://padlet-uploads.storage.googleapis.com/121077726/f9114c53a5db705c120728069bbd0234/c2.png" />
         <pubDate>2018-05-27 23:43:42 UTC</pubDate>
         <guid>https://padlet.com/kpherng/pouehjp444iq/wish/263909996</guid>
      </item>
      <item>
         <title></title>
         <author>kpherng</author>
         <link>https://padlet.com/kpherng/pouehjp444iq/wish/263910036</link>
         <description><![CDATA[]]></description>
         <enclosure url="https://padlet-uploads.storage.googleapis.com/121077726/05e3afddf9fb7d08ef9cd17a003447cf/c3.png" />
         <pubDate>2018-05-27 23:44:16 UTC</pubDate>
         <guid>https://padlet.com/kpherng/pouehjp444iq/wish/263910036</guid>
      </item>
      <item>
         <title></title>
         <author>kpherng</author>
         <link>https://padlet.com/kpherng/pouehjp444iq/wish/263910066</link>
         <description><![CDATA[]]></description>
         <enclosure url="https://padlet-uploads.storage.googleapis.com/121077726/d0e86ebf22f9f8966de34de7b6de7e18/c4.png" />
         <pubDate>2018-05-27 23:44:40 UTC</pubDate>
         <guid>https://padlet.com/kpherng/pouehjp444iq/wish/263910066</guid>
      </item>
      <item>
         <title>Chapter 4</title>
         <author>eddion717</author>
         <link>https://padlet.com/kpherng/pouehjp444iq/wish/263911190</link>
         <description><![CDATA[<div>apriori algorithm<br>- if itemset is frequent, subsets must also be frequent<br><br>multiple level association rule<br><br></div>]]></description>
         <enclosure url="" />
         <pubDate>2018-05-27 23:56:08 UTC</pubDate>
         <guid>https://padlet.com/kpherng/pouehjp444iq/wish/263911190</guid>
      </item>
      <item>
         <title>Chapter 5</title>
         <author>eddion717</author>
         <link>https://padlet.com/kpherng/pouehjp444iq/wish/263911516</link>
         <description><![CDATA[<div>Classification<br>2 step process<br>- Model construction (step1)<br>- model is represented as classificaiton rules, decision trees, mathematical formula<br>- Model usage (step 2)<br><br>decision tree <br>-......<br><br>Bayesian classification<br>- probability<br><br><br><br></div>]]></description>
         <enclosure url="" />
         <pubDate>2018-05-28 00:00:44 UTC</pubDate>
         <guid>https://padlet.com/kpherng/pouehjp444iq/wish/263911516</guid>
      </item>
      <item>
         <title></title>
         <author>kpherng</author>
         <link>https://padlet.com/kpherng/pouehjp444iq/wish/263911847</link>
         <description><![CDATA[]]></description>
         <enclosure url="https://padlet-uploads.storage.googleapis.com/121077726/37cf709078c5fd87c432b5394c91d105/sample.png" />
         <pubDate>2018-05-28 00:05:23 UTC</pubDate>
         <guid>https://padlet.com/kpherng/pouehjp444iq/wish/263911847</guid>
      </item>
      <item>
         <title>chapter 6</title>
         <author>eddion717</author>
         <link>https://padlet.com/kpherng/pouehjp444iq/wish/263911940</link>
         <description><![CDATA[<div>Classification<br>KNN other names<br>- memory based reasoning<br>- example based reasoning<br>- instance based learing<br>- case based reasoning<br>- lazy learning<br><br>rule based classifier<br>eg (can give birth) = mammal<br>eg (can fly) = bird<br><br>direct method vs undirect method<br><br>mutual exclusive rule vs exhaustive rule<br><br>strategy for single rule<br>- top down (general to specific)<br>- bottom up (specific to general)</div>]]></description>
         <enclosure url="" />
         <pubDate>2018-05-28 00:06:51 UTC</pubDate>
         <guid>https://padlet.com/kpherng/pouehjp444iq/wish/263911940</guid>
      </item>
   </channel>
</rss>
