Posts

Showing posts from August, 2021

⏳ Mining deep into Data Mining - Statistics - PART I ⏳

Image
 Why do we have to know statistics?🤔 As mentioned in the previous posts, we live in the world of data from which we can derive insightful information. Thus, Statistics play a vital role in processing and analyzing the data to make decisions and predictions. What actually is statistics? 👀 Let's get more technical  Statistics is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data . There are two types or classes of statistics.               👉  Descriptive statistics                👉  Inferential statistics Descriptive Statistics 😀                👉 Descriptive statistics focuses more on analyzing, summarizing, and organizing data in the form of numbers or graphs.                👉 Bar plots, histograms, pie-charts are used in visualizi...

💡⏳ Mining deep into Data Mining - PART II ⏳💡

Image
 Hurray💥 , we have seen the basics of data mining in Part I 😃 Let's get into the phases involved in the KDD process step by step. To start with, let's explore the Data Preprocessing phase. What actually is DATA in data mining? 🤔 In data mining, Data refers to the collection of objects and their attributes . Umm, Confusing right? 😨 👉 An Object is just like an entry in a table or an instance. It is also known as record, point, entity or sample. 👉 Attribute is any property or characteristic of an object. 👉 For example, If the eye of a person is considered as an object then, the eye color, blink rate are regarded as the attributes.  👉 Attribute can also be called a feature, field, characteristic, or variable in data mining. 👉 Here, the organization of data is in a tabular form.                                           ...

💡⏳ Mining deep into Data Mining - PART I ⏳💡

 "Necessity is the mother of invention" The need for knowledge is the root of data collection, discovery, and analysis. To be precise, we could say that the current technological world is  drowning in data but starving for knowledge. Thus, data mining comes in handy What is Data Mining? It is the extraction of interesting, non-trivial, previously unknown, potentially useful, patterns or knowledge from the huge amount of data. Want to know the alternative names of Data Mining? 👉 Knowledge Discovery and Databases (KDD) 👉 Data or Pattern analysis 👉 Data archeology 👉 Data dredging 👉 Information harvesting 👉 Business Intelligence Data mining is indeed a confluence of multiple disciplines mainly 👉 Statistics 👉 Algorithms 👉 Data visualization 👉 Machine learning 👉 Pattern recognition 👉 Database Technology Why not follow traditional data analysis? 👉 Traditional analysis of data will not be able to handle tera-bytes of data 👉 High dimensional data add complexity to the a...