Data Mining and Big data are two different things, while both of them relate to use of large datasets to handle the data that will serve our purpose, they are two different terms in the aspect of operation they are used for. Big Data refers to a collection of large datasets ( eg- datasets in Excel sheets which are too large to be handled easily). Data Mining on the other hand refers to the activity of going through a large chunk of data to look for relevant or pertinent information.
What is Big Data?
Big data refers to huge amount of data which is not easy to handle with conventional ways, it might be structured, semi- structured or unstructured. It comprises of 5 Vs-
Refers to amount of data. ( can be in quintillions)
Refers to type of data we can use. ( structured, unstructured or semi-structured)
Refers to the worth of data being extracted.
It refers to the quality or the trustworthiness of the data we have.
Refers to how fast our data is growing.
Why is Big Data Important?
Most things in today’s scenario are driven by profitability they give in terms of monetary benefits, these tools help in providing meaningful information for making better business decisions and can also be used to study various other things which could benefit humanity.
Why is Data Mining important?
Data Mining is important because of various reasons, the most vital and useful of them is to understand what is relevant and make a good use of it to assess the things as the new data comes into picture, this in turn branches into various use cases in places like healthcare industry, financial market analysis etc.
Having understood both the concepts fairly well, we can say they are 2 very different concepts, The main concept if we look in Data Mining is to dig into the data and analyse the pattern and relationship which can further be useful in prediction algorithms like of Linear Regression in Artificial Intelligence. The main concept in Big Data on the other hand is velocity, source, security of the huge amount of data at our disposal.
It can be said that Data Mining is not dependent on Big Data, as it can be done on any amount of data ( preferentially big, as it gives more test cases and hence accurate results) be it big or small. Big Data on the other hand is very much dependent on data mining as we need to find the use of the big volume of data we have, it is no use without its analysis.