วันจันทร์ที่ 27 เมษายน พ.ศ. 2558

7 Steps for Learning Data Mining and Data Science


  1. Languages: Learn R, Python, and SQL
  2. Tools: Learn how to use data mining and visualization tools
  3. Textbooks: Read introductory textbooks to understand the fundamentals
  4. Education: watch webinars, take courses, and consider a certificate or a degree in data science
  5. Data: Check available data resources and find something there
  6. Competitions: Participate in data mining competitions
  7. Interact with other data scientists, via social networks, groups, and meetings







1. Learning Languages


Recent KDnuggets Poll found that the most popular languages for data miningare R, Python, and SQL.
There are many resources for each, for example

2. Tools: Data Mining, Data Science, and Visualization Software


There are many data mining tools for different tasks, but it is best to learn using a data mining suite which supports the entire process of data analysis.
You can start with open source (free) tools such as KNIMERapidMiner, andWeka.
However, for many analytics jobs you need to know SAS, which is the leading commercial tool and widely used.
Other popular Analytics and Data Mining Software include MATLAB, StatSoft STATISTICA, Microsoft SQL Server, Tableau, IBM SPSS Modeler, and Rattle.
Visualization is an essential part of any data analysis - learn how to use Microsoft Excel (good for many simpler tasks), R graphics, (especiallyggplot2), and also Tableau - an excellent package for visualization. Other good visualization tools include TIBCO Spotfire and Miner3D.

3. Textbooks


There are many data mining and data science textbooks available, but you can check these

4. Education: Webinars, Courses, Certificates, and Degrees


You can start by watching some of the many free webinars and webcasts on latest topics in Analytics, Big Data, Data Mining, and Data Science.
There are also many online courses, short and long, many of them free - seeKDnuggets online education directory.
Check in particular these courses:
Finally, consider getting Certificates in Data Mining, and Data Science or advanced degrees, such as MS in Data Science - see KDnuggets directory forEducation in Analytics, Data Mining, and Data Science.

5. Data


You will need data to analyze - see KDnuggets directory of Datasets for Data Mining, including

6. Competitions


Again, you will best learn by doing, so participate in Kaggle competitions - start with beginner competitions, such as Predicting Titanic Survival using Machine Learning

7. Interact: Meetings, Groups, and Social Networks

AnalyticBridge is an active community for Analytics and Data Science.
Cr: http://www.kdnuggets.com/meetings/index.html

ไม่มีความคิดเห็น:

แสดงความคิดเห็น