Archive for the ‘Data Mining’ Category

Data Mining Conferences 2010-2011

Wednesday, December 1st, 2010

A list of data mining research conferences, workshops, and meetings. Some of them are as follows:

[1] SIAM Conference on Data Mining (SDM 2011)
This conference provides a venue for researchers who are addressing these problems to present their work in a peer-reviewed forum. It also provides an ideal setting for graduate students and others new to the field to learn about cutting-edge research by hearing outstanding invited speakers and attending tutorials (included with conference registration). A set of focused workshops are also held on the last day of the conference. The proceedings of the conference are published in archival form, and are also made available on the SIAM web site.
Website: http://www.siam.org/meetings/sdm11/

[2] SIGKDD Conference on Knowledge Discovery and Data Mining
The annual ACM SIGKDD conference is the premier international forum for data mining researchers and practitioners from academia, industry, and government to share their ideas, research results and experiences. KDD-2011 will feature keynote presentations, oral paper presentations, poster sessions, workshops, tutorials, panels, exhibits, demonstrations, and the KDD Cup competition.
Website: http://kdd.org/kdd/2011/

[3] ACM SIGMOD 2011
The annual ACM SIGMOD/PODS conference is a leading international forum for database researchers, practitioners, developers, and users to explore cutting-edge ideas and results, and to exchange techniques, tools, and experiences.
Website: http://www.sigmod2011.org/index.shtml

[4] IJCAI 2011
The IJCAI-11 Program Committee invites submissions of technical papers for IJCAI-11, to be held in Barcelona, Spain, July 19-22, 2011. Submissions are invited on significant, original, and previously unpublished research on all aspects of artificial intelligence. The theme of IJCAI-11 is “Integrated and Embedded Artificial Intelligence” (IEAI) with a focus on artificial intelligence that crosses discipline boundaries within AI, and between AI and other disciplines. Building systems often requires techniques from more than one area (e.g. both machine learning and natural language processing, or both planning and preference representation). In addition, larger systems often have AI components embedded within that provide intelligent functionalities such as learning and reasoning. The conference will include a special track dedicated to such work.
Website: http://ijcai-11.iiia.csic.es/

[5] ECML PKDD 2011
The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD) will take place in Athens, Greece from September 5th to 9th, 2011. This event builds upon a very successful series of 21 ECML and 14 PKDD conferences, which have been jointly organized for the past ten years.
Website: http://www.ecmlpkdd2011.org/

Other Conferences:

[6] TDWI World Series Conference
What is your corporate data strategy? Most business and IT professionals point to their data models, data quality tools, or even a spreadsheet data “dictionary.” Maybe you’re just getting started with a new data quality project. Maybe you already have several data management initiatives in place and are making good progress. Or perhaps you’ve recently inherited the mess someone else made. In any case, does your right hand know what your left is up to? Several departmental data initiatives—even successful ones—do not equal a corporate data strategy. TDWI fosters a community of learning where business and technical professionals come together to gain knowledge and skills, network with peers, and advance their careers. Through education and research programs, TDWI enables individuals, teams, and organizations to leverage information to improve decision making, optimize performance, and achieve business objectives.
Website: http://events.tdwi.org/events/las-vegas-world-conference-2011/home.aspx?utm_source=AttendeeMktg&utm_medium=E-Mail&utm_campaign=lv20g

Realizing the popularity, and the need of Data Mining in the road ahead, has resulted into organizing many more conferences (also conferences into  many more specialized data mining streams) to be held all over the world. So, many more conferences, lots of research and great innovations and discoveries lay ahead.

  • Share/Bookmark

Online Data Mining Reference Book

Wednesday, December 1st, 2010

The online book was created by The Data Mining group was established in November 2000 by Dr. Saed Sayad in a collaboration with Professor Stephen T. Balke in the Department of Chemical Engineering and Applied Chemistry at the University of Toronto.

You can visit the group here:

http://chem-eng.utoronto.ca/~datamining/

You can access the book here:

http://chem-eng.utoronto.ca/~datamining/dmc/data_mining_map.htm

Features:

  • Quick Reference
  • Easy to memorize
  • Neat layout with good colored illustrations
  • Easy navigation
  • and more.

Happy Reading!

  • Share/Bookmark

Space…Go data mining go!

Sunday, October 17th, 2010

With many stars being born every day, and many more dieing and disappearing, the space is a massive challenge to deal with. Today, thanks to digital photography astronomers get to spend more time making sense of the sky, than just gazing at it with telescope every night.  The best thing that astronomer say, is the use of data mining and statistic techniques, which makes their job a lot easier.

Some key facts:

  • Key goal is learning about the changing sky faster and more efficiently.
  • Typically, 1.5 Million new observations every night.
  • Machine learning algorithms which recognize different galaxy types ranging from spiral to elliptical are improving and aiding the process very efficiently.
  • Its also important to note that more data also means increased problem space.

[ Read  more at Space]

  • Share/Bookmark

Data Mining Trends

Tuesday, September 21st, 2010

A trends statistics from google for search volume of data mining is as follows:

http://www.google.com/trends?q=Data+mining&ctab=0&geo=all&date=all

Topics of Interest:

A:  Business Intelligence and Data Mining
B: Data mining tells government and business a lot about you
C: Data mining is commonly used in business to find patterns
D: `Data mining’ may implicate innocent people in search for terrorists
E: ‘Data mining’ for drug companies goes to courts
F: IMS Health stock falls, as data mining ban pitched

Ranking according to countries (South Asia) of interest in datamining :
Country Ranking
India 1
Pakistan 2
Taiwan 3
Hong Kong 4
Iran 5
Indonesia 6
Singapore 7
South Korea 8
Malaysia 9
Thailand 10

A survey in 2010, for the data mining tools used revealed the interest of consumers in different data mining tools as follows:

This poll was conducted by KDnuggets ::http://www.kdnuggets.com/polls/2010/data-mining-analytics-tools.html and about 900 unique Data miners voted in the poll , but each were allowed multiple votes.

RapidMiner (345) 37.8%
R (272) 29.8%
Excel (222) 24.3%
KNIME (175) 19.2%
Your own code (168) 18.4%
Pentaho/Weka (131) 14.3%
SAS (110) 12.0%
MATLAB (84) 9.2%
IBM SPSS Statistics (72) 7.9%
Other free tools (67) 7.3%
IBM SPSS Modeler (former Clementine) (67) 7.3%
Microsoft SQL Server (63) 6.9%
Statsoft Statistica (57) 6.2%
Other commercial tools (56) 6.1%
SAS Enterprise Miner (50) 5.5%
Zementis (34) 3.7%
Orange (25) 2.7%
Oracle DM (19) 2.1%
KXEN (19) 2.1%
Salford CART Mars other (15) 1.6%
VisuaLinks (12) 1.3%
Viscovery (10) 1.1%
Angoss (8) 0.9%
TIBCO Insightful Miner (7) 0.8%
Miner3D (7) 0.8%
REvolution Computing (4) 0.4%
Megaputer Polyanalyst/TextAnalyst (3) 0.3%
Portrait Software (2) 0.2%
Data Applied (2) 0.2%
Centrifuge (2) 0.2%
PRSD Studio (1) 0.1%
Clario Analytics (1) 0.1%
Bayesia (1) 0.1%

Open Source Data mining tools:
Well, even open source data mining is on the rise. Weka , Orange , Rattle and Rapid miner are few open source software to name. The recent trends in use of data mining software also supports Open Source in a big way, the following is an analysis by KDnuggets which indicates the choice of type of software by users of various countries.

Manu C, Student Content Intern.

  • Share/Bookmark