<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title> &#187; Data Mining</title>
	<atom:link href="http://dataminingtools.net/blog/category/datamining/feed/" rel="self" type="application/rss+xml" />
	<link>http://dataminingtools.net/blog</link>
	<description></description>
	<lastBuildDate>Mon, 25 Jul 2011 08:51:53 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Google Predict Empowering Applications</title>
		<link>http://dataminingtools.net/blog/2011/05/21/google-predict-empowering-applications/</link>
		<comments>http://dataminingtools.net/blog/2011/05/21/google-predict-empowering-applications/#comments</comments>
		<pubDate>Sun, 22 May 2011 04:32:14 +0000</pubDate>
		<dc:creator>Vikramaditya Jakkula</dc:creator>
				<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[Machine Learning]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[news]]></category>

		<guid isPermaLink="false">http://dataminingtools.net/blog/?p=687</guid>
		<description><![CDATA[The Google Prediction API lets you tap into Google’s machine learning algorithms, which crunch your data and give you possible outcomes, helping you make your applications smarter.
Features

Lightweight RESTful API.
Asynchronous training.
Automatically selects from several available machine learning techniques.
Supported inputs: numeric data and unstructured text.
Outputs hundreds of discrete categories, or continuous values.
Gallery of pre-trained prediction models.
Ability to [...]]]></description>
			<content:encoded><![CDATA[<p>The Google Prediction API lets you tap into Google’s machine learning algorithms, which crunch your data and give you possible outcomes, helping you make your applications smarter.</p>
<h2>Features</h2>
<ul>
<li>Lightweight RESTful API.</li>
<li>Asynchronous training.</li>
<li>Automatically selects from several available machine learning techniques.</li>
<li>Supported inputs: numeric data and unstructured text.</li>
<li>Outputs hundreds of discrete categories, or continuous values.</li>
<li>Gallery of pre-trained prediction models.</li>
<li>Ability to add new training data on the fly.</li>
<li>Accessible from many platforms: Google App Engine, Apps Script (Google Spreadsheets), web &amp; desktop apps, and command line.</li>
</ul>
<p>Read More: <a href="http://code.google.com/apis/predict/">http://code.google.com/apis/predict/</a></p>
]]></content:encoded>
			<wfw:commentRss>http://dataminingtools.net/blog/2011/05/21/google-predict-empowering-applications/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>First look at Dhiti</title>
		<link>http://dataminingtools.net/blog/2011/05/21/first-look-at-dhiti/</link>
		<comments>http://dataminingtools.net/blog/2011/05/21/first-look-at-dhiti/#comments</comments>
		<pubDate>Sun, 22 May 2011 04:27:13 +0000</pubDate>
		<dc:creator>Vikramaditya Jakkula</dc:creator>
				<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[Machine Learning]]></category>
		<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">http://dataminingtools.net/blog/?p=685</guid>
		<description><![CDATA[
Dhiti offers a RESTful API to our exploratory search platform. In short, our platform allows you to:

Upload a set of documents (html, or text) into a session.
Extract top topics, or concepts for a document, or a set of documents
Provide relevance feedback about articles, concepts or nuggets you like and dislike.
Recommendations subsequently change according to that.
Get [...]]]></description>
			<content:encoded><![CDATA[<p><object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="480" height="390" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="allowscriptaccess" value="always" /><param name="src" value="http://www.youtube.com/v/bTcdZcHBR2g?fs=1&amp;hl=en_US" /><param name="allowfullscreen" value="true" /><embed type="application/x-shockwave-flash" width="480" height="390" src="http://www.youtube.com/v/bTcdZcHBR2g?fs=1&amp;hl=en_US" allowscriptaccess="always" allowfullscreen="true"></embed></object></p>
<p>Dhiti offers a RESTful API to our exploratory search platform. In short, our platform allows you to:</p>
<ul>
<li>Upload a set of documents (html, or text) into a session.</li>
<li>Extract top topics, or concepts for a document, or a set of documents</li>
<li>Provide relevance feedback about articles, concepts or nuggets you like and dislike.<br />
Recommendations subsequently change according to that.</li>
<li>Get recommendations of nuggets, articles or categories for a pivot. A pivot can be:
<ul>
<li>a url (of a document already added)</li>
<li>a string (treated as a query)</li>
<li>a category</li>
<li>your preferences – based on the relevance feedback</li>
</ul>
</li>
<li>Persist the session, along with your preferences</li>
</ul>
<p>Some applications of our API:</p>
<ul>
<li>Content discovery on publishing sites. <a href="http://dhiti.com/">Dhiti Dive</a>.</li>
<li>Explore pages and topics on any page on the web. <a href="http://drilll.com/">Drilll</a></li>
<li>Convert your incoming twitter stream into a research library. <a href="http://intweetion.com/">Intweetion</a></li>
<li>Get short, relevant previews from a book. Eg: <a href="http://nuggetize.com/theselfishgene?expand=1">Preview for the Selfish Gene</a></li>
</ul>
<p>More: <a href="http://dhiti.com/api/">http://dhiti.com/api/</a></p>
]]></content:encoded>
			<wfw:commentRss>http://dataminingtools.net/blog/2011/05/21/first-look-at-dhiti/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>KNIME Beginner&#8217;s Luck</title>
		<link>http://dataminingtools.net/blog/2011/05/21/knime-beginners-luck/</link>
		<comments>http://dataminingtools.net/blog/2011/05/21/knime-beginners-luck/#comments</comments>
		<pubDate>Sun, 22 May 2011 04:04:41 +0000</pubDate>
		<dc:creator>Vikramaditya Jakkula</dc:creator>
				<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[Education]]></category>

		<guid isPermaLink="false">http://dataminingtools.net/blog/?p=680</guid>
		<description><![CDATA[KNIME (Konstanz Information Miner) is a user-friendly and comprehensive open-source data integration, processing, analysis, and exploration platform. 
&#8220;KNIME Beginner&#8217;s Luck&#8221; is a quick introduction to KNIME for beginners.
More: Rosaria Silipo is a certified KNIME trainer, and this book grew out of her lessons on KNIME and KNIME Reporting. It gives a detailed overview of the [...]]]></description>
			<content:encoded><![CDATA[<p>KNIME (Konstanz Information Miner) is a user-friendly and comprehensive open-source data integration, processing, analysis, and exploration platform.</p>
<p><a href="http://knime.org/knimepress/beginners-luck" target="_blank">&#8220;KNIME Beginner&#8217;s Luck&#8221;</a> is a quick introduction to KNIME for beginners.</p>
<p>More: Rosaria Silipo is a certified KNIME trainer, and this book grew out of her lessons on KNIME and KNIME Reporting. It gives a detailed overview of the main tools and philosophy of the KNIME data analysis platform. The goal is to give new KNIME users the knowledge they need to start analysing, manipulating, and reporting on even complex data.</p>
<p>Link: <a href="http://knime.org/knimepress/beginners-luck">http://knime.org/knimepress/beginners-luck</a></p>
<p><a rel="attachment wp-att-681" href="http://dataminingtools.net/blog/2011/05/21/knime-beginners-luck/beginnersluck_cover/"><img class="aligncenter size-medium wp-image-681" title="BeginnersLuck_cover" src="http://dataminingtools.net/blog/wp-content/uploads/2011/05/BeginnersLuck_cover-300x212.png" alt="" width="300" height="212" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://dataminingtools.net/blog/2011/05/21/knime-beginners-luck/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Data mining &amp; Hip Hop</title>
		<link>http://dataminingtools.net/blog/2011/02/23/data-mining-hip-hop/</link>
		<comments>http://dataminingtools.net/blog/2011/02/23/data-mining-hip-hop/#comments</comments>
		<pubDate>Wed, 23 Feb 2011 19:16:58 +0000</pubDate>
		<dc:creator>Vikramaditya Jakkula</dc:creator>
				<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[news]]></category>

		<guid isPermaLink="false">http://dataminingtools.net/blog/?p=675</guid>
		<description><![CDATA[Tahir Hemphill data mined 30 years of hip-hop lyrics to provide a searchable index of the genre’s lexicon.
The project analyzes the lyrics of over 40,000 songs for metaphors, similes, cultural references, phrases, memes and socio-political ideas. [Project] The project is one of a kind, with huge potential for the hip-hop world: not only can you [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://tahirhemphill.com/portfolio/projects.html" target="_blank">Tahir Hemphill</a> data mined 30 years of hip-hop lyrics to provide a searchable index of the genre’s lexicon.</p>
<p>The project analyzes the lyrics of over 40,000 songs for metaphors, similes, cultural references, phrases, memes and socio-political ideas. [<a href="http://www.kickstarter.com/projects/1801076626/the-hip-hop-word-count-a-searchable-rap-almanac" target="_blank">Project</a>] The project is one of a kind, with huge potential for the hip-hop world: not only can you visualize artists&#8217; careers, but you can also dig deeper into their work and potentially find patterns in their music.</p>
<p><iframe frameborder="0" height="410px" src="http://www.kickstarter.com/projects/1801076626/the-hip-hop-word-count-a-searchable-rap-almanac/widget/video.html" width="480px"></iframe></p>
<p>Interesting Links:</p>
<ul>
<li><a href="http://www.eyebeam.org/taxonomy/term/5029">http://www.eyebeam.org/taxonomy/term/5029</a></li>
<li><a href="https://spreadsheets.google.com/ccc?key=0Aju92oYl3qVTdFUzdGZPVFh6Tld0YUd1VWhzaVd5ZFE&amp;hl=en#gid=0">https://spreadsheets.google.com/ccc?key=0Aju92oYl3qVTdFUzdGZPVFh6Tld0YUd1VWhzaVd5ZFE&amp;hl=en#gid=0</a></li>
<li><a href="http://www.hiphoparchive.org/">http://www.hiphoparchive.org/</a></li>
</ul>
<p>[Read more @ <a href="http://www.wired.com/epicenter/2011/02/datamining-hip-hops-history/" target="_blank">Wired</a>]</p>
]]></content:encoded>
			<wfw:commentRss>http://dataminingtools.net/blog/2011/02/23/data-mining-hip-hop/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Data Mining Conferences 2010-2011</title>
		<link>http://dataminingtools.net/blog/2010/12/01/data-mining-conferences-2010-2011/</link>
		<comments>http://dataminingtools.net/blog/2010/12/01/data-mining-conferences-2010-2011/#comments</comments>
		<pubDate>Thu, 02 Dec 2010 00:04:13 +0000</pubDate>
		<dc:creator>Vikramaditya Jakkula</dc:creator>
				<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Conference]]></category>
		<category><![CDATA[news]]></category>

		<guid isPermaLink="false">http://dataminingtools.net/blog/?p=664</guid>
		<description><![CDATA[A list of data mining research conferences, workshops, and meetings. Some of them are as follows:
[1] SIAM Conference on Data Mining (SDM 2011)
This conference provides a venue for researchers who are addressing these problems to present their work in a peer-reviewed forum. It also provides an ideal setting for graduate students and others new to [...]]]></description>
			<content:encoded><![CDATA[<p>A list of data mining research conferences, workshops, and meetings. Some of them are as follows:</p>
<p>[1] <strong>SIAM Conference on Data Mining</strong> (SDM 2011)<br />
This conference provides a venue for researchers who are addressing these problems to present their work in a peer-reviewed forum. It also provides an ideal setting for graduate students and others new to the field to learn about cutting-edge research by hearing outstanding invited speakers and attending tutorials (included with conference registration). A set of focused workshops is also held on the last day of the conference. The proceedings of the conference are published in archival form, and are also made available on the SIAM web site.<br />
Website: <a href="http://www.siam.org/meetings/sdm11/">http://www.siam.org/meetings/sdm11/</a></p>
<p>[2] <strong>SIGKDD Conference on Knowledge Discovery and Data Mining</strong><br />
The annual ACM SIGKDD conference is the premier international forum for data mining researchers and practitioners from academia, industry, and government to share their ideas, research results and experiences. KDD-2011 will feature keynote presentations, oral paper presentations, poster sessions, workshops, tutorials, panels, exhibits, demonstrations, and the KDD Cup competition.<br />
Website: <a href="http://kdd.org/kdd/2011/">http://kdd.org/kdd/2011/</a></p>
<p>[3] <strong>ACM SIGMOD 2011</strong><br />
The annual ACM SIGMOD/PODS conference is a leading international forum for database researchers, practitioners, developers, and users to explore cutting-edge ideas and results, and to exchange techniques, tools, and experiences.<br />
Website: <a href="http://www.sigmod2011.org/index.shtml">http://www.sigmod2011.org/index.shtml</a></p>
<p>[4] <strong>IJCAI 2011</strong><br />
The IJCAI-11 Program Committee invites submissions of technical papers for IJCAI-11, to be held in Barcelona, Spain, July 19-22, 2011. Submissions are invited on significant, original, and previously unpublished research on all aspects of artificial intelligence. The theme of IJCAI-11 is “Integrated and Embedded Artificial Intelligence” (IEAI) with a focus on artificial intelligence that crosses discipline boundaries within AI, and between AI and other disciplines. Building systems often requires techniques from more than one area (e.g. both machine learning and natural language processing, or both planning and preference representation). In addition, larger systems often have AI components embedded within that provide intelligent functionalities such as learning and reasoning. The conference will include a special track dedicated to such work.<br />
Website: <a href="http://ijcai-11.iiia.csic.es/">http://ijcai-11.iiia.csic.es/</a></p>
<p>[5] <strong>ECML PKDD 2011</strong><br />
The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD) will take place in Athens, Greece from September 5th to 9th, 2011. This event builds upon a very successful series of 21 ECML and 14 PKDD conferences, which have been jointly organized for the past ten years.<br />
Website: <a href="http://www.ecmlpkdd2011.org/">http://www.ecmlpkdd2011.org/</a></p>
<p>Other Conferences:</p>
<p>[6] <strong>TDWI World Series Conference</strong><br />
What is your corporate data strategy? Most business and IT professionals point to their data models, data quality tools, or even a spreadsheet data “dictionary.” Maybe you’re just getting started with a new data quality project. Maybe you already have several data management initiatives in place and are making good progress. Or perhaps you’ve recently inherited the mess someone else made. In any case, does your right hand know what your left is up to? Several departmental data initiatives—even successful ones—do not equal a corporate data strategy. TDWI fosters a community of learning where business and technical professionals come together to gain knowledge and skills, network with peers, and advance their careers. Through education and research programs, TDWI enables individuals, teams, and organizations to leverage information to improve decision making, optimize performance, and achieve business objectives.<br />
Website: <a href="http://events.tdwi.org/events/las-vegas-world-conference-2011/home.aspx?utm_source=AttendeeMktg&amp;utm_medium=E-Mail&amp;utm_campaign=lv20g">http://events.tdwi.org/events/las-vegas-world-conference-2011/home.aspx?utm_source=AttendeeMktg&amp;utm_medium=E-Mail&amp;utm_campaign=lv20g</a></p>
<p>The growing popularity of data mining, and the need for it in the years ahead, has led to many more conferences being organized around the world, including conferences on more specialized data mining topics. Many more conferences, a great deal of research, and new innovations and discoveries lie ahead.</p>
]]></content:encoded>
			<wfw:commentRss>http://dataminingtools.net/blog/2010/12/01/data-mining-conferences-2010-2011/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Online Data Mining Reference Book</title>
		<link>http://dataminingtools.net/blog/2010/12/01/online-data-mining-reference-book/</link>
		<comments>http://dataminingtools.net/blog/2010/12/01/online-data-mining-reference-book/#comments</comments>
		<pubDate>Wed, 01 Dec 2010 19:54:09 +0000</pubDate>
		<dc:creator>Vikramaditya Jakkula</dc:creator>
				<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[Education]]></category>
		<category><![CDATA[Training]]></category>
		<category><![CDATA[Tutorials]]></category>

		<guid isPermaLink="false">http://dataminingtools.net/blog/?p=655</guid>
		<description><![CDATA[
The online book was created by the Data Mining group, which was established in November 2000 by Dr. Saed Sayad in collaboration with Professor Stephen T. Balke in the Department of Chemical Engineering and Applied Chemistry at the University of Toronto.
You can visit the group here:
http://chem-eng.utoronto.ca/~datamining/
You can access the book here:
http://chem-eng.utoronto.ca/~datamining/dmc/data_mining_map.htm
Features:

Quick Reference
Easy to memorize
Neat layout with [...]]]></description>
			<content:encoded><![CDATA[<p><a rel="attachment wp-att-656" href="http://dataminingtools.net/blog/2010/12/01/online-data-mining-reference-book/dmbook/"><img class="aligncenter size-medium wp-image-656" title="dmbook" src="http://dataminingtools.net/blog/wp-content/uploads/2010/12/dmbook-300x168.png" alt="" width="300" height="168" /></a></p>
<p>The online book was created by the Data Mining group, which was established in November 2000 by Dr. Saed Sayad in collaboration with Professor Stephen T. Balke in the Department of Chemical Engineering and Applied Chemistry at the University of Toronto.</p>
<p>You can visit the group here:</p>
<p><a href="http://chem-eng.utoronto.ca/~datamining/">http://chem-eng.utoronto.ca/~datamining/</a></p>
<p>You can access the book here:</p>
<p><a href="http://chem-eng.utoronto.ca/~datamining/dmc/data_mining_map.htm">http://chem-eng.utoronto.ca/~datamining/dmc/data_mining_map.htm</a></p>
<p>Features:</p>
<ul>
<li>Quick Reference</li>
<li>Easy to memorize</li>
<li>Neat layout with good colored illustrations</li>
<li>Easy navigation</li>
<li>and more.</li>
</ul>
<p>Happy Reading!</p>
]]></content:encoded>
			<wfw:commentRss>http://dataminingtools.net/blog/2010/12/01/online-data-mining-reference-book/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Space&#8230;Go data mining go!</title>
		<link>http://dataminingtools.net/blog/2010/10/17/space-go-data-mining-go/</link>
		<comments>http://dataminingtools.net/blog/2010/10/17/space-go-data-mining-go/#comments</comments>
		<pubDate>Sun, 17 Oct 2010 17:43:51 +0000</pubDate>
		<dc:creator>Vikramaditya Jakkula</dc:creator>
				<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[news]]></category>
		<category><![CDATA[Space]]></category>

		<guid isPermaLink="false">http://dataminingtools.net/blog/?p=640</guid>
		<description><![CDATA[
With many stars being born every day, and many more dying and disappearing, space is a massive challenge to deal with. Today, thanks to digital photography, astronomers get to spend more time making sense of the sky than just gazing at it through a telescope every night. The best part, astronomers say, is the use of [...]]]></description>
			<content:encoded><![CDATA[<p><a rel="attachment wp-att-641" href="http://dataminingtools.net/blog/2010/10/17/space-go-data-mining-go/massive-star-2/"><img class="aligncenter size-medium wp-image-641" title="massive-star-2" src="http://dataminingtools.net/blog/wp-content/uploads/2010/10/massive-star-2-300x176.jpg" alt="" width="300" height="176" /></a></p>
<p>With many stars being born every day, and many more dying and disappearing, space is a massive challenge to deal with. Today, thanks to digital photography, astronomers get to spend more time making sense of the sky than just gazing at it through a telescope every night. The best part, astronomers say, is the use of data mining and statistical techniques, which makes their job a lot easier.</p>
<p>Some key facts:</p>
<ul>
<li>The key goal is to learn about the changing sky faster and more efficiently.</li>
<li>Typically, 1.5 million new observations are recorded every night.</li>
<li>Machine learning algorithms that recognize different galaxy types, from spiral to elliptical, are improving and making the process more efficient.</li>
<li>It is also important to note that more data means a larger problem space.</li>
</ul>
<p>[Read more at <a href="http://www.space.com/scienceastronomy/astronomy-data-mining-shifts-focus-from-stargazing-101011.html" target="_blank">Space</a>]</p>
]]></content:encoded>
			<wfw:commentRss>http://dataminingtools.net/blog/2010/10/17/space-go-data-mining-go/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Data Mining Trends</title>
		<link>http://dataminingtools.net/blog/2010/09/21/data-mining-trends/</link>
		<comments>http://dataminingtools.net/blog/2010/09/21/data-mining-trends/#comments</comments>
		<pubDate>Tue, 21 Sep 2010 19:53:32 +0000</pubDate>
		<dc:creator>Vikramaditya Jakkula</dc:creator>
				<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[news]]></category>
		<category><![CDATA[Machine Learning]]></category>
		<category><![CDATA[Tools]]></category>

		<guid isPermaLink="false">http://dataminingtools.net/blog/?p=626</guid>
		<description><![CDATA[Google Trends statistics for the search volume of data mining are as follows:
http://www.google.com/trends?q=Data+mining&#38;ctab=0&#38;geo=all&#38;date=all

Topics of Interest:
A:  Business Intelligence and Data Mining
B: Data mining tells government and business a lot about you
C: Data mining is commonly used in business to find patterns
D: &#8216;Data mining&#8217; may implicate innocent people in search for terrorists
E: &#8216;Data mining&#8217; for drug [...]]]></description>
			<content:encoded><![CDATA[<p>Google Trends statistics for the search volume of data mining are as follows:</p>
<p><a href="http://www.google.com/trends?q=Data+mining&amp;ctab=0&amp;geo=all&amp;date=all">http://www.google.com/trends?q=Data+mining&amp;ctab=0&amp;geo=all&amp;date=all</a></p>
<p><a rel="attachment wp-att-628" href="http://dataminingtools.net/blog/2010/09/21/data-mining-trends/googletrends/"><img class="aligncenter size-medium wp-image-628" title="googletrends" src="http://dataminingtools.net/blog/wp-content/uploads/2010/09/googletrends-300x134.png" alt="" width="300" height="134" /></a></p>
<p>Topics of Interest:</p>
<p>A:  <a href="http://www.b-eye-network.com/info/index.php?jsessionid=669c6ea94a9cbd83822a523c62864037">Business Intelligence and Data Mining</a><br />
B: <a href="http://www.myrtlebeachonline.com/mld/myrtlebeachonline/news/politics/13766572.htm">Data mining tells government and business a lot about you</a><br />
C: <a href="http://www.sitnews.us/0506news/051206/051206_shns_phonedata.html">Data mining is commonly used in business to find patterns</a><br />
D: <a href="http://www.mercurynews.com/mld/mercurynews/news/politics/16421512.htm">&#8216;Data mining&#8217; may implicate innocent people in search for terrorists</a><br />
E: <a href="http://www.seattlepi.com/national/1110ap_us_drug_data_mining.html">&#8216;Data mining&#8217; for drug companies goes to courts</a><br />
F: <a href="http://www.seattlepi.com/business/1310ap_us_ims_health_mover.html">IMS Health stock falls, as data mining ban pitched</a></p>
<table border="0" cellspacing="0" cellpadding="0" width="450">
<tbody>
<tr>
<td>Ranking of countries (Asia) by interest in data mining:</td>
</tr>
<tr>
<td>
<table border="1" cellspacing="0" cellpadding="0" width="450">
<tbody>
<tr>
<td width="278" valign="top"><strong><span style="text-decoration: underline;">Country</span></strong></td>
<td width="278" valign="top"><strong><span style="text-decoration: underline;">Ranking</span></strong></td>
</tr>
<tr>
<td width="278" valign="top">India</td>
<td width="278" valign="top">1</td>
</tr>
<tr>
<td width="278" valign="top">Pakistan</td>
<td width="278" valign="top">2</td>
</tr>
<tr>
<td width="278" valign="top">Taiwan</td>
<td width="278" valign="top">3</td>
</tr>
<tr>
<td width="278" valign="top">Hong Kong</td>
<td width="278" valign="top">4</td>
</tr>
<tr>
<td width="278" valign="top">Iran</td>
<td width="278" valign="top">5</td>
</tr>
<tr>
<td width="278" valign="top">Indonesia</td>
<td width="278" valign="top">6</td>
</tr>
<tr>
<td width="278" valign="top">Singapore</td>
<td width="278" valign="top">7</td>
</tr>
<tr>
<td width="278" valign="top">South Korea</td>
<td width="278" valign="top">8</td>
</tr>
<tr>
<td width="278" valign="top">Malaysia</td>
<td width="278" valign="top">9</td>
</tr>
<tr>
<td width="278" valign="top">Thailand</td>
<td width="278" valign="top">10</td>
</tr>
</tbody>
</table>
</td>
</tr>
</tbody>
</table>
<p><a rel="attachment wp-att-629" href="http://dataminingtools.net/blog/2010/09/21/data-mining-trends/country/"><img class="aligncenter size-medium wp-image-629" title="country" src="http://dataminingtools.net/blog/wp-content/uploads/2010/09/country-300x141.png" alt="" width="300" height="141" /></a></p>
<p>A 2010 survey of the data mining tools in use showed the following levels of interest among users:</p>
<p>The poll was conducted by KDnuggets (<a href="http://www.kdnuggets.com/polls/2010/data-mining-analytics-tools.html">http://www.kdnuggets.com/polls/2010/data-mining-analytics-tools.html</a>). About 900 data miners voted, and each could vote for more than one tool.</p>
<table border="1" cellspacing="0" cellpadding="0" width="100%">
<tbody>
<tr>
<td valign="top"><strong>Tool (votes)</strong></td>
<td valign="top"><strong>Share of voters</strong></td>
</tr>
<tr>
<td valign="top">RapidMiner (345)</td>
<td valign="top">37.8%</td>
</tr>
<tr>
<td valign="top">R (272)</td>
<td valign="top">29.8%</td>
</tr>
<tr>
<td valign="top">Excel (222)</td>
<td valign="top">24.3%</td>
</tr>
<tr>
<td valign="top">KNIME (175)</td>
<td valign="top">19.2%</td>
</tr>
<tr>
<td valign="top">Your own code (168)</td>
<td valign="top">18.4%</td>
</tr>
<tr>
<td valign="top">Pentaho/Weka (131)</td>
<td valign="top">14.3%</td>
</tr>
<tr>
<td valign="top">SAS (110)</td>
<td valign="top">12.0%</td>
</tr>
<tr>
<td valign="top">MATLAB (84)</td>
<td valign="top">9.2%</td>
</tr>
<tr>
<td valign="top">IBM SPSS Statistics (72)</td>
<td valign="top">7.9%</td>
</tr>
<tr>
<td valign="top">Other free tools (67)</td>
<td valign="top">7.3%</td>
</tr>
<tr>
<td valign="top">IBM SPSS Modeler (formerly Clementine) (67)</td>
<td valign="top">7.3%</td>
</tr>
<tr>
<td valign="top">Microsoft SQL Server (63)</td>
<td valign="top">6.9%</td>
</tr>
<tr>
<td valign="top">Statsoft Statistica (57)</td>
<td valign="top">6.2%</td>
</tr>
<tr>
<td valign="top">Other commercial tools (56)</td>
<td valign="top">6.1%</td>
</tr>
<tr>
<td valign="top">SAS Enterprise Miner (50)</td>
<td valign="top">5.5%</td>
</tr>
<tr>
<td valign="top">Zementis (34)</td>
<td valign="top">3.7%</td>
</tr>
<tr>
<td valign="top">Orange (25)</td>
<td valign="top">2.7%</td>
</tr>
<tr>
<td valign="top">Oracle DM (19)</td>
<td valign="top">2.1%</td>
</tr>
<tr>
<td valign="top">KXEN (19)</td>
<td valign="top">2.1%</td>
</tr>
<tr>
<td valign="top">Salford CART Mars other (15)</td>
<td valign="top">1.6%</td>
</tr>
<tr>
<td valign="top">VisuaLinks (12)</td>
<td valign="top">1.3%</td>
</tr>
<tr>
<td valign="top">Viscovery (10)</td>
<td valign="top">1.1%</td>
</tr>
<tr>
<td valign="top">Angoss (8)</td>
<td valign="top">0.9%</td>
</tr>
<tr>
<td valign="top">TIBCO Insightful Miner (7)</td>
<td valign="top">0.8%</td>
</tr>
<tr>
<td valign="top">Miner3D (7)</td>
<td valign="top">0.8%</td>
</tr>
<tr>
<td valign="top">REvolution Computing (4)</td>
<td valign="top">0.4%</td>
</tr>
<tr>
<td valign="top">Megaputer PolyAnalyst/TextAnalyst (3)</td>
<td valign="top">0.3%</td>
</tr>
<tr>
<td valign="top">Portrait Software (2)</td>
<td valign="top">0.2%</td>
</tr>
<tr>
<td valign="top">Data Applied (2)</td>
<td valign="top">0.2%</td>
</tr>
<tr>
<td valign="top">Centrifuge (2)</td>
<td valign="top">0.2%</td>
</tr>
<tr>
<td valign="top">PRSD Studio (1)</td>
<td valign="top">0.1%</td>
</tr>
<tr>
<td valign="top">Clario Analytics (1)</td>
<td valign="top">0.1%</td>
</tr>
<tr>
<td valign="top">Bayesia (1)</td>
<td valign="top">0.1%</td>
</tr>
</tbody>
</table>
<p>Open source data mining tools:<br />
Open source data mining is also on the rise. Weka, Orange, Rattle and RapidMiner are a few of the open source tools available. Recent trends in the use of data mining software also favor open source in a big way. The following analysis by KDnuggets shows the type of software chosen by users in various countries.</p>
<p><a rel="attachment wp-att-630" href="http://dataminingtools.net/blog/2010/09/21/data-mining-trends/trends/"><img class="aligncenter size-medium wp-image-630" title="trends" src="http://dataminingtools.net/blog/wp-content/uploads/2010/09/trends-300x156.png" alt="" width="300" height="156" /></a></p>
<p>Manu C, Student Content Intern.</p>
]]></content:encoded>
			<wfw:commentRss>http://dataminingtools.net/blog/2010/09/21/data-mining-trends/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>RCOMM 2010: RapidMiner Conference</title>
		<link>http://dataminingtools.net/blog/2010/07/16/rcomm-2010-rapid-miner-conference/</link>
		<comments>http://dataminingtools.net/blog/2010/07/16/rcomm-2010-rapid-miner-conference/#comments</comments>
		<pubDate>Sat, 17 Jul 2010 03:14:24 +0000</pubDate>
		<dc:creator>Vikramaditya Jakkula</dc:creator>
				<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[news]]></category>
		<category><![CDATA[Conference]]></category>

		<guid isPermaLink="false">http://dataminingtools.net/blog/?p=601</guid>
		<description><![CDATA[RapidMiner Community Meeting And Conference &#8211; RCOMM 2010
As RapidMiner has once again proved to be the most-used open source data mining tool among the community of data analysts world-wide in a recent poll, it is now the time to give a face to that community. Therefore, Rapid-I hosts the first RapidMiner Community Meeting And Conference (RCOMM [...]]]></description>
			<content:encoded><![CDATA[<p>RapidMiner Community Meeting And Conference &#8211; RCOMM 2010</p>
<p>As a recent poll once again showed RapidMiner to be the most-used open source data mining tool among data analysts worldwide, it is now time to give that community a face. Rapid-I is therefore hosting the first <strong>RapidMiner Community Meeting And Conference (RCOMM 2010)</strong> and invites users and developers of RapidMiner to take part and share their RapidMiner experiences with other members of the community. <strong>RCOMM 2010</strong> aims to strengthen the RapidMiner network by bringing together users and developers from all backgrounds, scientific or commercial, across the whole range of applications and levels of expertise. A lively exchange of ideas, application reports, and scientific results will help beginners advance and will inspire experienced users toward professional practice. Users will benefit from the in-depth knowledge of developers, who in turn will pick up requirements and ideas for further development.</p>
<p><strong>RCOMM 2010</strong> includes conference talks in which invited speakers will discuss aspects of state-of-the-art data mining with RapidMiner. A call for papers will be issued for those who would like to present their work in that area. Workshops will give participants hands-on experience with several aspects of RapidMiner usage. Attendees of <strong>RCOMM 2010</strong> will also have the option to take courses given by professional RapidMiner consultants around the time of the user meeting.</p>
<div id="_mcePaste">Dates &amp; Deadlines:</div>
<div>Submission Deadline: August 6, 2010</div>
<div id="_mcePaste">Notification of Acceptance: August 13, 2010</div>
<div id="_mcePaste">Camera-ready Papers: August 20, 2010</div>
<div id="_mcePaste">Conference: September 13 &#8211; 16, 2010</div>
<p>Visit Conference Home: <a href="http://rapid-i.com/rcomm/index.php?option=com_frontpage&amp;Itemid=28" target="_blank">RCOMM</a><br />
Registration Link: <a href="http://rapid-i.com/rcomm/index.php?option=com_content&amp;task=view&amp;id=20&amp;Itemid=34" target="_blank">Registration</a><br />
Schedule Link: <a href="http://rapid-i.com/rcomm/index.php?option=com_content&amp;task=view&amp;id=14&amp;Itemid=29" target="_blank">Schedule</a></p>
]]></content:encoded>
			<wfw:commentRss>http://dataminingtools.net/blog/2010/07/16/rcomm-2010-rapid-miner-conference/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Watson! Dear Watson!</title>
		<link>http://dataminingtools.net/blog/2010/06/21/watson-dear-watson/</link>
		<comments>http://dataminingtools.net/blog/2010/06/21/watson-dear-watson/#comments</comments>
		<pubDate>Tue, 22 Jun 2010 05:06:37 +0000</pubDate>
		<dc:creator>Vikramaditya Jakkula</dc:creator>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[Machine Learning]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">http://dataminingtools.net/blog/?p=589</guid>
		<description><![CDATA[For the last few years, IBM scientists have been developing the most advanced &#8220;question answering&#8221; machine, one that can understand a question posed by the user and is expected to respond with a precise answer. In other words, it must do more than search engines like Google and Bing, which merely point to [...]]]></description>
			<content:encoded><![CDATA[<p>For the last few years, IBM scientists have been developing the most advanced &#8220;question answering&#8221; machine, one that can understand a question posed by the user and is expected to respond with a precise answer. In other words, it must do more than search engines like Google and Bing, which merely point to a set of results where you might find the answer. The machine, named Watson, has to give the correct answer itself. Let&#8217;s look at Watson in a trivia challenge:</p>
<p><object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="450" height="405" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="allowscriptaccess" value="always" /><param name="src" value="http://www.youtube.com/v/FC3IryWr4c8&amp;hl=en_US&amp;fs=1&amp;rel=0&amp;color1=0xe1600f&amp;color2=0xfebd01&amp;border=1" /><param name="allowfullscreen" value="true" /><embed type="application/x-shockwave-flash" width="450" height="405" src="http://www.youtube.com/v/FC3IryWr4c8&amp;hl=en_US&amp;fs=1&amp;rel=0&amp;color1=0xe1600f&amp;color2=0xfebd01&amp;border=1" allowscriptaccess="always" allowfullscreen="true"></embed></object></p>
<p>[ <a href="http://cosmiclog.msnbc.msn.com/_news/2010/06/17/4524266-supercomputer-plays-jeopardy" target="_blank">MSNBC</a> ]</p>
]]></content:encoded>
			<wfw:commentRss>http://dataminingtools.net/blog/2010/06/21/watson-dear-watson/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

