<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title> &#187; Events</title>
	<atom:link href="http://dataminingtools.net/blog/tag/events/feed/" rel="self" type="application/rss+xml" />
	<link>http://dataminingtools.net/blog</link>
	<description></description>
	<lastBuildDate>Mon, 25 Jul 2011 08:51:53 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>RCOMM 2010: Rapid Miner Conference</title>
		<link>http://dataminingtools.net/blog/2010/07/16/rcomm-2010-rapid-miner-conference/</link>
		<comments>http://dataminingtools.net/blog/2010/07/16/rcomm-2010-rapid-miner-conference/#comments</comments>
		<pubDate>Sat, 17 Jul 2010 03:14:24 +0000</pubDate>
		<dc:creator>Vikramaditya Jakkula</dc:creator>
				<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[news]]></category>
		<category><![CDATA[Conference]]></category>

		<guid isPermaLink="false">http://dataminingtools.net/blog/?p=601</guid>
		<description><![CDATA[RapidMiner Community Meeting And Conference &#8211; RCOMM 2010
As RapidMiner has once again proved to be the most-used open source data mining tool among the community of data analysts world-wide in a recent poll, it is now the time to give a face to that community. Therefore, Rapid-I hosts the first RapidMiner Community Meeting And Conference (RCOMM [...]]]></description>
			<content:encoded><![CDATA[<p>RapidMiner Community Meeting And Conference &#8211; RCOMM 2010</p>
<p>As RapidMiner has once again proved to be the most-used open source data mining tool among the community of data analysts world-wide in a recent poll, it is now the time to give a face to that community. Therefore, Rapid-I hosts the first <strong>RapidMiner Community Meeting And Conference (RCOMM 2010)</strong> and invites users and developers of RapidMiner to take part and share their RapidMiner experiences with other members of the community. The <strong>RCOMM 2010 </strong>intends to intensify the community life and strengthen the RapidMiner network by bringing together users and developers of RapidMiner from all backgrounds, may they be scientific or commercial, from the whole variety of applications and from all grades of knowledge. A vital exchange of ideas, application reports, and scientific results will help beginners to advance and will inspire the already advanced leading them to professionalism. Users will profit from in-depth knowledge of developers, who in turn will gain from picking up requirements and ideas for further development.</p>
<p>The <strong>RCOMM 2010 </strong>encompasses conference talks, in which invited lecturers will discuss aspects of state-of-the-art data mining with RapidMiner. A Call-for-Papers will be issued for those who would like to present their work in that scope. Workshops will be held to give participants a hands-on experience concerning several topics regarding RapidMiner usage. Additionally, attendees of the <strong>RCOMM 2010</strong> will also have the option to participate in several courses given by professional RapidMiner consultants in the surrounding of the user meeting.</p>
<div id="_mcePaste">Dates &amp; Deadlines:</div>
<div>Submission Deadline:August 6, 2010</div>
<div id="_mcePaste">Notification of Acceptance:August 13, 2010</div>
<div id="_mcePaste">Camera-ready Papers:August 20, 2010</div>
<div id="_mcePaste">Conference: September 13 &#8211; 16, 2010</div>
<p>Visit Conference Home: <a href="http://rapid-i.com/rcomm/index.php?option=com_frontpage&amp;Itemid=28" target="_blank">RCOMM</a><br />
Registration Link: <a href="http://rapid-i.com/rcomm/index.php?option=com_content&amp;task=view&amp;id=20&amp;Itemid=34" target="_blank">Registration</a><br />
Schedule Link: <a href="http://rapid-i.com/rcomm/index.php?option=com_content&amp;task=view&amp;id=14&amp;Itemid=29" target="_blank">Schedule</a></p>
<!-- Easy AdSenser V2.40 -->
<!-- Post[count: 2] -->
<div style="float:left;margin:12px;" ><script type="text/javascript"><!--
google_ad_client = "pub-8731508784204217";
/* 120x90, created 5/22/09 */
google_ad_slot = "1258512664";
google_ad_width = 120;
google_ad_height = 90;
//-->
</script>
<script type="text/javascript"
src="http://pagead2.googlesyndication.com/pagead/show_ads.js">
</script></div><p><a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fdataminingtools.net%2Fblog%2F2010%2F07%2F16%2Frcomm-2010-rapid-miner-conference%2F&amp;linkname=RCOMM%202010%3A%20Rapid%20Miner%20Conference"><img src="http://dataminingtools.net/blog/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a></p>]]></content:encoded>
			<wfw:commentRss>http://dataminingtools.net/blog/2010/07/16/rcomm-2010-rapid-miner-conference/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>That&#8217;s one small step for robot, one giant leap for robotkind</title>
		<link>http://dataminingtools.net/blog/2010/01/03/thats-one-small-step-for-robot-one-giant-leap-for-robotkind/</link>
		<comments>http://dataminingtools.net/blog/2010/01/03/thats-one-small-step-for-robot-one-giant-leap-for-robotkind/#comments</comments>
		<pubDate>Sun, 03 Jan 2010 14:21:37 +0000</pubDate>
		<dc:creator>Vikramaditya Jakkula</dc:creator>
				<category><![CDATA[Events]]></category>
		<category><![CDATA[Machine Learning]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Review]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[Tools]]></category>
		<category><![CDATA[news]]></category>
		<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Market Research]]></category>

		<guid isPermaLink="false">http://dataminingtools.net/blog/?p=309</guid>
		<description><![CDATA[International Robot Exhibition 2009 has finished with a great success, this event was held during November 25, 2009 to November 28, 2009 at Tokyo International Exhibition Center in Tokyo,Japan. International Robot Exhibition 2009 show is designed to provide a place to exhibit robots and related equipments in order to enhance market awareness of new technology. [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.nikkan.co.jp/eve/irex/english/index.html" target="_blank">International Robot Exhibition 2009 </a>has finished with a great success, this event was held during November 25, 2009 to November 28, 2009 at Tokyo International Exhibition Center in Tokyo,Japan. International Robot Exhibition 2009 show is designed to provide a place to exhibit robots and related equipments in order to enhance market awareness of new technology. At the same time, the show is to be a medium to promote new products and to develop new business through contributing the promotion of new technology. Some of the highlights have been illustrated below.</p>
<p>This is a clear indication of what we can expect in the near future.(Oh boy! Not another American Robot idol, The Amazing robot race, America&#8217;s next top robot model, or robot factor)</p>
<p>Read more about the event <a href="http://www.nikkan.co.jp/eve/irex/english/index.html" target="_blank">here</a> and for sure visit the compiled photo <a href="http://www.guardian.co.uk/technology/gallery/2009/dec/02/robots-japan?picture=356334757" target="_blank">gallery1</a> and <a href="http://pinktentacle.com/2009/11/photos-international-robot-exhibition-2009/" target="_blank">gallery2</a> for more pictures.</p>
<div id="attachment_311" class="wp-caption alignleft" style="width: 380px"><img class="size-full wp-image-311" title="CyberGlove-at-Internation-007" src="http://dataminingtools.net/blog/wp-content/uploads/2010/01/CyberGlove-at-Internation-007.jpg" alt="The robot hand is capable of 24 movements and can be remote-operated with the CyberGlove Photograph: Kim Kyung-hoon/Reuters" width="370" height="500" /><p class="wp-caption-text">The robot hand is capable of 24 movements and can be remote-operated with the CyberGlove Photograph: Kim Kyung-hoon/Reuters</p></div>
<div id="attachment_312" class="wp-caption alignleft" style="width: 460px"><img class="size-full wp-image-312 " title="Humanoid-industrial-robot-008" src="http://dataminingtools.net/blog/wp-content/uploads/2010/01/Humanoid-industrial-robot-008.jpg" alt="Humanoid industrial robot 'Motoman-SDA5D', developed by the Yaskawa Electric Corporation, demonstrates its capabilities with Lego Photograph: Dai Kurokawa/EPA" width="450" height="390" /><p class="wp-caption-text">Humanoid industrial robot &#39;Motoman-SDA5D&#39;, developed by the Yaskawa Electric Corporation, demonstrates its capabilities with Lego Photograph: Dai Kurokawa/EPA</p></div>
<div id="attachment_313" class="wp-caption alignleft" style="width: 448px"><img class="size-full wp-image-313" title="A-humanoid-robot-hip-hop--012" src="http://dataminingtools.net/blog/wp-content/uploads/2010/01/A-humanoid-robot-hip-hop-012.jpg" alt="A humanoid robot 'Manoi AT01', produced by Japan's toy robot maker Kyosho, performs a hip-hop dance Photograph: Yoshikazu Tsuno/AFP/Getty Images" width="438" height="390" /><p class="wp-caption-text">A humanoid robot &#39;Manoi AT01&#39;, produced by Japan&#39;s toy robot maker Kyosho, performs a hip-hop dance Photograph: Yoshikazu Tsuno/AFP/Getty Images</p></div>
<p><a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fdataminingtools.net%2Fblog%2F2010%2F01%2F03%2Fthats-one-small-step-for-robot-one-giant-leap-for-robotkind%2F&amp;linkname=That%26%238217%3Bs%20one%20small%20step%20for%20robot%2C%20one%20giant%20leap%20for%20robotkind"><img src="http://dataminingtools.net/blog/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a></p>]]></content:encoded>
			<wfw:commentRss>http://dataminingtools.net/blog/2010/01/03/thats-one-small-step-for-robot-one-giant-leap-for-robotkind/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The datamining journey so far ..</title>
		<link>http://dataminingtools.net/blog/2009/12/31/the-datamining-journey-so-far/</link>
		<comments>http://dataminingtools.net/blog/2009/12/31/the-datamining-journey-so-far/#comments</comments>
		<pubDate>Thu, 31 Dec 2009 12:09:41 +0000</pubDate>
		<dc:creator></dc:creator>
				<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Review]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[news]]></category>
		<category><![CDATA[Market Research]]></category>
		<category><![CDATA[Tools]]></category>

		<guid isPermaLink="false">http://dataminingtools.net/blog/?p=281</guid>
		<description><![CDATA[This new year, let us go through all the major developments that have taken place in the Data Mining industry over the years. Here is a quick glimpse:

A description:



1993


Development   of WEKA begins: 

In 1993, the University of Waikato in New   Zealand started development of the original version of Weka.  Weka (Waikato [...]]]></description>
			<content:encoded><![CDATA[<p>This new year, let us go through all the major developments that have taken place in the Data Mining industry over the years. Here is a quick glimpse:</p>
<p><img class="alignleft size-full wp-image-303" title="datamining journey so far" src="http://dataminingtools.net/blog/wp-content/uploads/2009/12/final2-with-logo.png" alt="datamining journey so far" width="450" height="1157" /></p>
<p>A description:</p>
<table border="1" cellspacing="0" cellpadding="0" width="450">
<tbody>
<tr>
<td width="45" valign="top"><strong>1993</strong></td>
<td width="520" valign="top">
<ul>
<li><strong>Development   of WEKA begins: </strong>
<ul>
<li>In 1993, the University of Waikato in New   Zealand started development of the original version of Weka.  Weka (Waikato Environment for Knowledge   Analysis) is a popular suite of machine learning software written in Java,   developed at the University of Waikato. WEKA is free software available under   the GNU General Public License.</li>
</ul>
</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top"><strong>1996</strong></td>
<td width="520" valign="top">
<ul>
<li><strong>CRISP-DM   is conceived</strong>
<ul>
<li>CRISP-DM stands for CRoss Industry Standard   Process for Data Mining. It is a data mining process model that describes   commonly used approaches that expert data miners use to tackle problems.   Polls conducted later in 2002, 2004, and 2007 show that it is the leading   methodology used by data miners</li>
</ul>
</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top"><strong>1998</strong></td>
<td width="520" valign="top">
<ul>
<li><strong>KXEN  established</strong>
<ul>
<li>Founded in 1998, KXEN has corporate offices   in San Francisco, California and Paris, France, with Fortune 1000 customers   around the world.</li>
</ul>
</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top"><strong>1999</strong></td>
<td width="520" valign="top">
<ul>
<li><strong>CRISP-DM   1.0 released</strong>
<ul>
<li>After it was conceived in 1996, in 1997   CRISP-DM got underway as a European Union project under the ESPRIT funding   initiative. The project was led by four companies: ISL, NCR   Corporation,Daimler-Benz and OHRA. The first version of the methodology was   released as CRISP-DM 1.0 in 1999.<strong> </strong></li>
</ul>
</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top"><strong>2000</strong></td>
<td width="520" valign="top">
<ul>
<li><strong>The &#8216;R&#8217;   Project considered stable for production</strong>
<ul>
<li>R is an implementation of the S programming   language with lexical scoping semantics inspired by Scheme. R was created by   Ross Ihaka and Robert Gentleman at the University of Auckland, New Zealand,   and is now developed by the R Development Core Team.</li>
</ul>
</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top"><strong>2003</strong></td>
<td width="520" valign="top">
<ul>
<li><strong>Appricon   established</strong>
<ul>
<li>In   order to provide a better data mining solution, Analysis Studio® and the   Analysis Studio® end-to-end logistic regression modeling solution were weaved   into enterprise data mining projects in 2003.<strong> </strong></li>
<li><strong>SAS   9.1 was released in 2003</strong><strong> </strong></li>
</ul>
</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top"><strong>2004</strong></td>
<td width="520" valign="top">
<ul>
<li><strong>Rapidminer distributed with GNU   license</strong><strong> </strong>
<ul>
<li>The initial version has been developed by the   Artificial Intelligence Unit of <a title="Dortmund University of Technology" href="http://en.wikipedia.org/wiki/Dortmund_University_of_Technology">University of Dortmund</a> since   2001. It is distributed under a <a title="GNU" href="http://en.wikipedia.org/wiki/GNU">GNU</a> license, and   has been hosted by <a title="SourceForge" href="http://en.wikipedia.org/wiki/SourceForge">SourceForge</a>since 2004.</li>
<li><strong>SAS   9.1.2 was released in 2004.</strong></li>
</ul>
</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top"><strong>2005</strong></td>
<td width="520" valign="top">
<ul>
<li><strong>Amazon   launches Mechanical Turk</strong>
<ul>
<li>The service was launched publicly on November   2, 2005. In early- to mid-November 2005, there were tens of thousands of   HITs, all of them uploaded to the system by Amazon itself for some of its   internal tasks that required human intelligence. Most of these were related   to music CD items.</li>
<li>The number of Amazon&#8217;s Mechanical Turk HITs in   the system soon decreased after its launch in november, and by December 20,   there were less than 100 groups of HITs on the average page load</li>
<li><strong>Weka   receives the SIGKDD Data Mining and Knowledge Discovery Service Award</strong></li>
<li><strong>SAS   9.1.3 was released in 2005.</strong></li>
</ul>
</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top"><strong>2006</strong></td>
<td width="520" valign="top">
<ul>
<li><strong>Work on   CRISP-DM 2.0 begins</strong>
<ul>
<li>In July 2006 the consortium of CRISP-DM   announced that it was going to start the process of working towards a second   version of CRISP-DM. On 26 September 2006, the CRISP-DM SIG met to discuss potential   enhancements for CRISP-DM 2.0 and the subsequent roadmap.</li>
<li><strong>Pentaho   acquires exclusive …..</strong>
<ul>
<li>In 2006, Pentaho Corporation acquired an   exclusive license to use Weka for business intelligence. It forms the data   mining and predictive analytics component of the Pentaho business   intelligence suite.</li>
</ul>
</li>
</ul>
</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top"><strong>2008</strong></td>
<td width="520" valign="top">
<ul>
<li><strong>COGNOS   acquired by IBM</strong>
<ul>
<li>Cognos (Cognos Incorporated) was an Ottawa,   Ontario based company making business intelligence (BI) and performance   management software. On January 31, 2008, Cognos was officially acquired by   IBM. The Cognos name continues to be used, being applied to IBM&#8217;s line of   business intelligence (BI) and performance management products.</li>
<li><strong>SAS 9.2   is the latest release (March 2008</strong>) and was demonstrated at SAS Global   Forum (previously called SUGI) 2008.</li>
</ul>
</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top"><strong>2009</strong></td>
<td width="520" valign="top">
<ul>
<li><strong>PASW/   SPSS</strong>
<ul>
<li>PASW (formerly SPSS) is a computer program   used for statistical analysis. Before 2009 it was called SPSS, but in 2009 it   was re-branded as PASW (Predictive Analytics Software). The company announced   July 28, 2009 that it was being acquired by IBM for US$1.2 billion.</li>
</ul>
</li>
</ul>
</td>
</tr>
</tbody>
</table>
<p>Microsoft:</p>
<table border="1" cellspacing="0" cellpadding="0" width="450">
<tbody>
<tr>
<td width="54" valign="top">1996</td>
<td width="539" valign="top">
<ul>
<li>Microsoft   opens new team to build an OLAP product, codenamed Plato (permutation of   letters from OLAP)</li>
<li>Panorama   Software delegation meets with Microsoft</li>
<li>Microsoft   announces acquisition of Panorama Software development team</li>
</ul>
</td>
</tr>
<tr>
<td width="54" valign="top">1997</td>
<td width="539" valign="top">
<ul>
<li>OLAP   Services 7.0 (codename Sphinx) ships</li>
</ul>
</td>
</tr>
<tr>
<td width="54" valign="top">2000</td>
<td width="539" valign="top">
<ul>
<li><strong>Analysis Services 2000</strong> (codename Shiloh) ships
<ul>
<li>Microsoft Analysis Services is part of   Microsoft SQL Server, a database management system. Microsofthas included a   number of services in SQL Server related to Business Intelligence and Data   Warehousing. These services include Integration Services and Analysis   Services. Analysis Services includes a group ofOLAP and Data Mining   capabilities.</li>
<li>Microsoft Corp. announces the <strong>beta release of the OLE DB for Data   Mining specification</strong>, a protocol based on the SQL language, that provides   software vendors and application developers with an open interface to more   efficiently integrate data mining tools and capabilities into   line-of-business and e-commerce applications.</li>
</ul>
</li>
</ul>
</td>
</tr>
<tr>
<td width="54" valign="top">2001</td>
<td width="539" valign="top">
<ul>
<li>XML for   Analysis SDK 1.0 ships</li>
</ul>
</td>
</tr>
<tr>
<td width="54" valign="top">2004</td>
<td width="539" valign="top">
<ul>
<li>ADOMD.NET   and XML for Analysis SDK 1.1 ship</li>
</ul>
</td>
</tr>
<tr>
<td width="54" valign="top">2005</td>
<td width="539" valign="top">
<ul>
<li>Analysis   Services 2005 (codename Yukon) ships</li>
</ul>
</td>
</tr>
<tr>
<td width="54" valign="top">2008</td>
<td width="539" valign="top">
<ul>
<li>Analysis   Services 2008 (codename Katmai) ships</li>
</ul>
</td>
</tr>
<tr>
<td width="54" valign="top">2009</td>
<td width="539" valign="top">
<ul>
<li>Microsoft has decided to make the BI Conference   into a biennial event, with the next conference in 2010. For 2009, we are   excited to team with the Professional Association for SQL Server (PASS) to   expand the BI tracks at PASS Summit 2009 and help deliver the content that BI   architects, developers, and administrators need to get the most value from   their Microsoft SQL Server and BI-based solutions.</li>
<li><strong>PowerPivot </strong>gives users the power to create compelling self-service BI solutions,   facilitates sharing and collaboration on user-generated BI solutions in a   Microsoft SharePoint Server 2010 environment, and enables IT organizations to   increase operational efficiencies through Microsoft SQL Server 2008 R2-based   management tools.</li>
</ul>
</td>
</tr>
</tbody>
</table>
<p>Amazon:</p>
<table border="1" cellspacing="0" cellpadding="0" width="450">
<tbody>
<tr>
<td width="54" valign="top">2003</td>
<td width="539" valign="top">
<ul>
<li>&#8220;Search Inside the Book&#8221; is a   feature which allows customers to search for keywords in the full text of   many books in the catalog. The feature started with 120,000 titles (or   33 million pages of text) on October 23, 2003. There are currently about   250,000 books in the program. Amazon has cooperated with around 130 publishers to   allow users to perform these searches.</li>
</ul>
</td>
</tr>
<tr>
<td width="54" valign="top">2005</td>
<td width="539" valign="top">
<ul>
<li>In November 2005, Amazon.com began   testing Amazon Mechanical Turk, an application programming   interface (API) allowing programs to dispatch tasks to human processors.</li>
</ul>
</td>
</tr>
<tr>
<td width="54" valign="top">2006</td>
<td width="539" valign="top">
<ul>
<li>Amazon launched an online storage service   called Amazon Simple Storage Service (Amazon S3). An unlimited   number of data objects, from 1 byte to 5 gigabytes in   size, can be stored in S3 and distributed via HTTP or <a title="BitTorrent (protocol)" href="http://en.wikipedia.org/wiki/BitTorrent_(protocol)">BitTorrent</a> .In April 2006, Amazon   introduced Amazon Simple Queue Service (Amazon SQS), a distributed   queue messaging service.</li>
</ul>
</td>
</tr>
<tr>
<td width="54" valign="top">2007</td>
<td width="539" valign="top">
<ul>
<li>In January 2007 Amazon launched <a title="Amapedia" href="http://en.wikipedia.org/wiki/Amapedia">Amapedia</a>, a   collaborative wiki for user-generated content to replace ProductWiki</li>
<li>In December 2007, Amazon introduced <a title="SimpleDB" href="http://en.wikipedia.org/wiki/SimpleDB">SimpleDB</a>, a   database system, allowing users of its other infrastructure to utilize a high   reliability high performance database system.</li>
</ul>
</td>
</tr>
<tr>
<td width="54" valign="top">2008</td>
<td width="539" valign="top">
<ul>
<li>Amazon   Web Services launched a public beta of Amazon Elastic Compute Cloud running   Microsoft Windows Server and Microsoft SQL Server.</li>
</ul>
</td>
</tr>
</tbody>
</table>
<p>Yahoo!:</p>
<table border="1" cellspacing="0" cellpadding="0" width="450">
<tbody>
<tr>
<td width="45" valign="top">2002</td>
<td width="539" valign="top">
<ul>
<li><strong>Yahoo!   HotJobs</strong>, previously known as hotjobs.com, is an online job search engine.   It has been known as Yahoo! HotJobs since being acquired by Yahoo! in 2002.   Yahoo! HotJobs provides tools and advice for job seekers, employers, and   staffing firms.</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2003</td>
<td width="539" valign="top">
<ul>
<li>Yahoo! Introduces Smartsort Technology:   Personalized Product Recommendation Tool
<ul>
<li>The new Yahoo! Product Search powers the   redesigned Yahoo! Shopping, providing consumers with the most comprehensive   and relevant comparison-shopping site on the Web. The redesigned Yahoo!   Shopping now boasts a variety of comparison-shopping features including:   side-by-side product comparison, detailed buyer&#8217;s guides, tax and shipping   calculator tool, consumer product and merchant ratings, unbiased expert   product reviews etc. Yahoo! Shopping is the third largest multi-category   commerce destination on the Web. (Nielsen//NetRatings, August 2003)</li>
</ul>
</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2004</td>
<td width="539" valign="top">
<ul>
<li>Yahoo! Launches <strong>SmartView </strong>Technology: New Mapping Feature Creates Customized   Visual Search Capability</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2005</td>
<td width="539" valign="top">
<ul>
<li>Yahoo! Search Launches <strong>Search Subscriptions Beta</strong>, Providing Select Deep Web Content to   Users</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2006</td>
<td width="539" valign="top">
<ul>
<li>Yahoo! Opens <strong>Internet Time Capsule </strong>to Capture Life in 2006
<ul>
<li>SUNNYVALE, Calif., October 10, 2006 – Yahoo!   Inc. (Nasdaq:YHOO) today announced the launch of what is expected to be the   world&#8217;s largest time capsule in history. Starting today, Yahoo! is   encouraging people from around the world to contribute personal photos,   stories, thoughts, ideas, poems, home movies and art to this first-ever   electronic&#8230;</li>
</ul>
</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2007</td>
<td width="539" valign="top">
<ul>
<li><strong>Yahoo! pipes</strong>: Yahoo! Pipes was released to the public in beta on 7 February   2007.Yahoo! Pipes is a web application from Yahoo! that provides a graphical   user interface for building data mashups that aggregate web feeds, web pages,   and other services, creating Web-based apps from various sources, and   publishing those apps. The application works by enabling users to   &#8220;pipe&#8221; information from different sources and then set up rules for   how that content should be modified (for example, filtering).<strong> </strong></li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2008</td>
<td width="539" valign="top">
<ul>
<li>The   software, called <strong>Hadoop</strong>, is part   of Yahoo&#8217;s massive computing grid and is transforming the way Yahoo and   corporate giants such as IBM extract meaning from enormous streams of data.   Universities are also using the code &#8211; an open-source version of software   Google relies on for daily operation &#8211; to train a new generation of computer   scientists and engineers. On February 19, 2008, Yahoo! launched what it   claimed was the world&#8217;s largest Hadoop production application. The Yahoo!   Search Webmap is a Hadoop application that runs on a more than 10,000 <a title="Multi-core" href="http://en.wikipedia.org/wiki/Multi-core">core</a> <a title="Linux" href="http://en.wikipedia.org/wiki/Linux">Linux</a> <a title="Cluster (computing)" href="http://en.wikipedia.org/wiki/Cluster_(computing)">cluster</a> and   produces data that is now used in every Yahoo! Web search query.</li>
</ul>
<ul>
<li><strong>Yahoo   joins OPEN SOCIAL</strong>: On Mar 25, 2008 Yahoo! also announced it has joined   the initiative . OpenSocial is a set of common application programming   interfaces (APIs) for web-based social network applications, developed by   Google along with MySpace and a number of other social networks. It was   released November 1, 2007. Applications implementing the OpenSocial APIs will   be interoperable with any social network system that supports them, including   features on sites such as Hi5.com, MySpace, orkut, Netlo], Sonico.com,   Friendster, Ning and Yahoo!.</li>
</ul>
<ul>
<li>Yahoo! Inc. announces the general availability   of <strong>Fire Eagle</strong> (http://fireeagle.yahoo.net), an open platform that helps users take their   location to the Web while giving them the ability to easily control how and   where their location data</li>
</ul>
<ul>
<li><strong>Yahoo!   Opens Up Search Technology Infrastructure for Innovative, New Search   Experiences, Providing Third Parties with Unprecedented Access, Re-Ranking   and Presentation Control of Web Search Results:</strong>
<ul>
<li><strong>BOSS:   Build your own search service</strong>:   The main goal and idea of BOSS is to give   users, in this case developers, free access to the <a title="Yahoo! Search" href="http://en.wikipedia.org/wiki/Yahoo!_Search">Yahoo! Search</a> <a title="Index (search engine)" href="http://en.wikipedia.org/wiki/Index_(search_engine)">index</a>.   The results can be supplied into the developer&#8217;s website or program so that   they can manipulate the resources according to their product&#8217;s requirements.   BOSS allows the results to be returned back in <a title="XML" href="http://en.wikipedia.org/wiki/XML">XML</a>, <a title="JSON" href="http://en.wikipedia.org/wiki/JSON">JSON</a>, <a title="HTML" href="http://en.wikipedia.org/wiki/HTML">HTML</a>, <a title="Text" href="http://en.wikipedia.org/wiki/Text">text</a> and   also allows the comprehensive search feature allowed in Yahoo like pulling   the results by pages, searching inside <a title="PDF" href="http://en.wikipedia.org/wiki/PDF">PDF</a>,   etc. The ranking of the websites for a search term is same as the <a title="Yahoo! Search" href="http://en.wikipedia.org/wiki/Yahoo!_Search">Yahoo! Search</a>ranking since both of these are   pulling from the same index and ranking.</li>
</ul>
</li>
</ul>
<p><strong> </strong></td>
</tr>
<tr>
<td width="45" valign="top">2009</td>
<td width="539" valign="top">
<ul>
<li>On June   10, 2009, Yahoo! released its own distribution of Hadoop.</li>
</ul>
</td>
</tr>
</tbody>
</table>
<p>Google:</p>
<table border="1" cellspacing="0" cellpadding="0" width="450">
<tbody>
<tr>
<td width="45" valign="top">1998</td>
<td width="539" valign="top">
<ul>
<li>Google   sets up workspace in Susan Wojcicki&#8217;s garage at <a href="http://maps.google.com/maps?q=232+santa+margarita,+menlo+park+ca&amp;ie=UTF8&amp;oe=utf-8&amp;client=firefox-a&amp;ll=37.457861,-122.163312&amp;spn=0.008431,0.019999&amp;z=16&amp;iwloc=addr">232 Santa Margarita, Menlo Park</a>.</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2000</td>
<td width="539" valign="top">
<ul>
<li>The   first <a href="http://www.google.com/press/pressrel/pressrelease22.html">10 language versions of Google.com</a> are   released</li>
<li>Google   forges a <a href="http://www.google.com/press/pressrel/pressrelease25.html">partnership with Yahoo!</a> to   become their default search provider.</li>
<li>Google   search index reaches 1 billion pages</li>
<li>Google   toolbar is launched</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2001</td>
<td width="539" valign="top">
<ul>
<li>Image   Search <a href="http://www.google.com/googlefriends/jul2001.html">launches</a>, offering access to 250 million   images.</li>
<li>Google   is available in 26 languages</li>
<li>Search   index reaches 3 billion mark.</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2002</td>
<td width="539" valign="top">
<ul>
<li>The   first Google hardware is <a href="http://news.cnet.com/Google-aims-search-device-at-companies/2100-1023_3-833905.html">released</a>: it&#8217;s a yellow box   called the <a href="http://www.google.com/enterprise/gsa/">Google Search Appliance</a> that   businesses can plug into their computer network to enable search capabilities   for their own documents.</li>
<li>Google   releases a <a href="http://www.google.com/press/pressrel/select.html">major overhaul</a> for <a href="https://adwords.google.com/">AdWords</a>,   including new cost-per-click pricing.</li>
<li>Google   releases a <a href="http://www.infoworld.com/articles/hn/xml/02/04/11/020411hngoogleapi.html">set of APIs</a>, enabling   developers to query more than 2 billion web documents and program in their   favorite environment, including Java, Perl and Visual Studio.</li>
<li>Users   can search for stuff to buy with <a href="http://searchenginewatch.com/showPage.html?page=2161381">Froogle</a> (later   called <a href="http://www.google.com/products">Google Product Search</a>).</li>
<li>Partnership   with AOL</li>
<li>Google   Labs is launched</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2003</td>
<td width="539" valign="top">
<ul>
<li>Google   announces a new <a href="http://www.google.com/press/pressrel/advertising.html">content-targeted advertising service</a>,   enabling publishers large and small to access Google&#8217;s vast network of   advertisers. (Weeks later, on April 23, we acquired Applied Semantics, whose   technology bolsters the service named <a href="https://www.google.com/adsense">AdSense</a>.)</li>
<li>Google   acquires blogger.com</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2004</td>
<td width="539" valign="top">
<ul>
<li>Search   index reaches 8 billion</li>
<li>Orkut   released</li>
<li>Keyhole   Acquired</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2005</td>
<td width="539" valign="top">
<ul>
<li>Urchin   acquired</li>
<li>Google   Maps, code.google.com launched</li>
<li>Google   image search boasts of 1.1 billion images.</li>
<li>iGoogle   launched</li>
<li>Google   Earth, Google talk launched</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2006</td>
<td width="539" valign="top">
<ul>
<li>YouTube   acquired</li>
<li>Jotspot   acquired</li>
<li>Google   docs and spreadsheets launched</li>
<li>Google   custom search launched</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2007</td>
<td width="539" valign="top">
<ul>
<li>Google   hot trends launched</li>
<li>Partnership   with salesforce.com</li>
<li>Postini   acquired</li>
<li>Joint   supercomputing project with IBM</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2008</td>
<td width="539" valign="top">
<ul>
<li>DoubleClick   acquired</li>
<li>Google   index: 1 trillion</li>
<li>Google   Chrome browser launched</li>
<li>Google   tracks flu trends</li>
</ul>
</td>
</tr>
</tbody>
</table>
<p>IBM:</p>
<table border="1" cellspacing="0" cellpadding="0" width="450">
<tbody>
<tr>
<td width="45" valign="top">1995</td>
<td width="539" valign="top">
<ul>
<li>IBM acquires Lotus</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">1996</td>
<td width="539" valign="top">
<ul>
<li>IBM launches its DB2 relational database.</li>
<li>IBM acquires Tivoli.</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">1998</td>
<td width="539" valign="top">
<ul>
<li>IBM launches the PowerPC 740/750 processors,   the world&#8217;s first manufactured using IBM&#8217;s copper manufacturing technology.</li>
<li>Two new AS/400s are introduced, as well as new   products in the Aptiva, PC, and Thinkpad series.The IBM S/390 computing system   for business is also launched.</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">1999</td>
<td width="539" valign="top">
<ul>
<li>The S/390 G6 server, using IBM&#8217;s cop per   technology, is introduced.</li>
<li>IBM and Dell sign a $16 billion technoogy   agreement, where Dell will purchase IBM components for use in Dell systems.</li>
<li>IBM and Lotus found the Institute for Knowledge   Management.</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2000</td>
<td width="539" valign="top">
<ul>
<li>IBM launches the NetVista line of PC devices.</li>
<li>IBM launches the eServer line.</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2002</td>
<td width="539" valign="top">
<ul>
<li>Product offerings during 2002 include the   eServer p650 eight-way UNIX server, the eServer i890, and the IBM eServer xSeries   440.</li>
<li>IBM acquires Price Waterhouse Coopers&#8217; business   consulting and technology services unit for $3.5 billion in cash and stock.</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2003</td>
<td width="539" valign="top">
<ul>
<li>IBM and Cisco announce a set of open software   technologies designed to advance the development of &#8220;self-healing&#8221;   computer systems and networks.</li>
<li>IBM and Siebel launch CRM OnDemand.</li>
<li>IBM launches its WebSphere business integration   software.</li>
<li>Japan&#8217;s largest research organization orders   an AMD Opteron based eServer 325 supercomputer, running Linux.</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2005</td>
<td width="539" valign="top">
<ul>
<li>IBM plans to expand its data-integration   product line through a $1.1 billion acquisition of Ascential Software Corp.</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2007</td>
<td width="539" valign="top">
<ul>
<li>Google and I.B.M. Join hands  in ‘Cloud Computing’ Research</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2008</td>
<td width="539" valign="top">
<ul>
<li>Researchers with IBM have developed a new set   of software applications designed to improve the human memory. The software   is designed to run on a smartphone or mobile handset and analyze collected   pieces of data. The collected data is then used to help the user better   remember faces and other information such as conversations.</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2009</td>
<td width="539" valign="top">
<ul>
<li>IBM boasts that its so-called Sequoia system   will be capable of crunching numbers 20 times faster than IBM&#8217;s last   record-breaker and 15 times faster than the current fastest machine.</li>
</ul>
</td>
</tr>
</tbody>
</table>
<p>Sources:</p>
<ul>
<li><a href="/DataMiningTools/top%2015%20datamining%20companies%20blog%20post/wikipedia.org"> Wikipedia</a>,</li>
<li><a href="http://docs.yahoo.com/info/pr/releases.html">Yahoo! Media Relations</a>,</li>
<li> <a href="http://www.microsoft.com/Presspass/default.mspx">Microsoft PressPass</a>,</li>
<li> <a href="http://www.google.com/press/">Google Press Center</a>,</li>
<li><a href="http://www.google.com/corporate/timeline/#start">Google Timeline</a></li>
<li><a href="http://powerpivotpro.com/">PowerPivotPro.com</a></li>
<li><a href="http://research.microsoft.com/en-us/">Microsoft Research</a></li>
<li><a href="http://phx.corporate-ir.net/phoenix.zhtml?p=irol-mediaHome&amp;c=176060">Amazon Media Room</a></li>
<li><a href="http://www.informationweek.com/">InformationWeek</a></li>
<li><a href="http://www.nytimes.com/">NYTimes</a></li>
</ul>
<p>&#8211;  SAGAR JAUHARI, SDE Intern.</p>
<p><a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fdataminingtools.net%2Fblog%2F2009%2F12%2F31%2Fthe-datamining-journey-so-far%2F&amp;linkname=The%20datamining%20journey%20so%20far%20.."><img src="http://dataminingtools.net/blog/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a></p>]]></content:encoded>
			<wfw:commentRss>http://dataminingtools.net/blog/2009/12/31/the-datamining-journey-so-far/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Tim Berners-Lee on the next Web</title>
		<link>http://dataminingtools.net/blog/2009/09/24/tim-berners-lee-on-the-next-web/</link>
		<comments>http://dataminingtools.net/blog/2009/09/24/tim-berners-lee-on-the-next-web/#comments</comments>
		<pubDate>Thu, 24 Sep 2009 12:50:36 +0000</pubDate>
		<dc:creator>Vikramaditya Jakkula</dc:creator>
				<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[news]]></category>

		<guid isPermaLink="false">http://dataminingtools.net/blog/?p=106</guid>
		<description><![CDATA[Lets hear what Tim Berner&#8217;s Lee talks about the next Web! Yes, its Linked Data! We need data patterns to understand users, linked data is the answer, and raw or unadulterated data is what we should ask for.



]]></description>
			<content:encoded><![CDATA[<p><span style="font-family: arial, helvetica, sans-serif;">Lets hear what Tim Berner&#8217;s Lee talks about the next Web! Yes, its Linked Data! We need data patterns to understand users, linked data is the answer, and raw or unadulterated data is what we should ask for.</span></p>
<p><span style="font-family: arial, helvetica, sans-serif; font-weight: normal; color: #000000; padding: 0px; margin: 0px;"><br />
</span></p>
<p><object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="446" height="326" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="wmode" value="transparent" /><param name="bgColor" value="#ffffff" /><param name="flashvars" value="vu=http://video.ted.com/talks/dynamic/TimBerners-Lee_2009-medium.flv&amp;su=http://images.ted.com/images/ted/tedindex/embed-posters/TimBerners-Lee-2009.embed_thumbnail.jpg&amp;vw=432&amp;vh=240&amp;ap=0&amp;ti=484&amp;introDuration=16500&amp;adDuration=4000&amp;postAdDuration=2000&amp;adKeys=talk=tim_berners_lee_on_the_next_web;year=2009;theme=what_s_next_in_tech;event=TED2009;&amp;preAdTag=tconf.ted/embed;tile=1;sz=512x288;" /><param name="src" value="http://video.ted.com/assets/player/swf/EmbedPlayer.swf" /><param name="bgcolor" value="#ffffff" /><param name="allowfullscreen" value="true" /><embed type="application/x-shockwave-flash" width="446" height="326" src="http://video.ted.com/assets/player/swf/EmbedPlayer.swf" flashvars="vu=http://video.ted.com/talks/dynamic/TimBerners-Lee_2009-medium.flv&amp;su=http://images.ted.com/images/ted/tedindex/embed-posters/TimBerners-Lee-2009.embed_thumbnail.jpg&amp;vw=432&amp;vh=240&amp;ap=0&amp;ti=484&amp;introDuration=16500&amp;adDuration=4000&amp;postAdDuration=2000&amp;adKeys=talk=tim_berners_lee_on_the_next_web;year=2009;theme=what_s_next_in_tech;event=TED2009;&amp;preAdTag=tconf.ted/embed;tile=1;sz=512x288;" bgcolor="#ffffff" wmode="transparent" allowfullscreen="true"></embed></object></p>
<p><a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fdataminingtools.net%2Fblog%2F2009%2F09%2F24%2Ftim-berners-lee-on-the-next-web%2F&amp;linkname=Tim%20Berners-Lee%20on%20the%20next%20Web"><img src="http://dataminingtools.net/blog/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a></p>]]></content:encoded>
			<wfw:commentRss>http://dataminingtools.net/blog/2009/09/24/tim-berners-lee-on-the-next-web/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>WEKA pens victory at the University of California San Diego student datamining competition</title>
		<link>http://dataminingtools.net/blog/2009/09/23/weka/</link>
		<comments>http://dataminingtools.net/blog/2009/09/23/weka/#comments</comments>
		<pubDate>Thu, 24 Sep 2009 05:47:56 +0000</pubDate>
		<dc:creator></dc:creator>
				<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Tools]]></category>
		<category><![CDATA[news]]></category>
		<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">http://dataminingtools.net/blog/?p=86</guid>
		<description><![CDATA[A machine learning algorithm processes a very large dataset to find the best fit in the observed data to learn, and also makes use of the prior knowledge of the system for learning or forecasting. Data mining forms an integral part of the search and understanding of the electronically stored data.
Graduate student Quan Sun’s win [...]]]></description>
			<content:encoded><![CDATA[<p>A machine learning algorithm processes a very large dataset to find the best fit in the observed data to learn, and also makes use of the prior knowledge of the system for learning or forecasting. Data mining forms an integral part of the search and understanding of the electronically stored data.</p>
<p><span style="font-family: Verdana, Arial, Helvetica, sans-serif; line-height: 17px; font-size: 12px;"><span style="font-family: Verdana; font-size: x-small;">Graduate student Quan Sun’s win at the University of California San Diego student datamining competition surely highlights<a href="http://www.cs.waikato.ac.nz/ml/weka/"> Weka</a>. Sun claims he used open source software to claim his win in the elite “hard” category for graduate students. In fact, says Sun, at least half the competitors in the competition used the software, called Weka, which he describes as the “Microsoft Word of data mining”.</span></span></p>
<p>Waikato Environment for Knowledge Analysis (WEKA), developed at the University of Waikato, New Zealand, is a collection of machine learning algorithms. It has data preprocessing tools to provide inputs to these algoritms.This tool uses Java as its base and is compatible with both Windows and Linux. It is open source software available under the terms of GNU General Public License. Inputs to the machine learning algorithms implemented in WEKA are in the form of relational tables in Attribute Relational File Format (ARFF).</p>
<p>Some of the key features of <a href="http://www.cs.waikato.ac.nz/ml/weka/">WEKA</a></p>
<ul>
<li>WEKA has inbuilt format converters to convert the dataset available in any format say a spreadsheet to ARFF type. In addition it also incorporates filter to delete specified attributes from the dataset.</li>
<li>WEKA trains and tests the learning algorithms that perform classification and regression and also allows the user to create his own classifier interactively.</li>
<li>WEKA allows the user to handle cluster and their instances.Inaddition it provides access to several methods for attribute selection that might involve either the full data set or a cross validation.</li>
</ul>
<p>Working with WEKA is very simple. This is mainly because it uses a GUI Explorer. The classification process is not cumbersome, as it involves the selection of the attributes to be related and the algorithm to be used, by the user. The results provide a matrix of both the classified and the misclassified data. The classification error mentioning the mean and the standard deviation are also displayed. WEKA helps in realizing the goal of data mining; by predicting missing values and validating that the predicted values are correct. WEKA is a tool that permits users to develop and analyze new machine learning algorithms to make their job easier.</p>
<p>What can <a href="http://www.cs.waikato.ac.nz/ml/weka/">weka </a>do?</p>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">reprocess – Weka has file format converters for spreadsheets, C4.5 file formats and serialized instances. It can also open a URL and use HTTP to download an ARFF file from the Web or open a database using JDBC, and retrieve instances using SQL. It also provides a list of filters to delete specified attributes from a dataset.</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">Direct Hit!</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">Applies To: Researchers in Data Mining and Artificial Intelligence</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">USP: Applying machine learning algorithms for data mining</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">Primary Link: www.cs.waikato.ac.nz/ml/weka Search Engine</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">Keywords: Machine Learning, Data Mining</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">Classify – Weka trains and tests learning schemes that perform classification or regression. The classifiers can be divided into Bayesian, trees, rules, functions and lazy. It also builds a linear regression model and allows the user to build their own classifiers interactively. It also provides options for a number of meta learners.</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">Cluster – Weka shows the clusters and the number of instances in the cluster. Thereafter it determines the majority class in each cluster and gives the confusion matrix.</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">Associate – Weka contains three algorithms for determining</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">association rules-apriory, predictive apriory and filtered associators. It has no methods for evaluating such rules.</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">Attribute Selection – Weka gives access to several methods for attribute selection, which involves an attribute evaluator and a search method. Attribute selection can be performed using the full training set or cross-validation.</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;"><span style="white-space: pre;"> </span></div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">In the Preprocess tab, you can view attributes in the input file, properties of the selected attribute, and visualisation of class distribution for each attribute.<span style="white-space: pre;"> </span>Building a Naïve Bayes Classifier with 10 fold cross-validation. The correctly classified instances can be viewed by right clicking on Classifier in Results Window.</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">Visualization &#8211; It displays a matrix of two-dimensional scatter plots of each pair of attributes.</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">Preparing input</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">Major effort in the process of data mining/machine learning goes into the preparation of input. In order to analyze data using Weka, you need to prepare it in the Attribute Relation File Format (ARFF) and then load it in its Explorer. Spreadsheets, Comma Separated Value (CSV) files and databases can be converted to ARFF. In ARFF, there is an @relation tag, @attribute tag and @data tag to represent the dataset name, attribute information and values respectively.</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">Classifying data</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">Weka should preferably be used through a graphical user interface called &#8216;Explorer&#8217; than the command-line interface. The other two interfaces are &#8216;Knowledge Flow Interface,&#8217; which supports design configuration for streamed data processing and &#8216;Experimenter,&#8217; which helps users compare a variety of learning techniques. In this example, we use an ARFF named age.arff which contains a few selective words in the attribute and @data contains their number of occurrences per 10,000 words in a blog dataset written by bloggers belonging to various age groups.</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">1. Open the file you want to analyze using the Open file option in the Preprocess tab in Weka explorer, ie open the age data file, age.arff.</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">2. Once the input file has been opened, all attributes in the input file are shown in the Attributes Window. Properties of the selected attributes like Attribute Name, Attribute Type, number of missing values, etc are displayed in the &#8216;Selected Attribute&#8217; window. Here, you can select attributes that you want to include in working relations, eg age prediction.</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">3. Select the classifier algorithm in the Classify tab. In this example, we selected Naïve Bayes with 10 fold Cross-Validation. Next, click on Start. The result is displayed in the Classifier Output window as shown in figure on the left.</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;"><span style="white-space: pre;"> </span></div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">Output of the Naïve Bayes Classifier in terms of errors, accuracy by class and confusion matrix, on Age dataset.<span style="white-space: pre;"> </span>View of an ARFF dataset which consists of a list of instances, and the attribute values for each instance separated by commas.</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">Analyzing the result</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">The result displays the summary of the data set followed by the algorithm used to analyze it. It also gives the predictive performance of the machine-learning algorithm applied on the dataset. Thereafter the confusion matrix displays the number of instances classified properly and those misclassified. The classification error is displayed mentioning the mean absolute error and the root mean squared error of the class probability estimates.</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">Processing huge datasets</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">If the dataset is too huge, running to a few thousand attributes and a few lakh records, it can happen that Weka runs into an &#8216;OutOfMemory&#8217; exception. Most Java virtual machines allocate a certain maximum amount of memory which is much less than the amount of RAM to run Java programs. However, we can extend the memory available for the virtual machine by setting appropriate options. Alternately, Weka offers several filters for re-sampling a dataset and generating a new dataset reduced in size. Besides, there are schemes that can be trained in an incremental fashion, not just in batch mode unlike most classifiers which require all the data before they can be trained. Such a classifier will load the dataset incrementally and feed the data instance by instance to the classifier.</div>
<p>Preprocess – Weka has file format converters for spreadsheets, C4.5 file formats and serialized instances. It can also open a URL and use HTTP to download an ARFF file from the Web or open a database using JDBC, and retrieve instances using SQL. It also provides a list of filters to delete specified attributes from a dataset.</p>
<p>Classify – Weka trains and tests learning schemes that perform classification or regression. The classifiers can be divided into Bayesian, trees, rules, functions and lazy. It also builds a linear regression model and allows the user to build their own classifiers interactively. It also provides options for a number of meta learners.</p>
<p>Cluster – Weka shows the clusters and the number of instances in the cluster. Thereafter it determines the majority class in each cluster and gives the confusion matrix.</p>
<p>Associate – Weka contains three algorithms for determining association rules-apriory, predictive apriory and filtered associators.</p>
<p>Attribute Selection – Weka gives access to several methods for attribute selection, which involves an attribute evaluator and a search method. Attribute selection can be performed using the full training set or cross-validation.</p>
<p>Visualization &#8211; It displays a matrix of two-dimensional scatter plots of each pair of attributes.</p>
<p><a href="http://videolectures.net/bootcamp07_belanche_mldm/"><br />
<img style="border: 0px initial initial;" src="http://videolectures.net/bootcamp07_belanche_mldm/thumb.jpg" border="0/" alt="" /></a></p>
<p><a href="http://videolectures.net/bootcamp07_belanche_mldm/"> </a></p>
<p><a href="http://videolectures.net/bootcamp07_belanche_mldm/">Other ML/DM software (R, Weka, Yale)</a></p>
<p>Lluís Belanche</p>
<p>Processing huge datasets</p>
<p>If the dataset is too huge, running to a few thousand attributes and a few lakh records, can lead Weka into &#8216;OutOfMemory&#8217; exception. Most Java virtual machines allocate a certain maximum amount of memory which is much less than the amount of RAM to run Java programs. However, we can extend the memory available for the virtual machine by setting appropriate options. For large data processing we can take a look at <a href="http://lucene.apache.org/mahout/">Mahout</a>, an open source <span style="font-family: Verdana, Helvetica, sans-serif; line-height: 15px;">scalable, Apache licensed machine learning libraries</span></p>
<p>Read more at<a href="http://computerworld.co.nz/news.nsf/spec/0874D9A6E2E299D6CC257629006ECB53"> computer world</a>.</p>
<p><a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fdataminingtools.net%2Fblog%2F2009%2F09%2F23%2Fweka%2F&amp;linkname=WEKA%20pens%20victory%20at%20the%20University%20of%20California%20San%20Diego%20student%20datamining%20competition"><img src="http://dataminingtools.net/blog/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a></p>]]></content:encoded>
			<wfw:commentRss>http://dataminingtools.net/blog/2009/09/23/weka/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>KDD 2009: Registration Opens</title>
		<link>http://dataminingtools.net/blog/2009/05/29/kdd-2009-registration-opens/</link>
		<comments>http://dataminingtools.net/blog/2009/05/29/kdd-2009-registration-opens/#comments</comments>
		<pubDate>Fri, 29 May 2009 19:05:22 +0000</pubDate>
		<dc:creator>Vikramaditya Jakkula</dc:creator>
				<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Tools]]></category>
		<category><![CDATA[Conference]]></category>

		<guid isPermaLink="false">http://dataminingtools.net/blog/?p=5</guid>
		<description><![CDATA[
The annual ACM SIGKDD conference is the premier international forum for data mining researchers and practitioners from academia, industry, and government to share their ideas, research results and experiences. KDD-09 will feature keynote presentations, oral paper presentations, poster sessions, workshops, tutorials, panels, exhibits, demonstrations, and the KDD Cup competition.
The registration is now open. The program [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: center;"><a href="http://www.sigkdd.org/kdd2009/"><img class="size-medium wp-image-6  aligncenter" title="KDD 2009" src="http://dataminingtools.net/blog/wp-content/uploads/2009/05/kdd1-300x33.jpg" alt="The best in Data Mining happens here" width="300" height="33" /></a></p>
<p>The annual ACM SIGKDD conference is the premier international forum for data mining researchers and practitioners from academia, industry, and government to share their ideas, research results and experiences. KDD-09 will feature keynote presentations, oral paper presentations, poster sessions, workshops, tutorials, panels, exhibits, demonstrations, and the KDD Cup competition.</p>
<p>The<a href="http://www.sigkdd.org/kdd2009/registration.html" target="_blank"> registration </a>is now open. The <a title="KDD Program" href="http://www.sigkdd.org/kdd2009/program.html" target="_blank">program</a> is made available. Students make sure to visit <a href="http://www.sigkdd.org/kdd2009/student_travel.html" target="_blank">travel grants section</a> to apply for awards. And for all those who plan to travel check out the <a href="http://www.sigkdd.org/kdd2009/travel.html#deals" target="_blank">deals</a> posted.</p>
<p>For those intrested to see some interesting talks from KDD 2008, <a href="http://videolectures.net/kdd08_las_vegas/" target="_blank">videos </a>are available.</p>
<p>Venue: 17 Boulevard Saint-Jacques, 75014 Paris, France<br />
</p>
<p><div>
	<h2>
		<a href="http://dataminingtools.net/blog/2009/05/29/kdd-2009-registration-opens/"></a>
	</h2>
	<p>
			<a href="http://dataminingtools.net/blog/2009/05/29/kdd-2009-registration-opens/image-page/1" rel="nofollow" title="kdd1">
			<img src="http://dataminingtools.net/blog/wp-content/uploads/picturesurf/_1/ST_1hrr4hqv3.jpg" style="margin:2px 0; border:1px solid #BDC7D8"/>
		</a>
			<a href="http://dataminingtools.net/blog/2009/05/29/kdd-2009-registration-opens/image-page/2" rel="nofollow" title="09_03_23_galerie-cote-gauche">
			<img src="http://dataminingtools.net/blog/wp-content/uploads/picturesurf/_1/ST_2mkvd0wlj.jpg" style="margin:2px 0; border:1px solid #BDC7D8"/>
		</a>
			<a href="http://dataminingtools.net/blog/2009/05/29/kdd-2009-registration-opens/image-page/3" rel="nofollow" title="09_03_23_Petite-salle-attenante">
			<img src="http://dataminingtools.net/blog/wp-content/uploads/picturesurf/_1/ST_3f9aotwf3.jpg" style="margin:2px 0; border:1px solid #BDC7D8"/>
		</a>
			<a href="http://dataminingtools.net/blog/2009/05/29/kdd-2009-registration-opens/image-page/4" rel="nofollow" title="09_03_23_Salle-avec-chaises-2">
			<img src="http://dataminingtools.net/blog/wp-content/uploads/picturesurf/_1/ST_4tfwtlc2l.jpg" style="margin:2px 0; border:1px solid #BDC7D8"/>
		</a>
			<a href="http://dataminingtools.net/blog/2009/05/29/kdd-2009-registration-opens/image-page/5" rel="nofollow" title="MUSARROM42">
			<img src="http://dataminingtools.net/blog/wp-content/uploads/picturesurf/_1/ST_5th510hiji.jpg" style="margin:2px 0; border:1px solid #BDC7D8"/>
		</a>
			<a href="http://dataminingtools.net/blog/2009/05/29/kdd-2009-registration-opens/image-page/6" rel="nofollow" title="parst_phototour01">
			<img src="http://dataminingtools.net/blog/wp-content/uploads/picturesurf/_1/ST_6bjias26d.jpg" style="margin:2px 0; border:1px solid #BDC7D8"/>
		</a>
		</p>
</div></p>
<p><a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fdataminingtools.net%2Fblog%2F2009%2F05%2F29%2Fkdd-2009-registration-opens%2F&amp;linkname=KDD%202009%3A%20Registration%20Opens"><img src="http://dataminingtools.net/blog/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a></p>]]></content:encoded>
			<wfw:commentRss>http://dataminingtools.net/blog/2009/05/29/kdd-2009-registration-opens/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

