<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title> &#187; Market Research</title>
	<atom:link href="http://dataminingtools.net/blog/tag/market-research/feed/" rel="self" type="application/rss+xml" />
	<link>http://dataminingtools.net/blog</link>
	<description></description>
	<lastBuildDate>Mon, 25 Jul 2011 08:51:53 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Data Mining continues to aid Cyber Security</title>
		<link>http://dataminingtools.net/blog/2010/03/19/data-mining-continues-to-aid-cyber-security/</link>
		<comments>http://dataminingtools.net/blog/2010/03/19/data-mining-continues-to-aid-cyber-security/#comments</comments>
		<pubDate>Fri, 19 Mar 2010 14:11:03 +0000</pubDate>
		<dc:creator>vinayak</dc:creator>
				<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[Market Research]]></category>
		<category><![CDATA[news]]></category>

		<guid isPermaLink="false">http://dataminingtools.net/blog/?p=471</guid>
		<description><![CDATA[Mr. Craig Shue, a cyber security research scientist at the Oak Ridge National Lab, said it is clear that a large fraction of the Internet address ranges at many ISPs are engaged in malicious activity. He added, &#8220;these [networks] may harbor malicious activity and should be investigated.&#8221;
This statement can be set as the abstract of a new research being [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: justify;">Mr. <a href="http://www.cs.indiana.edu/cgi-pub/cshue/index.php" target="_blank">Craig Shue</a>, a cyber security research scientist at the Oak Ridge National Lab, said it is clear that a large fraction of the Internet address ranges at many ISPs are engaged in malicious activity. He added, &#8220;these [networks] may harbor malicious activity and should be investigated.&#8221;</p>
<p style="text-align: justify;">This statement could serve as the abstract of new research on data mining. According to this <a href="http://www.csiir.ornl.gov/shue/research/infocommini10.pdf" target="_blank">new research</a> by researchers from Indiana University at Bloomington and the Oak Ridge National Laboratory in Oak Ridge, TN, tracking the organized criminal activities of cyber gangs across the web will now be much easier.</p>
<p style="text-align: justify;">The research uses data from anti-malware companies, anti-spam companies and phishing blacklists to identify dense clusters of ISPs that appear to be overly tolerant of malicious activity. The researchers state that such patterns were particularly evident in Eastern Europe and the Middle East after comparing data from a variety of services that measure ISPs. According to them, an ISP is classified as malicious if it harbored at least 2.5 percent of the malicious Internet addresses for a given data set, such as the list of phishing sites or malware-laced sites. They found 58 networks that each had more than 100,000 compromised hosts in their Internet address space ranges, while another 255 networks had between 10,000 and 100,000 systems blacklisted.</p>
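<p style="text-align: justify;">As a rough illustration of the 2.5 percent rule described above, the following Python sketch flags networks by their share of the blacklisted addresses in one data set. The network names, the counts and the function name are made up for illustration; the study's actual procedure may differ in detail.</p>

```python
# Threshold taken from the rule quoted in the article: a network is
# flagged if it holds at least 2.5 percent of the malicious addresses
# in a given data set.
THRESHOLD = 0.025


def flag_networks(blacklisted_by_network, threshold=THRESHOLD):
    """Return the networks whose share of blacklisted addresses meets the threshold."""
    total = sum(blacklisted_by_network.values())
    if total == 0:
        return []
    return sorted(
        name
        for name, count in blacklisted_by_network.items()
        if count / total >= threshold
    )


# Made-up example data: network name mapped to the number of its
# addresses that appear on one (hypothetical) phishing blacklist.
phishing_counts = {"net-a": 30, "net-b": 960, "net-c": 10}
print(flag_networks(phishing_counts))
```

<p style="text-align: justify;">Because the rule is applied per data set, the same function would be run separately on the phishing, spam, botnet and malware lists.</p>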
<p style="text-align: justify;">Measuring online threats largely depends on their geographic location and focus. The study includes information on phishing websites from <a href="http://phishtank.com/" target="_blank">Phishtank.com</a> and the <a href="http://www.antiphishing.org/" target="_blank">Anti-Phishing Working Group</a>; botnet data from the <a href="http://www.shadowserver.org/wiki/" target="_blank">Shadowserver Foundation</a>; spam data from Indiana University, <a href="http://www.spamhaus.org/" target="_blank">Spamhaus</a>, <a href="http://www.surbl.org/" target="_blank">SURBL</a>, and <a href="http://www.support-intelligence.com/home/home.action" target="_blank">Support Intelligence</a>; and malware-hosting statistics from organizations such as <a href="http://www.clean-mx.de/" target="_blank">CleanMX</a>, <a href="http://www.esoft.com/" target="_blank">eSoft</a>, and <a href="http://www.malware.com.br/" target="_blank">Malware Patrol</a>.</p>
<p style="text-align: justify;">Ukraine, Iran and Belarus were found to be in an alarming state, with ISPs that had more than 80 percent of their Internet address ranges blacklisted for a combination of spam, phishing, and hosting malicious software. The number of such ISPs was two, one and one respectively. Turkey, on the other hand, stood out when the researchers analyzed (mined) the data on the prevalence of servers that criminals use to control botnets. It accounted for almost 9.11% of the total Internet addresses listed, through a large broadband ISP.</p>
<p style="text-align: justify;">Another strategy, which brought the United States to notice, identifies problem networks based on the number of blacklisted addresses for a given ISP. This method usually points to the world&#8217;s largest ISPs.</p>
<p style="text-align: justify;">One more approach was quite successful in identifying zombie systems: identifying ISPs and hosting providers that had a disproportionate number of malicious network peers. Using this approach, 22 networks were found to be purely malicious, while some 194 networks were found to be partially malicious.</p>
<p style="text-align: justify;">This research will definitely be of great help in developing Internet security and law enforcement in this field.</p>
<p style="text-align: justify;">For more details: http://www.csiir.ornl.gov/shue/research/infocommini10.pdf</p>
]]></content:encoded>
			<wfw:commentRss>http://dataminingtools.net/blog/2010/03/19/data-mining-continues-to-aid-cyber-security/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>SAS: Leader in Predictive Analytics</title>
		<link>http://dataminingtools.net/blog/2010/02/06/sas-leader-in-predictive-analytics/</link>
		<comments>http://dataminingtools.net/blog/2010/02/06/sas-leader-in-predictive-analytics/#comments</comments>
		<pubDate>Sat, 06 Feb 2010 15:12:01 +0000</pubDate>
		<dc:creator>vinayak</dc:creator>
				<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Review]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[Tools]]></category>
		<category><![CDATA[news]]></category>
		<category><![CDATA[Forrester]]></category>
		<category><![CDATA[Market Research]]></category>
		<category><![CDATA[predictive analytics]]></category>
		<category><![CDATA[press release]]></category>
		<category><![CDATA[SAS]]></category>

		<guid isPermaLink="false">http://dataminingtools.net/blog/?p=412</guid>
		<description><![CDATA[SAS is the leader in business analytics software and services, and the largest independent vendor in the business intelligence market.  SAS predictive analytics and data mining solutions were evaluated by Forrester against 53 criteria in three categories through vendor surveys, product demonstration and vendor-reference interviews. SAS earned top overall ranking in all three categories &#8212; [...]]]></description>
			<content:encoded><![CDATA[<p><strong><a href="http://www.sas.com" target="_blank">SAS</a></strong> is the leader in business analytics software and services, and the largest independent vendor in the business intelligence market.  SAS predictive analytics and data mining solutions were evaluated by Forrester against 53 criteria in three categories through vendor surveys, product demonstration and vendor-reference interviews. SAS earned top overall ranking in all three categories &#8212; current offering, strategy and market presence &#8212; including perfect scores for functionality, professional services, licensing and cost, direction, and company financials criteria and has been named a leader among nine vendors in The <a href="http://www.forrester.com/rb/Research/wave%26trade;_predictive_analytics_and_data_mining_solutions,/q/id/56077/t/2" target="_blank">Forrester Wave</a>: Predictive Analytics and Data Mining Solutions, Q1 2010.</p>
<p>Today&#8217;s industries generate large volumes of data in sectors such as finance, retail, factories, call centers, and customer products. SAS Analytics lets them realize the value within these growing volumes of data.</p>
<p>[<a href="http://www.sas.com/news/preleases/forresterwave-predictiveanalytics-datamining.html" target="_blank">Read more @ SAS</a>]</p>
]]></content:encoded>
			<wfw:commentRss>http://dataminingtools.net/blog/2010/02/06/sas-leader-in-predictive-analytics/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The datamining journey ahead ..</title>
		<link>http://dataminingtools.net/blog/2010/01/03/the-datamining-journey-ahead/</link>
		<comments>http://dataminingtools.net/blog/2010/01/03/the-datamining-journey-ahead/#comments</comments>
		<pubDate>Mon, 04 Jan 2010 02:46:25 +0000</pubDate>
		<dc:creator>Vikramaditya Jakkula</dc:creator>
				<category><![CDATA[Research]]></category>
		<category><![CDATA[Review]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[news]]></category>
		<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[Market Research]]></category>

		<guid isPermaLink="false">http://dataminingtools.net/blog/?p=318</guid>
		<description><![CDATA[The data mining journey ahead is far and wide. As we enter the 21st century, the sheer volume of data on our planet will explode, with information being authored by billions of people and flowing from a trillion intelligent devices, sensors and instrumented objects that become a part of everyday life. About 80% of [...]]]></description>
			<content:encoded><![CDATA[<p>The data mining journey ahead is far and wide. As we enter the 21<sup>st</sup> century, the sheer volume of data on our planet will explode, with information being authored by billions of people and flowing from a trillion intelligent devices, sensors and instrumented objects that become a part of everyday life. About 80% of this new data is unstructured content. Data mining and machine learning are the technologies that help in capturing all this data and turning it into actual intelligence.</p>
<p>Data mining and machine learning are already being used in various fields, be it the sciences, engineering or even entertainment. Data mining in customer relationship management applications can contribute significantly to the bottom line. Rather than randomly contacting a prospect or customer through a call centre or by mail, a company can concentrate its efforts on prospects that are predicted to have a high likelihood of responding to an offer. Machine learning is enabling more realistic artificial intelligence in computer games. In bioinformatics, it is being used to detect patterns in DNA sequences, which is very important to the Human Genome Project.</p>
<div id="attachment_319" class="wp-caption aligncenter" style="width: 210px"><img class="size-full wp-image-319" title="Datamining" src="http://dataminingtools.net/blog/wp-content/uploads/2010/01/nsia_data_mining1.jpg" alt="Photography: infocusmagazine.org" width="200" height="159" /><p class="wp-caption-text">Photography: infocusmagazine.org</p></div>
<p>In the future, data mining will help solve problems with far bigger impacts on mankind, at scales we have not seen until now. The complexity of these challenges will make the use of machines necessary, not just for data management but also as a source of intelligence. Data mining and machine learning can provide innovative, out-of-the-box and practical solutions in fields like climate change, energy, education and health. Here we look at some of the challenges of the 21<sup>st</sup> century and the ways data mining and machine learning can be of assistance.</p>
<p><strong>Climate Change</strong></p>
<p>Climate change is one of the biggest challenges we face and one that needs immediate attention. One of the major reasons for our current situation is the inappropriate use of fossil fuels. To this end we are trying alternative sources of energy. Data mining can play a crucial role here, not only by monitoring our usage but also by finding patterns in global data that will help determine the best source of alternative energy for a region. We have been able to develop renewable sources of energy such as solar power, wind turbines and turbines that harness the massive energy present in ocean currents. But not all of these sources will be appropriate for every region, and we will need detailed region-wise studies to gather information that helps judge the best sources to use in a region during different seasons of the year, so as to provide a continuous flow of energy. The complexity and the huge amount of data involved in such a study will make the use of computers indispensable. Data mining will help in figuring out climate patterns and in suggesting statistically sound answers as to which energy sources are best for a region.</p>
<div id="attachment_323" class="wp-caption aligncenter" style="width: 310px"><img class="size-full wp-image-323" title="climatic changes" src="http://dataminingtools.net/blog/wp-content/uploads/2010/01/18-change-global-climate-change-brian-hayes-usa-thumb.jpg" alt="Photo Courtesy: Brian Hayes, USA" width="300" height="420" /><p class="wp-caption-text">Photo Courtesy: Brian Hayes, USA</p></div>
<p>Machine learning will also help us counter some of the negative impacts of climate change, such as changes in land-use patterns. Data mining, which is the partially automated search for hidden patterns in large databases, offers great potential benefits for applied Geographic Information Systems-based decision-making. Recently, the task of integrating these two technologies has become critical, especially as various public and private sector organisations possessing huge databases with thematic and geographically referenced data begin to realise the huge potential of the information hidden there. Environmental agencies are assessing the impact of changing climate conditions on land-use patterns by applying data mining techniques to these vast sources of information. The days of the Global Environmental Protection Agency (G-EPA) are here, and yes, these are the men in green!</p>
<p><strong>Energy</strong></p>
<p>Energy consumption, and with it the depletion of energy resources, is on the rise. While looking for renewable sources of energy is crucial, managing energy in the present is equally important. We have large-scale inefficiencies in the whole process, from energy production to distribution to consumption. Today these areas are mainly handled manually, with computers used just for data management and with minimal use of computer-based intelligence. But as with most such problems, the complexity involved will make computer-based techniques like data mining crucial for an optimal solution [1].</p>
<p>In 2007, IBM formed a coalition of innovative utility companies to accelerate the use of smart grid technologies and move the industry forward through its most challenging transformation. The Global Intelligent Utility Network Coalition wants to change the way power is generated, distributed and used by adding digital intelligence to the current systems to reduce outages and faults, manage demand, and integrate renewable energy sources such as wind and solar power. Smart Grid is even being tested close to home at North Delhi Power Limited (NDPL), which is one of the biggest distributors of electrical power in Delhi, India.</p>
<div id="attachment_321" class="wp-caption aligncenter" style="width: 460px"><img class="size-full wp-image-321 " title="data mining" src="http://dataminingtools.net/blog/wp-content/uploads/2010/01/090328A01.jpg" alt="Image Credit: http://my.reset.jp/~adachihayao/" width="450" height="406" /><p class="wp-caption-text">Image Credit: http://my.reset.jp/~adachihayao/</p></div>
<p><strong>Health</strong></p>
<p>With the emergence of intelligent systems, we will soon see widespread use of machine intelligence in the fields of medicine and health. Work is already under way on using data mining to identify outbreaks of disease. Public health services are searching for explanations of disease clusters by identifying the common geographical, economic and social patterns found there. This will greatly help in identifying a disease outbreak in advance, in managing drugs in a region and in identifying preventive measures appropriate for a region.</p>
<p>Machine learning is increasingly being used in developing drugs that will be most effective for a person based on his/her DNA pattern. In the study of human genetics, an important goal is to understand the mapping between inter-individual variation in human DNA sequences and variability in disease susceptibility. In lay terms, it is to find out how the changes in an individual&#8217;s DNA sequence affect the risk of developing common diseases such as cancer. This is very important for improving the diagnosis, prevention and treatment of these diseases. The data mining technique used to perform this task is known as multifactor dimensionality reduction [2].</p>
<p><strong>Education</strong></p>
<p>Today, when a student makes career-related decisions, such as which courses to take and which career path will suit his/her interests in the long run, those decisions are, on a global scale, based most of the time on pure assumptions. We do have large sources of relevant information on the internet, but this information takes the form of scattered and unstructured data. Such assumption-based decisions often result in a wrong choice, which can affect one&#8217;s career as well as one&#8217;s behaviour in the long term. What we need is an intelligent system that uses the past record of a student, educational or otherwise, to assist him/her in this decision making. This can again be done with the help of machine learning, which will take records related to education as well as the psychology of the person as input and give a more logical, less assumption-based decision [3].</p>
<p><strong>Cyber Security</strong></p>
<p>With the fast integration of our lives with the internet, cyber security will be of utmost importance in the future. Though we have been able to take some measures in this regard, there are still loopholes that can pose a grave threat to an individual.</p>
<p>Some very innovative work is being done in the field of data mining and machine learning to tackle the issue of cyber security. The Army High Performance Computing Research Centre (AHPCRC) has developed an intrusion detection system called the <span style="text-decoration: underline;">M</span>innesota <span style="text-decoration: underline;">In</span>trusion <span style="text-decoration: underline;">D</span>etection <span style="text-decoration: underline;">S</span>ystem (MINDS), which uses advanced data mining techniques to detect cyber threats. Its anomaly detection system detects novel attacks and intrusions by identifying them as deviations from “normal”, i.e. as anomalous behavior. It does so by defining what is normal and treating anything with different characteristics as a deviation and hence as a possible threat.</p>
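<p>The idea of treating deviations from "normal" as possible threats can be sketched in a few lines of Python. This is not the MINDS algorithm; it is a deliberately simple stand-in that learns a baseline from normal traffic and flags values lying far from it, and the traffic numbers, function name and cut-off of three standard deviations are all assumptions made for illustration.</p>

```python
from statistics import mean, stdev


def find_anomalies(baseline, observations, max_deviations=3.0):
    """Return observations more than max_deviations standard deviations from the baseline mean."""
    centre = mean(baseline)   # what "normal" looks like on average
    spread = stdev(baseline)  # how much normal values vary
    if spread == 0:
        # A perfectly constant baseline: anything different is a deviation.
        return [x for x in observations if x != centre]
    return [x for x in observations if abs(x - centre) / spread > max_deviations]


# Made-up data: connections per minute seen during normal operation,
# followed by new measurements to be checked against that baseline.
normal_traffic = [48, 52, 50, 47, 53, 49, 51, 50]
new_traffic = [51, 49, 400, 50]
print(find_anomalies(normal_traffic, new_traffic))
```

<p>A real system would score many features of each connection instead of a single count, but the structure is the same: define normal first, then measure how far each new observation falls from it.</p>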
<p><strong>Space Exploration</strong></p>
<p>Space exploration is being given more and more importance as we try to figure out the laws which govern the universe, our origin and, ultimately, other pools of resources in our solar system. Until the last century, the main problem in space exploration was finding ways to gather more information. As we make advances in space technology, processing the large amounts of data we gather will become more and more complicated. The number of factors one has to study in the data is enormous. They can relate to physics, chemistry and maybe biology too.</p>
<div id="attachment_324" class="wp-caption aligncenter" style="width: 450px"><img class="size-full wp-image-324" title="SpaceShuttle2" src="http://dataminingtools.net/blog/wp-content/uploads/2010/01/SpaceShuttle2.jpg" alt="Photography: matricresearch.com" width="440" height="272" /><p class="wp-caption-text">Photography: matricresearch.com</p></div>
<p>Data mining provides the necessary assistance in this learning process. Data mining techniques are being used to study the elemental differences between the planets of the solar system, which are in turn being used to deduce the chronological order of the formation of the solar system as well as the core reasons for the presence of life on planet Earth.</p>
<p><strong>Security and counter-terrorism</strong></p>
<p>One of the issues many countries of the world will face is global terrorism. One way to tackle it is to have strong knowledge about suspicious identities and their associations. Two plausible data mining techniques in the context of combating terrorism are &#8220;pattern mining&#8221; and &#8220;subject-based data mining&#8221;.</p>
<p>In the context of pattern mining as a tool to identify terrorist activity, the National Research Council provides the following definition: <em>&#8220;Pattern-based data mining looks for patterns (including anomalous data patterns) that might be associated with terrorist activity — these patterns might be regarded as small signals in a large ocean of noise.&#8221;</em></p>
<p>&#8220;Subject-based data mining&#8221; is a data mining technique involving the search for associations between individuals in data. In the context of combating terrorism, the National Research Council provides the following definition: <em>&#8220;Subject-based data mining uses an initiating individual or other datum that is considered, based on other information, to be of high interest, and the goal is to determine what other persons or financial transactions or movements, etc., are related to that initiating datum.&#8221;</em></p>
<div id="attachment_320" class="wp-caption aligncenter" style="width: 310px"><img class="size-medium wp-image-320" title="fbi_key" src="http://dataminingtools.net/blog/wp-content/uploads/2010/01/fbi_key-300x136.gif" alt="Photography: wired.com" width="300" height="136" /><p class="wp-caption-text">Photography: wired.com</p></div>
<p>We will continue to face bigger and more complex challenges, and we will continue to make machines more intelligent so that they can be used as alternate brains or cyber brains.</p>
<p>References:</p>
<p><em>[1] <a href="http://www.ibm.com/smarterplanet/us/en/smart_grid/ideas/?&amp;re=spf">http://www.ibm.com/smarterplanet/us/en/smart_grid/ideas/?&amp;re=spf</a></em></p>
<p><em>[2] <a href="http://en.wikipedia.org/wiki/Data_mining">http://en.wikipedia.org/wiki/Data_mining</a></em></p>
<p><em>[3] <a href="http://www.educationaldatamining.org/EDM2009/uploads/proceedings/vialardi.pdf">http://www.educationaldatamining.org/EDM2009/uploads/proceedings/vialardi.pdf</a></em></p>
<p>- Kartik Rustagi, SDE Intern.</p>
]]></content:encoded>
			<wfw:commentRss>http://dataminingtools.net/blog/2010/01/03/the-datamining-journey-ahead/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>That&#8217;s one small step for robot, one giant leap for robotkind</title>
		<link>http://dataminingtools.net/blog/2010/01/03/thats-one-small-step-for-robot-one-giant-leap-for-robotkind/</link>
		<comments>http://dataminingtools.net/blog/2010/01/03/thats-one-small-step-for-robot-one-giant-leap-for-robotkind/#comments</comments>
		<pubDate>Sun, 03 Jan 2010 14:21:37 +0000</pubDate>
		<dc:creator>Vikramaditya Jakkula</dc:creator>
				<category><![CDATA[Events]]></category>
		<category><![CDATA[Machine Learning]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Review]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[Tools]]></category>
		<category><![CDATA[news]]></category>
		<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Market Research]]></category>

		<guid isPermaLink="false">http://dataminingtools.net/blog/?p=309</guid>
		<description><![CDATA[International Robot Exhibition 2009 finished with great success. The event was held from November 25 to November 28, 2009 at the Tokyo International Exhibition Center in Tokyo, Japan. The International Robot Exhibition 2009 show is designed to provide a place to exhibit robots and related equipment in order to enhance market awareness of new technology. [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.nikkan.co.jp/eve/irex/english/index.html" target="_blank">International Robot Exhibition 2009</a> finished with great success. The event was held from November 25 to November 28, 2009 at the Tokyo International Exhibition Center in Tokyo, Japan. The International Robot Exhibition 2009 show is designed to provide a place to exhibit robots and related equipment in order to enhance market awareness of new technology. At the same time, the show is meant to be a medium for promoting new products and developing new business by contributing to the promotion of new technology. Some of the highlights are illustrated below.</p>
<p>This is a clear indication of what we can expect in the near future. (Oh boy! Not another American Robot idol, The Amazing robot race, America&#8217;s next top robot model, or robot factor)</p>
<p>Read more about the event <a href="http://www.nikkan.co.jp/eve/irex/english/index.html" target="_blank">here</a>, and be sure to visit the compiled photo <a href="http://www.guardian.co.uk/technology/gallery/2009/dec/02/robots-japan?picture=356334757" target="_blank">gallery1</a> and <a href="http://pinktentacle.com/2009/11/photos-international-robot-exhibition-2009/" target="_blank">gallery2</a> for more pictures.</p>
<div id="attachment_311" class="wp-caption alignleft" style="width: 380px"><img class="size-full wp-image-311" title="CyberGlove-at-Internation-007" src="http://dataminingtools.net/blog/wp-content/uploads/2010/01/CyberGlove-at-Internation-007.jpg" alt="The robot hand is capable of 24 movements and can be remote-operated with the CyberGlove Photograph: Kim Kyung-hoon/Reuters" width="370" height="500" /><p class="wp-caption-text">The robot hand is capable of 24 movements and can be remote-operated with the CyberGlove Photograph: Kim Kyung-hoon/Reuters</p></div>
<div id="attachment_312" class="wp-caption alignleft" style="width: 460px"><img class="size-full wp-image-312 " title="Humanoid-industrial-robot-008" src="http://dataminingtools.net/blog/wp-content/uploads/2010/01/Humanoid-industrial-robot-008.jpg" alt="Humanoid industrial robot 'Motoman-SDA5D', developed by the Yaskawa Electric Corporation, demonstrates its capabilities with Lego Photograph: Dai Kurokawa/EPA" width="450" height="390" /><p class="wp-caption-text">Humanoid industrial robot &#39;Motoman-SDA5D&#39;, developed by the Yaskawa Electric Corporation, demonstrates its capabilities with Lego Photograph: Dai Kurokawa/EPA</p></div>
<div id="attachment_313" class="wp-caption alignleft" style="width: 448px"><img class="size-full wp-image-313" title="A-humanoid-robot-hip-hop--012" src="http://dataminingtools.net/blog/wp-content/uploads/2010/01/A-humanoid-robot-hip-hop-012.jpg" alt="A humanoid robot 'Manoi AT01', produced by Japan's toy robot maker Kyosho, performs a hip-hop dance Photograph: Yoshikazu Tsuno/AFP/Getty Images" width="438" height="390" /><p class="wp-caption-text">A humanoid robot &#39;Manoi AT01&#39;, produced by Japan&#39;s toy robot maker Kyosho, performs a hip-hop dance Photograph: Yoshikazu Tsuno/AFP/Getty Images</p></div>
]]></content:encoded>
			<wfw:commentRss>http://dataminingtools.net/blog/2010/01/03/thats-one-small-step-for-robot-one-giant-leap-for-robotkind/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The datamining journey so far ..</title>
		<link>http://dataminingtools.net/blog/2009/12/31/the-datamining-journey-so-far/</link>
		<comments>http://dataminingtools.net/blog/2009/12/31/the-datamining-journey-so-far/#comments</comments>
		<pubDate>Thu, 31 Dec 2009 12:09:41 +0000</pubDate>
		<dc:creator></dc:creator>
				<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Review]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[news]]></category>
		<category><![CDATA[Market Research]]></category>
		<category><![CDATA[Tools]]></category>

		<guid isPermaLink="false">http://dataminingtools.net/blog/?p=281</guid>
		<description><![CDATA[This new year, let us go through all the major developments that have taken place in the Data Mining industry over the years. Here is a quick glimpse:

A description:

1993: Development of WEKA begins. In 1993, the University of Waikato in New Zealand started development of the original version of Weka. Weka (Waikato [...]]]></description>
			<content:encoded><![CDATA[<p>This new year, let us go through all the major developments that have taken place in the Data Mining industry over the years. Here is a quick glimpse:</p>
<p><img class="alignleft size-full wp-image-303" title="datamining journey so far" src="http://dataminingtools.net/blog/wp-content/uploads/2009/12/final2-with-logo.png" alt="datamining journey so far" width="450" height="1157" /></p>
<p>A description:</p>
<table border="1" cellspacing="0" cellpadding="0" width="450">
<tbody>
<tr>
<td width="45" valign="top"><strong>1993</strong></td>
<td width="520" valign="top">
<ul>
<li><strong>Development   of WEKA begins: </strong>
<ul>
<li>In 1993, the University of Waikato in New Zealand started development of the original version of Weka. Weka (Waikato Environment for Knowledge Analysis) is a popular suite of machine learning software written in Java. It is free software available under the GNU General Public License.</li>
</ul>
</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top"><strong>1996</strong></td>
<td width="520" valign="top">
<ul>
<li><strong>CRISP-DM   is conceived</strong>
<ul>
<li>CRISP-DM stands for CRoss Industry Standard Process for Data Mining. It is a data mining process model that describes the approaches expert data miners commonly use to tackle problems. Polls conducted in 2002, 2004, and 2007 show that it is the leading methodology used by data miners.</li>
</ul>
</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top"><strong>1998</strong></td>
<td width="520" valign="top">
<ul>
<li><strong>KXEN  established</strong>
<ul>
<li>Founded in 1998, KXEN has corporate offices   in San Francisco, California and Paris, France, with Fortune 1000 customers   around the world.</li>
</ul>
</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top"><strong>1999</strong></td>
<td width="520" valign="top">
<ul>
<li><strong>CRISP-DM   1.0 released</strong>
<ul>
<li>After it was conceived in 1996, CRISP-DM got underway in 1997 as a European Union project under the ESPRIT funding initiative. The project was led by four companies: ISL, NCR Corporation, Daimler-Benz and OHRA. The first version of the methodology was released as CRISP-DM 1.0 in 1999.</li>
</ul>
</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top"><strong>2000</strong></td>
<td width="520" valign="top">
<ul>
<li><strong>The &#8216;R&#8217;   Project considered stable for production</strong>
<ul>
<li>R is an implementation of the S programming   language with lexical scoping semantics inspired by Scheme. R was created by   Ross Ihaka and Robert Gentleman at the University of Auckland, New Zealand,   and is now developed by the R Development Core Team.</li>
</ul>
</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top"><strong>2003</strong></td>
<td width="520" valign="top">
<ul>
<li><strong>Appricon   established</strong>
<ul>
<li>In order to provide a better data mining solution, Analysis Studio® and the Analysis Studio® end-to-end logistic regression modeling solution were woven into enterprise data mining projects in 2003.</li>
<li><strong>SAS 9.1 was released in 2003</strong></li>
</ul>
</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top"><strong>2004</strong></td>
<td width="520" valign="top">
<ul>
<li><strong>RapidMiner distributed with GNU license</strong>
<ul>
<li>The initial version has been under development by the Artificial Intelligence Unit of <a title="Dortmund University of Technology" href="http://en.wikipedia.org/wiki/Dortmund_University_of_Technology">University of Dortmund</a> since 2001. It is distributed under a <a title="GNU" href="http://en.wikipedia.org/wiki/GNU">GNU</a> license, and has been hosted by <a title="SourceForge" href="http://en.wikipedia.org/wiki/SourceForge">SourceForge</a> since 2004.</li>
<li><strong>SAS   9.1.2 was released in 2004.</strong></li>
</ul>
</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top"><strong>2005</strong></td>
<td width="520" valign="top">
<ul>
<li><strong>Amazon   launches Mechanical Turk</strong>
<ul>
<li>The service was launched publicly on November   2, 2005. In early- to mid-November 2005, there were tens of thousands of   HITs, all of them uploaded to the system by Amazon itself for some of its   internal tasks that required human intelligence. Most of these were related   to music CD items.</li>
<li>The number of Amazon&#8217;s Mechanical Turk HITs in the system decreased soon after its launch in November, and by December 20 there were fewer than 100 groups of HITs on the average page load.</li>
<li><strong>Weka   receives the SIGKDD Data Mining and Knowledge Discovery Service Award</strong></li>
<li><strong>SAS   9.1.3 was released in 2005.</strong></li>
</ul>
</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top"><strong>2006</strong></td>
<td width="520" valign="top">
<ul>
<li><strong>Work on   CRISP-DM 2.0 begins</strong>
<ul>
<li>In July 2006 the consortium of CRISP-DM   announced that it was going to start the process of working towards a second   version of CRISP-DM. On 26 September 2006, the CRISP-DM SIG met to discuss potential   enhancements for CRISP-DM 2.0 and the subsequent roadmap.</li>
<li><strong>Pentaho   acquires exclusive …..</strong>
<ul>
<li>In 2006, Pentaho Corporation acquired an   exclusive license to use Weka for business intelligence. It forms the data   mining and predictive analytics component of the Pentaho business   intelligence suite.</li>
</ul>
</li>
</ul>
</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top"><strong>2008</strong></td>
<td width="520" valign="top">
<ul>
<li><strong>COGNOS   acquired by IBM</strong>
<ul>
<li>Cognos (Cognos Incorporated) was an Ottawa,   Ontario based company making business intelligence (BI) and performance   management software. On January 31, 2008, Cognos was officially acquired by   IBM. The Cognos name continues to be used, being applied to IBM&#8217;s line of   business intelligence (BI) and performance management products.</li>
<li><strong>SAS 9.2 is the latest release (March 2008)</strong> and was demonstrated at SAS Global Forum (previously called SUGI) 2008.</li>
</ul>
</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top"><strong>2009</strong></td>
<td width="520" valign="top">
<ul>
<li><strong>PASW/   SPSS</strong>
<ul>
<li>PASW (formerly SPSS) is a computer program used for statistical analysis. Before 2009 it was called SPSS, but in 2009 it was re-branded as PASW (Predictive Analytics Software). The company announced on July 28, 2009, that it was being acquired by IBM for US$1.2 billion.</li>
</ul>
</li>
</ul>
</td>
</tr>
</tbody>
</table>
<p>Microsoft:</p>
<table border="1" cellspacing="0" cellpadding="0" width="450">
<tbody>
<tr>
<td width="54" valign="top">1996</td>
<td width="539" valign="top">
<ul>
<li>Microsoft   opens new team to build an OLAP product, codenamed Plato (permutation of   letters from OLAP)</li>
<li>Panorama   Software delegation meets with Microsoft</li>
<li>Microsoft   announces acquisition of Panorama Software development team</li>
</ul>
</td>
</tr>
<tr>
<td width="54" valign="top">1997</td>
<td width="539" valign="top">
<ul>
<li>OLAP   Services 7.0 (codename Sphinx) ships</li>
</ul>
</td>
</tr>
<tr>
<td width="54" valign="top">2000</td>
<td width="539" valign="top">
<ul>
<li><strong>Analysis Services 2000</strong> (codename Shiloh) ships
<ul>
<li>Microsoft Analysis Services is part of Microsoft SQL Server, a database management system. Microsoft has included a number of services in SQL Server related to Business Intelligence and Data Warehousing. These services include Integration Services and Analysis Services. Analysis Services includes a group of OLAP and Data Mining capabilities.</li>
<li>Microsoft Corp. announces the <strong>beta release of the OLE DB for Data   Mining specification</strong>, a protocol based on the SQL language, that provides   software vendors and application developers with an open interface to more   efficiently integrate data mining tools and capabilities into   line-of-business and e-commerce applications.</li>
</ul>
</li>
</ul>
</td>
</tr>
<tr>
<td width="54" valign="top">2001</td>
<td width="539" valign="top">
<ul>
<li>XML for   Analysis SDK 1.0 ships</li>
</ul>
</td>
</tr>
<tr>
<td width="54" valign="top">2004</td>
<td width="539" valign="top">
<ul>
<li>ADOMD.NET   and XML for Analysis SDK 1.1 ship</li>
</ul>
</td>
</tr>
<tr>
<td width="54" valign="top">2005</td>
<td width="539" valign="top">
<ul>
<li>Analysis   Services 2005 (codename Yukon) ships</li>
</ul>
</td>
</tr>
<tr>
<td width="54" valign="top">2008</td>
<td width="539" valign="top">
<ul>
<li>Analysis   Services 2008 (codename Katmai) ships</li>
</ul>
</td>
</tr>
<tr>
<td width="54" valign="top">2009</td>
<td width="539" valign="top">
<ul>
<li>Microsoft has decided to make the BI Conference   into a biennial event, with the next conference in 2010. For 2009, we are   excited to team with the Professional Association for SQL Server (PASS) to   expand the BI tracks at PASS Summit 2009 and help deliver the content that BI   architects, developers, and administrators need to get the most value from   their Microsoft SQL Server and BI-based solutions.</li>
<li><strong>PowerPivot </strong>gives users the power to create compelling self-service BI solutions,   facilitates sharing and collaboration on user-generated BI solutions in a   Microsoft SharePoint Server 2010 environment, and enables IT organizations to   increase operational efficiencies through Microsoft SQL Server 2008 R2-based   management tools.</li>
</ul>
</td>
</tr>
</tbody>
</table>
<p>Amazon:</p>
<table border="1" cellspacing="0" cellpadding="0" width="450">
<tbody>
<tr>
<td width="54" valign="top">2003</td>
<td width="539" valign="top">
<ul>
<li>&#8220;Search Inside the Book&#8221; is a   feature which allows customers to search for keywords in the full text of   many books in the catalog. The feature started with 120,000 titles (or   33 million pages of text) on October 23, 2003. There are currently about   250,000 books in the program. Amazon has cooperated with around 130 publishers to   allow users to perform these searches.</li>
</ul>
</td>
</tr>
<tr>
<td width="54" valign="top">2005</td>
<td width="539" valign="top">
<ul>
<li>In November 2005, Amazon.com began   testing Amazon Mechanical Turk, an application programming   interface (API) allowing programs to dispatch tasks to human processors.</li>
</ul>
</td>
</tr>
<tr>
<td width="54" valign="top">2006</td>
<td width="539" valign="top">
<ul>
<li>Amazon launched an online storage service called Amazon Simple Storage Service (Amazon S3). An unlimited number of data objects, from 1 byte to 5 gigabytes in size, can be stored in S3 and distributed via HTTP or <a title="BitTorrent (protocol)" href="http://en.wikipedia.org/wiki/BitTorrent_(protocol)">BitTorrent</a>. In April 2006, Amazon introduced Amazon Simple Queue Service (Amazon SQS), a distributed queue messaging service.</li>
</ul>
</td>
</tr>
<tr>
<td width="54" valign="top">2007</td>
<td width="539" valign="top">
<ul>
<li>In January 2007, Amazon launched <a title="Amapedia" href="http://en.wikipedia.org/wiki/Amapedia">Amapedia</a>, a collaborative wiki for user-generated content to replace ProductWiki.</li>
<li>In December 2007, Amazon introduced <a title="SimpleDB" href="http://en.wikipedia.org/wiki/SimpleDB">SimpleDB</a>, a database system that gives users of its other infrastructure access to a high-reliability, high-performance database.</li>
</ul>
</td>
</tr>
<tr>
<td width="54" valign="top">2008</td>
<td width="539" valign="top">
<ul>
<li>Amazon   Web Services launched a public beta of Amazon Elastic Compute Cloud running   Microsoft Windows Server and Microsoft SQL Server.</li>
</ul>
</td>
</tr>
</tbody>
</table>
<p>Yahoo!:</p>
<table border="1" cellspacing="0" cellpadding="0" width="450">
<tbody>
<tr>
<td width="45" valign="top">2002</td>
<td width="539" valign="top">
<ul>
<li><strong>Yahoo!   HotJobs</strong>, previously known as hotjobs.com, is an online job search engine.   It has been known as Yahoo! HotJobs since being acquired by Yahoo! in 2002.   Yahoo! HotJobs provides tools and advice for job seekers, employers, and   staffing firms.</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2003</td>
<td width="539" valign="top">
<ul>
<li>Yahoo! Introduces Smartsort Technology:   Personalized Product Recommendation Tool
<ul>
<li>The new Yahoo! Product Search powers the   redesigned Yahoo! Shopping, providing consumers with the most comprehensive   and relevant comparison-shopping site on the Web. The redesigned Yahoo!   Shopping now boasts a variety of comparison-shopping features including:   side-by-side product comparison, detailed buyer&#8217;s guides, tax and shipping   calculator tool, consumer product and merchant ratings, unbiased expert   product reviews etc. Yahoo! Shopping is the third largest multi-category   commerce destination on the Web. (Nielsen//NetRatings, August 2003)</li>
</ul>
</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2004</td>
<td width="539" valign="top">
<ul>
<li>Yahoo! Launches <strong>SmartView </strong>Technology: New Mapping Feature Creates Customized   Visual Search Capability</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2005</td>
<td width="539" valign="top">
<ul>
<li>Yahoo! Search Launches <strong>Search Subscriptions Beta</strong>, Providing Select Deep Web Content to   Users</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2006</td>
<td width="539" valign="top">
<ul>
<li>Yahoo! Opens <strong>Internet Time Capsule </strong>to Capture Life in 2006
<ul>
<li>SUNNYVALE, Calif., October 10, 2006 – Yahoo!   Inc. (Nasdaq:YHOO) today announced the launch of what is expected to be the   world&#8217;s largest time capsule in history. Starting today, Yahoo! is   encouraging people from around the world to contribute personal photos,   stories, thoughts, ideas, poems, home movies and art to this first-ever   electronic&#8230;</li>
</ul>
</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2007</td>
<td width="539" valign="top">
<ul>
<li><strong>Yahoo! Pipes</strong>: Yahoo! Pipes was released to the public in beta on 7 February 2007. Yahoo! Pipes is a web application from Yahoo! that provides a graphical user interface for building data mashups that aggregate web feeds, web pages, and other services, creating Web-based apps from various sources, and publishing those apps. The application works by enabling users to &#8220;pipe&#8221; information from different sources and then set up rules for how that content should be modified (for example, filtering).</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2008</td>
<td width="539" valign="top">
<ul>
<li>The software, called <strong>Hadoop</strong>, is part of Yahoo&#8217;s massive computing grid and is transforming the way Yahoo and corporate giants such as IBM extract meaning from enormous streams of data. Universities are also using the code &#8211; an open-source version of software Google relies on for daily operation &#8211; to train a new generation of computer scientists and engineers. On February 19, 2008, Yahoo! launched what it claimed was the world&#8217;s largest Hadoop production application. The Yahoo! Search Webmap is a Hadoop application that runs on a <a title="Linux" href="http://en.wikipedia.org/wiki/Linux">Linux</a> <a title="Cluster (computing)" href="http://en.wikipedia.org/wiki/Cluster_(computing)">cluster</a> with more than 10,000 <a title="Multi-core" href="http://en.wikipedia.org/wiki/Multi-core">cores</a> and produces data that is now used in every Yahoo! Web search query.</li>
</ul>
<ul>
<li><strong>Yahoo joins OpenSocial</strong>: On March 25, 2008, Yahoo! announced that it had joined the initiative. OpenSocial is a set of common application programming interfaces (APIs) for web-based social network applications, developed by Google along with MySpace and a number of other social networks. It was released on November 1, 2007. Applications implementing the OpenSocial APIs will be interoperable with any social network system that supports them, including features on sites such as Hi5.com, MySpace, orkut, Netlog, Sonico.com, Friendster, Ning and Yahoo!.</li>
</ul>
<ul>
<li>Yahoo! Inc. announces the general availability of <strong>Fire Eagle</strong> (http://fireeagle.yahoo.net), an open platform that helps users take their location to the Web while giving them the ability to easily control how and where their location data is shared.</li>
</ul>
<ul>
<li><strong>Yahoo!   Opens Up Search Technology Infrastructure for Innovative, New Search   Experiences, Providing Third Parties with Unprecedented Access, Re-Ranking   and Presentation Control of Web Search Results:</strong>
<ul>
<li><strong>BOSS: Build your own search service</strong>: The main goal of BOSS is to give users, in this case developers, free access to the <a title="Yahoo! Search" href="http://en.wikipedia.org/wiki/Yahoo!_Search">Yahoo! Search</a> <a title="Index (search engine)" href="http://en.wikipedia.org/wiki/Index_(search_engine)">index</a>. The results can be fed into the developer&#8217;s website or program so that they can manipulate the resources according to their product&#8217;s requirements. BOSS can return results as <a title="XML" href="http://en.wikipedia.org/wiki/XML">XML</a>, <a title="JSON" href="http://en.wikipedia.org/wiki/JSON">JSON</a>, <a title="HTML" href="http://en.wikipedia.org/wiki/HTML">HTML</a>, or <a title="Text" href="http://en.wikipedia.org/wiki/Text">text</a>, and also supports the comprehensive search features of Yahoo!, such as pulling the results by pages, searching inside <a title="PDF" href="http://en.wikipedia.org/wiki/PDF">PDF</a>, etc. The ranking of the websites for a search term is the same as the <a title="Yahoo! Search" href="http://en.wikipedia.org/wiki/Yahoo!_Search">Yahoo! Search</a> ranking, since both pull from the same index and ranking.</li>
</ul>
</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2009</td>
<td width="539" valign="top">
<ul>
<li>On June   10, 2009, Yahoo! released its own distribution of Hadoop.</li>
</ul>
</td>
</tr>
</tbody>
</table>
<p>Google:</p>
<table border="1" cellspacing="0" cellpadding="0" width="450">
<tbody>
<tr>
<td width="45" valign="top">1998</td>
<td width="539" valign="top">
<ul>
<li>Google   sets up workspace in Susan Wojcicki&#8217;s garage at <a href="http://maps.google.com/maps?q=232+santa+margarita,+menlo+park+ca&amp;ie=UTF8&amp;oe=utf-8&amp;client=firefox-a&amp;ll=37.457861,-122.163312&amp;spn=0.008431,0.019999&amp;z=16&amp;iwloc=addr">232 Santa Margarita, Menlo Park</a>.</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2000</td>
<td width="539" valign="top">
<ul>
<li>The   first <a href="http://www.google.com/press/pressrel/pressrelease22.html">10 language versions of Google.com</a> are   released</li>
<li>Google   forges a <a href="http://www.google.com/press/pressrel/pressrelease25.html">partnership with Yahoo!</a> to   become their default search provider.</li>
<li>Google   search index reaches 1 billion pages</li>
<li>Google   toolbar is launched</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2001</td>
<td width="539" valign="top">
<ul>
<li>Image   Search <a href="http://www.google.com/googlefriends/jul2001.html">launches</a>, offering access to 250 million   images.</li>
<li>Google   is available in 26 languages</li>
<li>Search   index reaches 3 billion mark.</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2002</td>
<td width="539" valign="top">
<ul>
<li>The   first Google hardware is <a href="http://news.cnet.com/Google-aims-search-device-at-companies/2100-1023_3-833905.html">released</a>: it&#8217;s a yellow box   called the <a href="http://www.google.com/enterprise/gsa/">Google Search Appliance</a> that   businesses can plug into their computer network to enable search capabilities   for their own documents.</li>
<li>Google   releases a <a href="http://www.google.com/press/pressrel/select.html">major overhaul</a> for <a href="https://adwords.google.com/">AdWords</a>,   including new cost-per-click pricing.</li>
<li>Google   releases a <a href="http://www.infoworld.com/articles/hn/xml/02/04/11/020411hngoogleapi.html">set of APIs</a>, enabling   developers to query more than 2 billion web documents and program in their   favorite environment, including Java, Perl and Visual Studio.</li>
<li>Users   can search for stuff to buy with <a href="http://searchenginewatch.com/showPage.html?page=2161381">Froogle</a> (later   called <a href="http://www.google.com/products">Google Product Search</a>).</li>
<li>Partnership   with AOL</li>
<li>Google   Labs is launched</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2003</td>
<td width="539" valign="top">
<ul>
<li>Google announces a new <a href="http://www.google.com/press/pressrel/advertising.html">content-targeted advertising service</a>, enabling publishers large and small to access Google&#8217;s vast network of advertisers. (Weeks later, on April 23, Google acquired Applied Semantics, whose technology bolsters the service named <a href="https://www.google.com/adsense">AdSense</a>.)</li>
<li>Google   acquires blogger.com</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2004</td>
<td width="539" valign="top">
<ul>
<li>Search   index reaches 8 billion</li>
<li>Orkut   released</li>
<li>Keyhole   Acquired</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2005</td>
<td width="539" valign="top">
<ul>
<li>Urchin   acquired</li>
<li>Google   Maps, code.google.com launched</li>
<li>Google   image search boasts of 1.1 billion images.</li>
<li>iGoogle   launched</li>
<li>Google   Earth, Google talk launched</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2006</td>
<td width="539" valign="top">
<ul>
<li>YouTube   acquired</li>
<li>Jotspot   acquired</li>
<li>Google   docs and spreadsheets launched</li>
<li>Google   custom search launched</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2007</td>
<td width="539" valign="top">
<ul>
<li>Google   hot trends launched</li>
<li>Partnership   with salesforce.com</li>
<li>Postini   acquired</li>
<li>Joint   supercomputing project with IBM</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2008</td>
<td width="539" valign="top">
<ul>
<li>DoubleClick   acquired</li>
<li>Google   index: 1 trillion</li>
<li>Google   Chrome browser launched</li>
<li>Google   tracks flu trends</li>
</ul>
</td>
</tr>
</tbody>
</table>
<p>IBM:</p>
<table border="1" cellspacing="0" cellpadding="0" width="450">
<tbody>
<tr>
<td width="45" valign="top">1995</td>
<td width="539" valign="top">
<ul>
<li>IBM acquires Lotus</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">1996</td>
<td width="539" valign="top">
<ul>
<li>IBM launches its DB2 relational database.</li>
<li>IBM acquires Tivoli.</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">1998</td>
<td width="539" valign="top">
<ul>
<li>IBM launches the PowerPC 740/750 processors,   the world&#8217;s first manufactured using IBM&#8217;s copper manufacturing technology.</li>
<li>Two new AS/400s are introduced, as well as new products in the Aptiva, PC, and ThinkPad series. The IBM S/390 computing system for business is also launched.</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">1999</td>
<td width="539" valign="top">
<ul>
<li>The S/390 G6 server, using IBM&#8217;s copper technology, is introduced.</li>
<li>IBM and Dell sign a $16 billion technology agreement, under which Dell will purchase IBM components for use in Dell systems.</li>
<li>IBM and Lotus found the Institute for Knowledge   Management.</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2000</td>
<td width="539" valign="top">
<ul>
<li>IBM launches the NetVista line of PC devices.</li>
<li>IBM launches the eServer line.</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2002</td>
<td width="539" valign="top">
<ul>
<li>Product offerings during 2002 include the   eServer p650 eight-way UNIX server, the eServer i890, and the IBM eServer xSeries   440.</li>
<li>IBM acquires PricewaterhouseCoopers&#8217; business consulting and technology services unit for $3.5 billion in cash and stock.</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2003</td>
<td width="539" valign="top">
<ul>
<li>IBM and Cisco announce a set of open software   technologies designed to advance the development of &#8220;self-healing&#8221;   computer systems and networks.</li>
<li>IBM and Siebel launch CRM OnDemand.</li>
<li>IBM launches its WebSphere business integration   software.</li>
<li>Japan&#8217;s largest research organization orders   an AMD Opteron based eServer 325 supercomputer, running Linux.</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2005</td>
<td width="539" valign="top">
<ul>
<li>IBM plans to expand its data-integration   product line through a $1.1 billion acquisition of Ascential Software Corp.</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2007</td>
<td width="539" valign="top">
<ul>
<li>Google and I.B.M. join hands in ‘Cloud Computing’ research</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2008</td>
<td width="539" valign="top">
<ul>
<li>Researchers with IBM have developed a new set   of software applications designed to improve the human memory. The software   is designed to run on a smartphone or mobile handset and analyze collected   pieces of data. The collected data is then used to help the user better   remember faces and other information such as conversations.</li>
</ul>
</td>
</tr>
<tr>
<td width="45" valign="top">2009</td>
<td width="539" valign="top">
<ul>
<li>IBM boasts that its so-called Sequoia system   will be capable of crunching numbers 20 times faster than IBM&#8217;s last   record-breaker and 15 times faster than the current fastest machine.</li>
</ul>
</td>
</tr>
</tbody>
</table>
<p>Sources:</p>
<ul>
<li><a href="/DataMiningTools/top%2015%20datamining%20companies%20blog%20post/wikipedia.org">Wikipedia</a></li>
<li><a href="http://docs.yahoo.com/info/pr/releases.html">Yahoo! Media Relations</a></li>
<li><a href="http://www.microsoft.com/Presspass/default.mspx">Microsoft PressPass</a></li>
<li><a href="http://www.google.com/press/">Google Press Center</a></li>
<li><a href="http://www.google.com/corporate/timeline/#start">Google Timeline</a></li>
<li><a href="http://powerpivotpro.com/">PowerPivotPro.com</a></li>
<li><a href="http://research.microsoft.com/en-us/">Microsoft Research</a></li>
<li><a href="http://phx.corporate-ir.net/phoenix.zhtml?p=irol-mediaHome&amp;c=176060">Amazon Media Room</a></li>
<li><a href="http://www.informationweek.com/">InformationWeek</a></li>
<li><a href="http://www.nytimes.com/">NYTimes</a></li>
</ul>
<p>&#8211;  SAGAR JAUHARI, SDE Intern.</p>
]]></content:encoded>
			<wfw:commentRss>http://dataminingtools.net/blog/2009/12/31/the-datamining-journey-so-far/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>The path of Business Intelligence</title>
		<link>http://dataminingtools.net/blog/2009/10/04/the-path-of-business-intelligence/</link>
		<comments>http://dataminingtools.net/blog/2009/10/04/the-path-of-business-intelligence/#comments</comments>
		<pubDate>Sun, 04 Oct 2009 16:53:26 +0000</pubDate>
		<dc:creator>Vikramaditya Jakkula</dc:creator>
				<category><![CDATA[Review]]></category>
		<category><![CDATA[Tools]]></category>
		<category><![CDATA[news]]></category>
		<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[Market Research]]></category>

		<guid isPermaLink="false">http://dataminingtools.net/blog/?p=163</guid>
		<description><![CDATA[What is Business Intelligence (BI)?
&#8220;Business Intelligence (BI) helps business people make more informed decisions by providing them timely, data-driven answers to their business questions. BI analyzes data stored in data warehouses, operational databases, and/or ERP systems (i.e. SAP®, Oracle, JD Edwards, Peoplesoft) and transforms it into attractive and easy to understand dashboards and reports. BI delivers [...]]]></description>
			<content:encoded><![CDATA[<p><strong>What is Business Intelligence (BI)?</strong></p>
<p>&#8220;Business Intelligence (BI) helps business people make more informed decisions by providing them timely, data-driven answers to their business questions. BI analyzes data stored in data warehouses, operational databases, and/or ERP systems (i.e. SAP®, Oracle, JD Edwards, Peoplesoft) and transforms it into attractive and easy to understand dashboards and reports. BI delivers the insight needed to make strategic planning decisions, improve operational efficiencies, and optimize business processes.&#8221; -<a href="http://www.microstrategy.com/Business-Intelligence/">Microstrategy</a>.</p>
<p><strong>What are BI tools?</strong></p>
<p>Business Intelligence (BI) tools  are a set of software systems and practices that enable organizations to analyze data, and make better decisions based on the insight from that information. Companies can use this insight to take the following steps to improve overall corporate performance:</p>
<ul>
<li>Enhance cost-efficiency and productivity</li>
<li>Build strong customer relationships</li>
<li>Optimize revenue-generating strategies</li>
<li>Increase revenue and maximize profitability</li>
<li>Monitor trends and discover anomalies</li>
<li>Forecast business opportunities</li>
<li>Maintain compliance and perform risk management</li>
</ul>
<p><strong>Categories of BI Tools: </strong></p>
<ul>
<li><a title="Spreadsheets" href="http://en.wikipedia.org/wiki/Spreadsheets">Spreadsheets</a></li>
<li><a title="List of reporting software" href="http://en.wikipedia.org/wiki/List_of_reporting_software">Reporting and querying software</a></li>
<li><a title="OLAP" href="http://en.wikipedia.org/wiki/OLAP">OLAP</a></li>
<li><a title="Dashboards (management information systems)" href="http://en.wikipedia.org/wiki/Dashboards_(management_information_systems)">Digital Dashboards</a></li>
<li><a title="Data mining" href="http://en.wikipedia.org/wiki/Data_mining">Data mining</a></li>
<li><a title="Process mining" href="http://en.wikipedia.org/wiki/Process_mining">Process mining</a></li>
<li><a title="Business performance management" href="http://en.wikipedia.org/wiki/Business_performance_management">Business performance management</a></li>
<li><a title="Local information systems" href="http://en.wikipedia.org/wiki/Local_information_systems">Local information systems</a></li>
</ul>
<p><strong>State of the art of BI in industry today:</strong></p>
<p>In short, according to market research, most companies <a href="http://www.reuters.com/article/pressRelease/idUS159572+22-Jan-2008+BW20080122" target="_blank">do not use BI to its fullest potential</a>.</p>
<p>Some stated reasons include:</p>
<ul>
<li>Lack of proper training, followed by limited staffing resources.</li>
<li>Most custom reports remain very sophisticated.</li>
<li>IT staff still create the majority of BI reports, followed by business analysts.</li>
<li>Dissatisfaction with BI technology is found among all user communities.</li>
<li>The perception that BI applications can create more work.</li>
</ul>
<p>The latest development is the move of BI from industry into everyday life.</p>
]]></content:encoded>
			<wfw:commentRss>http://dataminingtools.net/blog/2009/10/04/the-path-of-business-intelligence/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Advertise the data mining way</title>
		<link>http://dataminingtools.net/blog/2009/09/23/advertising-the-data-mining-way/</link>
		<comments>http://dataminingtools.net/blog/2009/09/23/advertising-the-data-mining-way/#comments</comments>
		<pubDate>Thu, 24 Sep 2009 04:32:36 +0000</pubDate>
		<dc:creator></dc:creator>
				<category><![CDATA[Advertising]]></category>
		<category><![CDATA[news]]></category>
		<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[Market Research]]></category>

		<guid isPermaLink="false">http://dataminingtools.net/blog/?p=80</guid>
		<description><![CDATA[
A discussion room in an advertising company will now include a few data miners as well. Yes, this is the latest trend. The first question that comes to mind is “why data miners in an ad agency?” The simplest answer is that the company [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft size-full wp-image-94" title="office" src="http://dataminingtools.net/blog/wp-content/uploads/2009/09/office1.jpg" alt="office" width="450" height="500" /></p>
<p>A discussion room in an advertising company will now include a few data miners as well. Yes, this is the latest trend. The first question that comes to mind is “why data miners in an ad agency?” The simplest answer is that the company wants to market ads based on user needs. Marketing grounded in analysis of user data should bear tastier fruit. But why not have a market researcher do the data analysis? Ad agencies are looking ahead and going one step further: they are paying close attention to user needs. Does optimization of ads ring a bell? Also, with the recession gripping the market, companies are seeking every advantage they can get, even combining marketing with digital technology for better ads and better ROI.</p>
<p><strong>What has data mining got to do with marketing?</strong></p>
<p>The advent of digital media has greatly increased the need for smarter ways of advertising over the internet. Advertisers have started to thoroughly examine and debate data mining and other new sciences that will shape the interactive marketplace. The topic is sure to be broached in discussions about consumer trust, and maybe even in a Facebook session aptly titled “Knowing is Better.”</p>
<p>A case in point…</p>
<p>An ironic prelude to the week-long fete of advertising’s digital future was the Sept. 18 settlement of a privacy lawsuit related to Facebook’s social ad experiment, Beacon. The short-lived, poorly executed program riled online consumers, whose purchase information with off-site retailers such as Zappos and Blockbuster was unexpectedly shared with their Facebook friends. Their only recourse was to “opt out” of the program after the damage was done. While there are mounting examples of online consumers trading their personal information and privacy for more targeted interactive results, Beacon assumed too much with its initial tacit user approval.</p>
<p><strong>Going the giants’ way…</strong></p>
<p>Major internet giants such as Google, Yahoo and Microsoft have already gone far in working with user data. They have reports describing how their research on user data has proven fruitful. User insights, social connections, personal preferences and buying history, which Amazon and Google already masterfully manipulate, are building blocks for an interactive economy that relentlessly exploits links to generate revenues.</p>
<p>Forrester Research shows that even as digital grows from 12 percent of existing overall advertising spend to 21 percent (or $55 billion) by 2014, there is a pressing need for companies to master constructive interactive relationships with consumers and each other to generate many times that in digital sales and other transactions.</p>
<p><strong>So why this sort of marketing now?</strong></p>
<p>Though the recession has considerably reduced both company advertising budgets and consumer spending, there are signs that a few marketers are steering consumers and technology toward the interactive future.</p>
<p>What exactly will advertisers and media do with those interactive connections, and the insights and information they yield? What do you think?</p>
<p><strong>-Vidhya, Student Intern</strong></p>
]]></content:encoded>
			<wfw:commentRss>http://dataminingtools.net/blog/2009/09/23/advertising-the-data-mining-way/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

