Archive for the ‘news’ Category

SAP buying Sybase: A $5.8 Billion Deal!

Friday, May 14th, 2010

SAP buying Sybase for $5.8 billion. The acquisition will allow SAP to own mobile and cloud computing technology which would aid its products future development. The deal will also provide substantial revenue streams and technology stake-hold that it can use to stay competitive with other industry giants.

About SAP:

Founded in 1972, SAP has a rich history of innovation and growth as a true industry leader. SAP currently has sales and development locations in more than 50 countries worldwide and is listed on several exchanges, including the Frankfurt Stock Exchange and NYSE under the symbol “SAP.”

About Sybase:

Sybase has been a leader in developing and expanding innovative database technology. Since our founding in a Berkeley, Calif., home in 1984, we have earned the trust of many of the world’s leading companies for our ability to manage information and deliver unsurpassed levels of data reliability and security. Today, Sybase leads the industry in delivering enterprise software to manage, analyze and mobilize information. We are recognized globally as a performance leader, proven in the most data-intensive industries and across all major systems, networks and devices.

[eWeek]

  • Share/Bookmark

Can academia projects help businesses? UoA professor says yes!

Monday, March 29th, 2010

Text mining tools can summarize and look for patterns within large electronic documents. Such tools are still expensive and difficult to use on large scale. But a group of researchers, including one’s at the University of Alberta, are hoping to change that.

A University of Alberta professor is helping to create text analysis tools to deeply examine historical trial accounts from the U.K.’s famous Old Bailey criminal court. While the research project is important to academia, the Edmonton-based researcher said that improving the quality of text mining tools could have benefits for businesses as well.

While academia are developing tools like TAPoR, a textual analysis tool that can summarize a body of text, find collocates, identify important dates, and discover the co-occurrences of two target words, the same could be applied to business records as well. Some of the tools in TAPoR use forms of visualization to help researchers grasp the data even clearer.

[IT Canada]

  • Share/Bookmark

Machine Learning at Stanford University

Sunday, March 28th, 2010

This time we bring to you an online video tutorial on CS 229: Machine Learning course from Stanford University. Besides 19 Lectures on machine learning you will be provided with relevant lecture notes, links other related miscellaneous stuff . Professor Andrew Ng will introduce  machine learning and statistical pattern recognition which will cover topics like supervised learning, unsupervised learning, learning theory, reinforcement learning and adaptive control. Other applications of machine learning, such as to robotic control, data mining, autonomous navigation, bio informatics, speech recognition, and text and web data processing are also discussed.

This was the first lecture:- (course materials) The first 30 minutes or so of this lecture is introduction to the course and the field of machine learning in general.

  • In the rest of the lecture, the four main parts of the course is described in some detail along with illustrative examples :-Supervised learning: providing the algorithm a data set, supervising- learn the association between input & output, regression problems, classification problems, support vector machines- infinite number of features
  • Learning theory
  • Unsupervised learning: clustering, Cocktail party problem, independent component analysis Reinforcement learning: reward function (good dog, bad dog), feedback function

In the second lecture, Professor Andrew Ng will cover topics like linear regression, gradient descent, and normal equations and discusses how they relate to machine learning.

For more  visit the home page: http://cs229.stanford.edu/

  • Share/Bookmark

Annual survey shows the importance & high spirits towards data mining

Sunday, March 28th, 2010

As a service to the data mining community, RexerAnalytics conducts an annual online survey (started in 2009). It analyzes some factors like experiences, priorities, views and challenges being faced by data mining industry. The Third Annual Data Miner Survey results were announced after studying the reports of 710 respondents from the data mining community. It was concluded that data miners and their organization’s are highly confident and happy with their services and analytic capabilities giving a feedback of  “above average” or “excellent” performances. Most of them even assured that economy conditions will never be a set back or a weak point for them. According to the survey result, the most commonly used and most satisfying primary data mining tools this year are IBM SPSS Modeler (SPSS Clementine), Statistica, and IBM SPSS Statistics (SPSS Statistics). Open source tool Weka is increasingly used by both academic and for-profit data miners. SAS Enterprise Miner dropped in data miner’s tool rankings this year.

Some highlights:

  • 40-item survey of data miners, conducted on-line in early 2009.
  • 710 participants from 58 countries.
  • Data miners’ most commonly used algorithms are regression, decision trees,
    and cluster analysis.
  • Half of data miners say their results are helping to drive strategic
    decisions and operational processes.
  • 58% say they are adding to the knowledge base in the field.
  • 60% of respondents say the results of their modeling are deployed
    always or most of the time.
  • Most data miners feel that the economy will not negatively impact them.
  • Almost half of industry data miners rate the analytic capabilities of their
    company as above average or excellent.  But 19% feel their company has
    minimal or no analytic capabilities.
  • The top challenges facing data miners are dirty data, explaining data mining
    to others, and difficult access to data.  However, in 2009 fewer data miners
    listed data quality and data access as challenges than in the previous year.
  • IBM SPSS Modeler (SPSS Clementine), Statistica, and IBM SPSS Statistics
    (SPSS Statistics) are identified as the “primary tools” used by the most data
    miners.
  • Open-source tools Weka and R made substantial movement up data
    miner’s tool rankings this year, and are now used by large numbers of
    both academic and for-profit data miners.
  • SAS Enterprise Miner dropped in data miner’s tool rankings this year.
  • Users of IBM SPSS Modeler, Statistica, and Rapid Miner are the most
    satisfied with their software.
  • Fields & Industries:  Data mining is everywhere.  The most sited areas are
    CRM / Marketing, Academic, Financial Services, & IT / Telecom.  And in the
    for-profit sector, the departments data miners most frequently work in are
    Marketing & Sales and Research & Development.

[RexterAnalytics]

  • Share/Bookmark