R – your tool for data analysis

R‬ is a language and environment for Statistical Computing and Graphics. R provides a free/open source, cross­-platform, object-oriented environment to perform data analysis and visualisation tasks. Strength of R lies in its vibrant community, robust package repository and strong graphics capabilities. ‪

R provides all necessary tools required for various stages of a data analysis project. It provides techniques for data acquisition and processing as well as for data analysis and visualisation. It ranges from accessing data in various formats (CSV, XML…) to all possible ways of data manipulation (tabulation, aggregation…)to rich support for graphics (histogram, box plot etc.) to statistical models (regression, ANOVA…).

It is not always necessary to use built-in and supported functions and packages. Depending on the requirements, one can also develop his/her own functions, scripts and packages.

Recently, CDAC Mumbai has announced a 3-day course on R entitled “Using R for Data Visualisation and Analytics”. This course is aimed to cover in detail the features of R related to data analysis and visualisation. More details can be accessed here.

AlphaGo: Possible repercussions and India

ET’s editorial on March 11, 2016 talks about AlphaGo and draws some interesting sketches. At one point, “An AI-run factory, goes a joke, employs just a man and a dog. The dog’s job is to keep the man away from the factory. Why have the man at all, in that case? Someone has to feed the dog.”,
From the same editorial – A possible scenario for India: “AI will enhance productivity and profits for all companies that can master it and deploy it. Much of India’s advanced IT services industry might get replaced by AI, unless industry itself deploys AI. Indian universities have to teach and advance AI in all its myriad forms. India’s human intelligence potential must be realized, for the Indian economy to benefit from AI rather than be its victim.”
Lets wait and watch how it unfolds.

Call for Participation – Short-term Courses on Data Science at CDAC, Kharghar, Navi Mumbai

CFP-img

We are living in a Data Age. Data is being continuously generated and consumed in various formats, and sizes from a number of varied sources. This data can be a big asset if stored, processed and analysed efficiently in real time with the help of intelligent algorithms. There is a growing interest to utilize such data for the improvement of business, health, education, society, etc. There are many ways to process and analyse such data spanning techniques like data visualisation, text analysis, predictions and recommendations etc. Applications of these techniques can give companies and organisations valuable insights leading to competitive advantage, efficient service delivery and above all customer satisfaction. And so the demand for skilled resources in these fields is growing day by day.

With this view, CDAC, Mumbai is announcing the following short-term courses in Data Science and Machine Learning.

  1. Using R for data visualization and analytics: This course introduces R – a language and environment for Statistical Computing and Visualisation. In recent years, R has become very popular due its open source cross-platform nature, robust package repository and strong graphics capabilities. During the course, one will not only learn about basics of R, but also about techniques of data acquisition and processing. Course will also cover in detail the features of R related to data analysis and visualisation.
  2. Text Analytics: The course aims to provide learners an understanding of the methods for text analytics. It will cover major techniques for mining and analyzing text data to discover interesting patterns, extract useful knowledge, and support decision making. The techniques will include Named Entity Recognition, Sentiment Analysis and Text Categorization among others. Learners will also be introduced to various open source utilities for developing text analytics applications.
  3. Predictive Analytics and Recommender Systems: The course covers various methods of Predictive Analytics and Recommender Systems drawn from Statistics, Data Mining, and Machine Learning. We will discuss popular algorithms in the domain and their use in various applications. The course emphasizes hands-on approach for better understanding of the techniques used in the domain. During the course, mainly open-source tools will be used for illustrations and lab.

Target Audience: Individuals, students, and professionals from government, industry, and academia working / interested in Data Science

Courses Schedule:

Course Name Using R for data visualization and analytics Text Analytics Predictive Analytics and Recommender Systems
Course Dates May 19 – 21, 2016 June 16 – 18, 2016 July 14 – 16, 2016
Final Registration Date May 04, 2016 June 01, 2016 June 30, 2016

Registration Process: Registration fee per course for a candidate is Rs. 7500/-. For more details about registration and payment process, please visit http://www.kbcs.in/datascience.

Note: Registration will be on first come first serve basis. Final participation in any of the courses will be subject to the realization of payment of applicable registration fee.

For More details, please contact:

Centre for Development of Advanced Computing (Formerly NCST)

Near Bharati Vidyapeeth, Raintree Marg, Sector 7, CBD Belapur,

Navi Mumbai – 400614, Maharashtra, INDIA

Telephone: + 91-22-27565303/304/305

Fax: +91-22-27565004

email: kbcs@cdac.in

URL: http://www.kbcs.in/datascience

Business Intelligence And Data Warehouse Workshop being organized by CDAC, Mumbai

C-DAC Mumbai (formerly NCST) and Computer Society of India (CSI) are jointly organizing a four days intensive workshop on BI DW: Tools and Techniques from December 01-04, 2010. The Knowledge Based Computer Systems division at C-DAC Mumbai carries out research and development in core areas of Natural Language Processing, Expert Systems, Case Based Reasoning, Information Retrieval, Data Mining, Soft Computing and Planning and Scheduling.

The workshop intends to provide a comprehensive overview of the field of Business Intelligence and Data Warehousing with hands-on practical experience on tools like Weka, Mondrian and Kettle. The workshop will also provide an opportunity for interactions among the participants from different academic institutions, research organizations and industry practitioners in India. The workshop plans to invite distinguished researchers and professionals to give keynote lectures in emerging Business Intelligence and Data Warehousing technologies.

Many organizations are awash with data, but starving for information and getting out intelligence of it. However, some organizations have successfully developed certain analytical competencies and gets the benefits in this competitive edge. Intelligence can make good impact on a organization growth.

With this view, this workshop provides an opportunity to study Business Intelligence and Data Warehousing in detail and promote the usage of Data Warehousing and Data Mining concepts in projects with Gigabyte or Terabyte of data.

Target Audience:

  • Software Developers/ BI professionals
  • Researchers
  • Professors
  • Students

For more details about the workshop and registration process, visit http://bidw.cdacmumbai.in/.

Google Offers Cloud-Based Learning Engine

How could providing developers with machine learning on tap unleash a flood of smarter apps? What is Google Prediction API? How will it help developers to create software that learns how to handle incoming data? Read at http://www.technologyreview.in/computing/26093/page1/.

Indianizing with Rupantar- An OpenOffice.org extension from CDAC, Mumbai

In a diverse country like India, localisation of information in the respective languages of the people works as a way of empowerment for the people. And, open source provides the necessary technical platform for delivering that information at some reasonable cost. CDAC Mumbai has long been involved in developing tools/technologies in the area of localisation and open source. These include Xlit, Matra, SuTra, Anumaan. New tool to join this list is Rupantar.

Rupantar is an extention for OpenOffice.org developed by the KBCS division of CDAC Mumbai. This extension works as a utility to write in Devnagri script, hence allows a user to type in Hindi and Marathi language. It uses the Rupantar scheme, also developed by CDAC Mumbai, to convert from Roman to Devnagri. More details about downloading and installing the extension can be accessed at http://extensions.services.openoffice.org/project/rupantar.

Related blog posts:

http://matra2.blogspot.com/2010/03/rupanatar-openoffice-writer-extension.html

http://blogs.siliconindia.com/light/Writing_in_HindiMarathi_Indianising_information_and_softwares-bid-cs2Li6d193012540.html

For feedback about the extension mailto:prakash@cdacmumbai.in.

SIGAI Workshop on Emerging Research Trends in AI, 2010

Artificial Intelligence has always been a challenging field providing lots of opportunities of research and development in no. of areas like language processing, multi-agent systems,  web mining, information retrieval, semantic web, e-learning, optimization problems, pattern recognition etc. And, there is growing interest in the field in India between academia and industria both.

Keeping this in mind, The Special Interest Group on AI (SIGAI) of Computer Society of India (CSI) announces a workshop on “Emerging Research Trends in AI”. The workshop will be organized and hosted by CDAC,  Navi Mumbai, and is meant to encourage quality research in various aspects of AI, among the Indian academia/industry.

For more details, see Call for Papers and visit: http://sigai.cdacmumbai.in/index.php/ertai-2010

multi-agent systems,  web mining, information retrieval, semantic web, e-learning, optimization problems, pattern recognition, etc.

XlitHindi extension for OOo released by KBCS, CDAC Mumbai

KBCS division of CDAC Mumbai has released its transliteration extension for the OpenOffice writer. This tool enables to write in Hindi very easily without knowing the Hindi typing. It is very much helpful for the people using Hindi for the e-communication and documentation.More details of the English to other Indian Language transliteration system can be read at:http://www.cdacmumbai.in/xlit. To download the extension and other details, visit : http://extensions.services.openoffice.org/project/xlithindi.