wood burning stoves 2.0
The moose likes Other Languages and the fly likes R's performance Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login
JavaRanch » Java Forums » Languages » Other Languages
Bookmark "R Watch "R New topic

R's performance

Yukun Song

Joined: Jun 06, 2010
Posts: 9
Hi Robert,

Is R intended to satisfy the requirement of performance for large scale enterprise application, in terms of high speed and high frequency processing, concurrent computing and scalibility, when it is integrated with other languages like Java or C++ or used solely? Does the book R in Action cover similar topics?


Robert Kabacoff
Ranch Hand

Joined: Mar 28, 2011
Posts: 33
Hi Ykun,

With the massive stores of data now being collected, this is an increasingly important question. R was orginally designed to handle moderate to large amounts of data (in the megabyte and gigabyte range). It keeps the data in memory, which leads to a zippy experience for interactive users, but creates limits for very large datasets. Most users keep their data in external databases or data warehouses and access portions of it through R's extensive DBMS access routines.

R can handle data stores in the terabyte range, but requires the use of special procedures and routines (see for example, High-Performance and Parallel Computing with R).

Appendix G in "R in Action" describes working with large datasets.
Yukun Song

Joined: Jun 06, 2010
Posts: 9
Thanks for your reply, Robert.

I was thinking that R should have some kind of feature of scalability available for any package so that package developer and R's user can focus on their scientific computing or application integration without need to worry about its performance particularly scalibility. Maybe my thought is too luxury, since it's hard for any computing language.

I agree. Here's the link: http://aspose.com/file-tools
subject: R's performance
jQuery in Action, 3rd edition