
Why is Python the language used most for data science work?

 
Ranch Hand
For data science work, the Python language is used a lot. Why is Python the preferred language for this?
 
Bartender
My two cents: data science and machine learning are essentially maths, calculus and algebra, and Python is one of the richest languages out there in terms of math libraries. Moreover, its syntax is simple and effective, so you can stay focused on the task you're working on and leave most of the details to the language itself. Last but not least, its overhead is limited compared with Java, for example.
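To illustrate the syntax point, here is a minimal sketch of a least-squares line fit using only Python's standard `statistics` module. The data and the `least_squares_fit` helper are made up for this example; in real data science work a third-party library such as NumPy would more likely be used.

```python
import statistics

# Hypothetical sample data, invented purely for illustration.
hours_studied = [1.0, 2.0, 3.0, 4.0, 5.0]
exam_scores = [52.0, 58.0, 65.0, 70.0, 78.0]


def least_squares_fit(xs, ys):
    """Return (slope, intercept) of the ordinary least-squares line."""
    mean_x = statistics.mean(xs)
    mean_y = statistics.mean(ys)
    # Sum of products of deviations from the means (unnormalized covariance).
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Sum of squared deviations of x (unnormalized variance).
    variance = sum((x - mean_x) ** 2 for x in xs)
    slope = covariance / variance
    return slope, mean_y - slope * mean_x


slope, intercept = least_squares_fit(hours_studied, exam_scores)
print(f"score = {slope:.1f} * hours + {intercept:.1f}")
```

The whole fit is a handful of lines that read close to the underlying formula, which is the kind of low ceremony I mean.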
 
Bartender
I think you might need to make a distinction between "data science" and "science".

Historically, the Fortran programming language has been used for science. After all, it was designed for just that (Fortran == Formula Translator, originally written FORTRAN). Fortran could also compile on systems with 4K of memory, back when a 4K mainframe was a serious investment. So a lot of math work was done there, and a lot of support libraries were written in Fortran. SAS, the Statistical Analysis System, was designed around Fortran (and PL/1, IBM's attempt to improve on Fortran to handle general and business computing).

Fortran was also the preferred language for supercomputers.

Admittedly, Python has attracted a lot of interest from scientific users. Fortran is no longer the one and only language for scientific programming.

Data science, however, is a particular scientific sub-discipline, and Python is better for non-numerical processing while still being fully usable for numerical work. And, unlike PL/1, an efficient Python implementation can be had for free on almost any modern computer system.
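As a rough sketch of what mixing non-numerical and numerical work looks like, the snippet below parses some text records and then computes a statistic from them, using only the standard library. The log lines, their format, and the `summarize` helper are all invented for the example.

```python
from collections import Counter
import statistics

# Hypothetical log lines, invented purely for illustration.
log_lines = [
    "GET /index.html 200 12ms",
    "GET /about.html 200 8ms",
    "POST /login 401 30ms",
    "GET /index.html 200 10ms",
]


def summarize(lines):
    """Count requests per path and compute the mean latency in milliseconds."""
    path_counts = Counter()
    latencies = []
    for line in lines:
        # Non-numerical step: split each record into its text fields.
        _method, path, _status, latency = line.split()
        path_counts[path] += 1
        # Strip the trailing "ms" unit before converting to an integer.
        latencies.append(int(latency[:-2]))
    # Numerical step: aggregate the parsed values.
    return path_counts, statistics.mean(latencies)


path_counts, mean_latency = summarize(log_lines)
print(path_counts.most_common(1), mean_latency)
```

The text handling and the arithmetic sit side by side in the same few lines, without switching tools.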
 
Marshal
My son (who is a professional scientist) usually writes code in R for the things he needs to do. I understand that's quite common, but I don't know whether Python outranks R or not.
 
Tim Holloway
Bartender
Good point. R is designed for stuff like that.
 
Monica Shiralkar
Ranch Hand
thanks
 