Win a copy of Design for the Mind this week in the Design forum!
  • Post Reply
  • Bookmark Topic Watch Topic
  • New Topic

Collaboratively data mining projects on a large, open-ended clojure-lab on git hub

 
jay vas
Ranch Hand
Posts: 407
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Hi guys :

Any one interested in collaboratively working on a large, open-ended clojure-lab on git hub ?

I have trying to get a group of Clojure folks together to collaborate on a project on GitHub that will be both interesting, potentially usefull, and extremely educational.

The idea is that, as programmers learning LISP, we might be able to feed off of one anothers projects and examples -- project euler is simply not usefull or insightful enough. To understand Clojure - we need to be doing real things, in high volume : parsing jsons, connecting to databases, and solving domain-specific problems. If we individually do this in separate repos, then we reduce our learning throughput --- but if we share a repository full of serious, real-world clojure mini-apps and projects, we can begin to feed off of our collective knowledge....

To kick-start things, we (as bioinformaticians) created the BioClojure project on GitHub, and got some serious interest and learned alot. We got to sample the powerful java interop, leiningan for getting maven repositories, and many other niceties of the language, thanks to contributions from Lee Hinnman and others.

Now, to open this up to a broader group of Clojure developers, we are migrating it to a much broader audience (we hope), and naming the project RudolF (many of our day jobs percieve functional / dynamic languages in much the same way santa's reindeer once percieved Rudolph's funny red nose)....

So... If so ping me and we can start carving out some pedagogical projects. The next two days we will be adding alot of Clojure and Haskell stub code to the GitHub Rudolf repository..... Hope to see you guys there.
 
Sean Corfield
Ranch Hand
Posts: 302
10
Clojure Linux Mac OS X
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Link to the github repo?
 
jay vas
Ranch Hand
Posts: 407
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Thanks Sean ....... Here is the URL. There is some pretty cool Clojure bioinformatics data visualization stuff in there now, and also, some Haskell for dealing with large sets of random data samples in sampling experiments.

https://github.com/jayunit100/RudolF
 
Sean Corfield
Ranch Hand
Posts: 302
10
Clojure Linux Mac OS X
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
The first interesting exercise I'd suggest would be migrating the code to Clojure 1.3.0 and the new modular contrib libraries (instead of the old monolithic contrib).
 
jay vas
Ranch Hand
Posts: 407
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Good idea... We will be migrating bioclojure to jan aerts hub soon, the stuff on there is just proof of concept....that biojava, lien, and clojure really do play very nicely together.


I see you have some pedagogical projects to.... Maybe we Can merge then into rudolf or vice versa if your interested.
 
Sean Corfield
Ranch Hand
Posts: 302
10
Clojure Linux Mac OS X
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
jay vas wrote:I see you have some pedagogical projects to.... Maybe we Can merge then into rudolf or vice versa if your interested.

Not sure what you're referring to... macro-day is about the only one that fits that category and it's mostly covered by Clojure in Action.
 
jay vas
Ranch Hand
Posts: 407
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
ok thanks. we will keep pushing forward.
 
jay vas
Ranch Hand
Posts: 407
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Hi guys :

Thanks to some help from the vibrant clojure community, namely Sean Corfield and Lee Hinnman, including some very useful conversations on java moose, the a publication on the RudolF project has been accepted to ITNG for this spring !

.... Just to remind , RudolF is the open-source, machine learning / data mining app (a clojur sandbox project to demonstrate the power of the language), which I've been soliciting contributors for..... https://github.com/jayunit100/RudolF .

Here is a link to a copy of it, currently on lee hinmans blog : http://writequit.org/papers/files/RudolF.pdf

Thanks Sean for your advice.

If any of you have thoughts for next steps, or are interested in being on a second, follow up publication to this one by contributing to the code base, let me know
 
  • Post Reply
  • Bookmark Topic Watch Topic
  • New Topic