Commit graph

138 commits

Author SHA1 Message Date
Ronny Bjarnason
1e11ee89fa Merge branch 'master' of github.com:leadtune/clj-ml 2011-03-15 16:52:54 -06:00
Ronny Bjarnason
a838cd2dad add SPegasos classifier 2011-03-15 16:52:23 -06:00
Ben Mabey
ed53d7d764 Merge branch 'master' of github.com:leadtune/clj-ml 2011-03-15 13:41:29 -06:00
Ben Mabey
c11f92a20b allow for streams to be passed into the load-instances fn 2011-03-15 13:41:02 -06:00
Ronny Bjarnason
73712d8384 add Rotation Forests 2011-02-21 14:23:09 -07:00
Ronny Bjarnason
e211354b42 Add Additive Regresson as a classifier (aka Gradient Boosted Decision Trees: GBDT) 2011-02-21 12:56:22 -07:00
Ben Mabey
4f8e7c11d2 adds dataset-set-name wrapper 2011-01-18 17:18:19 -07:00
Ben Mabey
af47246368 fixes incorrect type hinting that was causing the method not to be found 2011-01-11 13:49:05 -07:00
Ben Mabey
aa4048c580 yes.. more type hinting... 2011-01-04 17:23:41 -07:00
Ben Mabey
b066a65969 more type hinting 2011-01-04 17:06:09 -07:00
Ben Mabey
1678bf3b19 yep, you guessed it- more type hinting 2011-01-04 16:58:30 -07:00
Ben Mabey
0a2c30ccbe more type hinting.. 2011-01-04 16:45:29 -07:00
Ben Mabey
8bc6223e8f more type hinting/weka lack of Interfaces dance around 2011-01-04 16:38:17 -07:00
Ben Mabey
e909942d39 more type hinting fun...
totally awesome that Weka doesn't know how to use interfaces :/
2011-01-04 16:26:31 -07:00
Ben Mabey
0dfd143a62 adds attribute-labels-as-strings since keywording isn't always desired 2011-01-04 11:47:18 -07:00
Ben Mabey
8194b4531b faster version of attribute-labels 2011-01-04 11:40:09 -07:00
Ben Mabey
44df9744c1 adds dataset-as-vecs and dataset-as-lists 2011-01-04 10:02:48 -07:00
Ben Mabey
8cb6620b6f adds dataset-class-name 2010-12-30 10:14:57 -07:00
Ben Mabey
7cd8e08446 makes attribute-labels fn that returns set of labels and renames old one 2010-12-29 15:08:05 -07:00
Ben Mabey
ef4207ee88 extracts attribute-labels for general use 2010-12-29 14:52:16 -07:00
Ben Mabey
228c58f74c adds headers-only fn to extract the headers of a weka dataset 2010-12-22 14:23:44 -07:00
Ben Mabey
68cf83b712 fixes dataset-format to return nominal labels in correct order
Also made it more idiomatic and concise by using reduce...
2010-12-16 16:58:53 -07:00
Ben Mabey
54567b5611 fixes supervised-nominal-to-binary filter fn 2010-12-16 15:37:54 -07:00
Ronny Bjarnason
c89e559ace additional options to weka models 2010-12-14 15:04:42 -07:00
Ronny Bjarnason
44f63508c9 1. Create weka model wrappers for PaceRegression, RandomForests, M5P Trees
and boosted stumps (LogitBoost).
2. Added functionality in MultilayerPerceptron to specify hidden layers
3. corrected misspelling decission->decision
2010-12-13 17:01:25 -07:00
Ben Mabey
89578f113c comment 2010-12-08 16:14:28 -07:00
Ben Mabey
4d18af461e not keywording the instance nominal values by default due to performance
On large datasets if you use dataset-as-maps too much time is taken up
in interning the nominal values time and time again.  For the vast
majority of use cases string values should be fine.  If we were in
clojure-land 100% then keywords would be the better option.. but we
aren't.
2010-12-08 16:10:45 -07:00
Ben Mabey
5d59785f3b type hints 2010-12-08 16:05:46 -07:00
Ben Mabey
81cda11ed2 DRYs up some fns and replaces custom code with enumeration-seq 2010-12-08 16:05:41 -07:00
Ben Mabey
b47e794176 removes custom (and slow) nominal lookup code- just use weka's API 2010-12-08 14:29:25 -07:00
Ben Mabey
a34fe4913f adds linear and logistic regression wrapeprs 2010-12-07 17:24:22 -07:00
Ben Mabey
652c54aaf7 standardizes how attributes can be asked for (by index or name) 2010-12-07 14:14:20 -07:00
Ben Mabey
99048715f1 adds tests for dataset label helper fns 2010-12-07 13:48:52 -07:00
Ben Mabey
6a30d86841 adds dataset-nominal? 2010-12-07 09:32:27 -07:00
Ben Mabey
138ee7f016 adds dataset-class-values 2010-12-06 17:36:36 -07:00
Ben Mabey
0ceb8020fa changes values -> labels where appropriate to be consitent with weka terminology 2010-12-06 17:02:11 -07:00
Ben Mabey
49f99e3218 adds classifier-copy-and-train as a non-side-effect alternative 2010-12-06 16:47:50 -07:00
Ben Mabey
df8fcd467c type hint 2010-12-03 14:31:57 -07:00
Ben Mabey
b6deb0421e improves filters API by making filter fns that create and apply filters 2010-12-02 11:44:38 -07:00
Ben Mabey
3a393ebcf9 adds make-apply-filters 2010-12-01 16:22:19 -07:00
Ben Mabey
ca7281d214 allows attributes in filters to be specified by name in addition to index 2010-12-01 14:36:05 -07:00
Ben Mabey
78fc93c2a4 allow for dataset-set-class to take the name of attribute 2010-12-01 11:37:27 -07:00
Ben Mabey
93846bcaea type hint classifiers as much as possible... 2010-11-29 10:08:57 -07:00
Ben Mabey
31348dd1e0 reworks dataset-as-maps to be faster by avoiding redundant string interning 2010-11-24 12:12:41 -07:00
Ben Mabey
1f3900a628 replaces some loop/recurs with more conciser idiomatic clojure 2010-11-24 12:05:30 -07:00
Ben Mabey
88d65f0e24 removes custom key-to-str fn in favor for clojure.core/name 2010-11-23 16:09:13 -07:00
Ben Mabey
ccc12c2d5c adds dataset-replace-attribute! 2010-11-08 10:37:42 -07:00
Ben Mabey
285e45717a shorter, and more idomatic version of index-attr 2010-11-08 10:37:42 -07:00
Ben Mabey
2d274d2598 adds into-fast-vector util fn 2010-11-08 10:37:42 -07:00
Ben Mabey
19a827d8b3 sets the output-format for ClojureBatchFilter to use the process fn format 2010-11-08 10:37:42 -07:00
Ben Mabey
eabb2acca5 adds helper fns in data and fixes some docstring misplacements 2010-11-08 10:37:42 -07:00
Ben Mabey
55ff30fcfa fixes is-dataset? to report correctly 2010-11-04 11:50:42 -06:00
Ben Mabey
b7acdd5d8d adds documenation for the new clj-[streamable|batch] filters 2010-11-03 11:49:04 -06:00
Ben Mabey
c2f44f44e0 refactors branching in make-filter for the new Clojure filters 2010-11-03 11:20:06 -06:00
Ben Mabey
249424ad81 adds ClojureBatchFilter 2010-11-03 11:10:41 -06:00
Ben Mabey
1412e9a86f allows ClojureStreamFilter to use a fn for updating the dataset-format 2010-11-02 17:42:03 -06:00
Ben Mabey
36041956b5 another doto minor refactoring 2010-11-02 16:52:31 -06:00
Ben Mabey
78a99eaa35 adds basic ClojureStreamFilter which accepts a clojure fn for processing 2010-11-02 16:51:15 -06:00
Ben Mabey
f256984153 minor refactorings in data.clj 2010-11-01 17:39:27 -06:00
Ben Mabey
d87640e464 adds add-attribute filter 2010-11-01 17:13:35 -06:00
Ben Mabey
c2991b18b1 doc updates for filters 2010-11-01 14:46:24 -06:00
Ben Mabey
b09a59025f adds numeric to nominal filter 2010-11-01 14:05:11 -06:00
Ben Mabey
f403ca21d7 changes make-classifier-m from a macro into a fn 2010-11-01 11:20:59 -06:00
Ben Mabey
6ec5f1f2a0 similar options refactorings in classifiers that was done in filters 2010-11-01 11:14:07 -06:00
Ben Mabey
6dec810b12 replaces make-filter macro with fn and swaps multimethods with map 2010-10-29 19:13:35 -06:00
Ben Mabey
c100184e72 switches out custom code with doto macro 2010-10-29 18:00:27 -06:00
Ben Mabey
dfe20783af more filters DRYing 2010-10-29 17:58:07 -06:00
Ben Mabey
10a57b8224 swap arg order for check-options 2010-10-29 17:10:47 -06:00
Ben Mabey
51fc383c15 general filters refactoring- mostly DRYing stuff up 2010-10-29 16:57:20 -06:00
Ben Mabey
497d65dffc adds RemoveUseless filter wrapper and does some test refactoring 2010-10-29 15:55:29 -06:00
Ben Mabey
37e8ca05d7 removes clusterers dep on incanter- doesn't appear to need it 2010-10-27 16:35:48 -06:00
Ben Mabey
42b1fed112 adds missing require in clj-ml.io so it can load 2010-10-27 16:34:44 -06:00
Antonio Garrote
47e78f5fb4 Documentation for clusterers and small fixes in the clusterer options for :expectation-maximization clusterer 2010-03-16 09:22:49 +01:00
Antonio Garrote
06adde17e9 documentation for classifiers mostly finished 2010-03-08 16:32:38 +01:00
Antonio Garrote
ecd2e3579f Documentation and comments for filter.clj. Not finished yet. 2010-03-08 09:20:37 +01:00
Antonio Garrote
1e8d1d24ec Autodoc documentation support 2010-03-07 23:25:47 +01:00
Antonio Garrote
f91328f5e4 SMO Support Vector Machines and different kernel functions: polynomic, radia-basis and string kernels 2010-03-07 18:22:04 +01:00
Antonio Garrote
c1c7ba0ac5 Support for persisting data sets into MongoDB 2010-03-06 12:10:26 +01:00
Antonio Garrote
df2fe1913f select-append and project-attributes filters 2010-03-04 17:05:33 +01:00
Antonio Garrote
8facf1dfe2 EM clusterer added, remove-attributes filter added, improved syntax for filters and make-apply-filter form 2010-03-04 13:56:50 +01:00
Antonio Garrote
b13fd3b4b4 Updateable clusterers 2010-03-03 16:37:06 +01:00
Antonio Garrote
2670b037b2 Small improvements 2010-03-03 14:46:08 +01:00
Antonio Garrote
c37d36e5d3 Distance functions and distance function parameters for clusterers 2010-03-02 22:47:35 +01:00
Antonio Garrote
00ff6b1bc4 Visualization of more objects, clustering of collections and good clusterers evaluation 2010-03-02 21:24:11 +01:00
Antonio Garrote
fba9f97845 More functions for working with clusters and display of cluster and k-means centroids 2010-03-02 09:41:25 +01:00
Antonio Garrote
5d6d85fc0e Chart drawing support 2010-02-28 23:54:19 +01:00
Antonio Garrote
a7990c5373 Initial support for clustering algorithms 2010-02-28 20:29:51 +01:00
Antonio Garrote
360e507bd2 First commit 2010-02-28 13:14:17 +01:00