Ronny Bjarnason
1e11ee89fa
Merge branch 'master' of github.com:leadtune/clj-ml
2011-03-15 16:52:54 -06:00
Ronny Bjarnason
a838cd2dad
add SPegasos classifier
2011-03-15 16:52:23 -06:00
Ben Mabey
ed53d7d764
Merge branch 'master' of github.com:leadtune/clj-ml
2011-03-15 13:41:29 -06:00
Ben Mabey
c11f92a20b
allow for streams to be passed into the load-instances fn
2011-03-15 13:41:02 -06:00
Ronny Bjarnason
73712d8384
add Rotation Forests
2011-02-21 14:23:09 -07:00
Ronny Bjarnason
e211354b42
Add Additive Regresson as a classifier (aka Gradient Boosted Decision Trees: GBDT)
2011-02-21 12:56:22 -07:00
Ben Mabey
4f8e7c11d2
adds dataset-set-name wrapper
2011-01-18 17:18:19 -07:00
Ben Mabey
af47246368
fixes incorrect type hinting that was causing the method not to be found
2011-01-11 13:49:05 -07:00
Ben Mabey
aa4048c580
yes.. more type hinting...
2011-01-04 17:23:41 -07:00
Ben Mabey
b066a65969
more type hinting
2011-01-04 17:06:09 -07:00
Ben Mabey
1678bf3b19
yep, you guessed it- more type hinting
2011-01-04 16:58:30 -07:00
Ben Mabey
0a2c30ccbe
more type hinting..
2011-01-04 16:45:29 -07:00
Ben Mabey
8bc6223e8f
more type hinting/weka lack of Interfaces dance around
2011-01-04 16:38:17 -07:00
Ben Mabey
e909942d39
more type hinting fun...
...
totally awesome that Weka doesn't know how to use interfaces :/
2011-01-04 16:26:31 -07:00
Ben Mabey
0dfd143a62
adds attribute-labels-as-strings since keywording isn't always desired
2011-01-04 11:47:18 -07:00
Ben Mabey
8194b4531b
faster version of attribute-labels
2011-01-04 11:40:09 -07:00
Ben Mabey
44df9744c1
adds dataset-as-vecs and dataset-as-lists
2011-01-04 10:02:48 -07:00
Ben Mabey
8cb6620b6f
adds dataset-class-name
2010-12-30 10:14:57 -07:00
Ben Mabey
7cd8e08446
makes attribute-labels fn that returns set of labels and renames old one
2010-12-29 15:08:05 -07:00
Ben Mabey
ef4207ee88
extracts attribute-labels for general use
2010-12-29 14:52:16 -07:00
Ben Mabey
228c58f74c
adds headers-only fn to extract the headers of a weka dataset
2010-12-22 14:23:44 -07:00
Ben Mabey
68cf83b712
fixes dataset-format to return nominal labels in correct order
...
Also made it more idiomatic and concise by using reduce...
2010-12-16 16:58:53 -07:00
Ben Mabey
54567b5611
fixes supervised-nominal-to-binary filter fn
2010-12-16 15:37:54 -07:00
Ronny Bjarnason
c89e559ace
additional options to weka models
2010-12-14 15:04:42 -07:00
Ronny Bjarnason
44f63508c9
1. Create weka model wrappers for PaceRegression, RandomForests, M5P Trees
...
and boosted stumps (LogitBoost).
2. Added functionality in MultilayerPerceptron to specify hidden layers
3. corrected misspelling decission->decision
2010-12-13 17:01:25 -07:00
Ben Mabey
89578f113c
comment
2010-12-08 16:14:28 -07:00
Ben Mabey
4d18af461e
not keywording the instance nominal values by default due to performance
...
On large datasets if you use dataset-as-maps too much time is taken up
in interning the nominal values time and time again. For the vast
majority of use cases string values should be fine. If we were in
clojure-land 100% then keywords would be the better option.. but we
aren't.
2010-12-08 16:10:45 -07:00
Ben Mabey
5d59785f3b
type hints
2010-12-08 16:05:46 -07:00
Ben Mabey
81cda11ed2
DRYs up some fns and replaces custom code with enumeration-seq
2010-12-08 16:05:41 -07:00
Ben Mabey
b47e794176
removes custom (and slow) nominal lookup code- just use weka's API
2010-12-08 14:29:25 -07:00
Ben Mabey
a34fe4913f
adds linear and logistic regression wrapeprs
2010-12-07 17:24:22 -07:00
Ben Mabey
652c54aaf7
standardizes how attributes can be asked for (by index or name)
2010-12-07 14:14:20 -07:00
Ben Mabey
99048715f1
adds tests for dataset label helper fns
2010-12-07 13:48:52 -07:00
Ben Mabey
6a30d86841
adds dataset-nominal?
2010-12-07 09:32:27 -07:00
Ben Mabey
138ee7f016
adds dataset-class-values
2010-12-06 17:36:36 -07:00
Ben Mabey
0ceb8020fa
changes values -> labels where appropriate to be consitent with weka terminology
2010-12-06 17:02:11 -07:00
Ben Mabey
49f99e3218
adds classifier-copy-and-train as a non-side-effect alternative
2010-12-06 16:47:50 -07:00
Ben Mabey
df8fcd467c
type hint
2010-12-03 14:31:57 -07:00
Ben Mabey
b6deb0421e
improves filters API by making filter fns that create and apply filters
2010-12-02 11:44:38 -07:00
Ben Mabey
3a393ebcf9
adds make-apply-filters
2010-12-01 16:22:19 -07:00
Ben Mabey
ca7281d214
allows attributes in filters to be specified by name in addition to index
2010-12-01 14:36:05 -07:00
Ben Mabey
78fc93c2a4
allow for dataset-set-class to take the name of attribute
2010-12-01 11:37:27 -07:00
Ben Mabey
93846bcaea
type hint classifiers as much as possible...
2010-11-29 10:08:57 -07:00
Ben Mabey
31348dd1e0
reworks dataset-as-maps to be faster by avoiding redundant string interning
2010-11-24 12:12:41 -07:00
Ben Mabey
1f3900a628
replaces some loop/recurs with more conciser idiomatic clojure
2010-11-24 12:05:30 -07:00
Ben Mabey
88d65f0e24
removes custom key-to-str fn in favor for clojure.core/name
2010-11-23 16:09:13 -07:00
Ben Mabey
ccc12c2d5c
adds dataset-replace-attribute!
2010-11-08 10:37:42 -07:00
Ben Mabey
285e45717a
shorter, and more idomatic version of index-attr
2010-11-08 10:37:42 -07:00
Ben Mabey
2d274d2598
adds into-fast-vector util fn
2010-11-08 10:37:42 -07:00
Ben Mabey
19a827d8b3
sets the output-format for ClojureBatchFilter to use the process fn format
2010-11-08 10:37:42 -07:00
Ben Mabey
eabb2acca5
adds helper fns in data and fixes some docstring misplacements
2010-11-08 10:37:42 -07:00
Ben Mabey
55ff30fcfa
fixes is-dataset? to report correctly
2010-11-04 11:50:42 -06:00
Ben Mabey
b7acdd5d8d
adds documenation for the new clj-[streamable|batch] filters
2010-11-03 11:49:04 -06:00
Ben Mabey
c2f44f44e0
refactors branching in make-filter for the new Clojure filters
2010-11-03 11:20:06 -06:00
Ben Mabey
249424ad81
adds ClojureBatchFilter
2010-11-03 11:10:41 -06:00
Ben Mabey
1412e9a86f
allows ClojureStreamFilter to use a fn for updating the dataset-format
2010-11-02 17:42:03 -06:00
Ben Mabey
36041956b5
another doto minor refactoring
2010-11-02 16:52:31 -06:00
Ben Mabey
78a99eaa35
adds basic ClojureStreamFilter which accepts a clojure fn for processing
2010-11-02 16:51:15 -06:00
Ben Mabey
f256984153
minor refactorings in data.clj
2010-11-01 17:39:27 -06:00
Ben Mabey
d87640e464
adds add-attribute filter
2010-11-01 17:13:35 -06:00
Ben Mabey
c2991b18b1
doc updates for filters
2010-11-01 14:46:24 -06:00
Ben Mabey
b09a59025f
adds numeric to nominal filter
2010-11-01 14:05:11 -06:00
Ben Mabey
f403ca21d7
changes make-classifier-m from a macro into a fn
2010-11-01 11:20:59 -06:00
Ben Mabey
6ec5f1f2a0
similar options refactorings in classifiers that was done in filters
2010-11-01 11:14:07 -06:00
Ben Mabey
6dec810b12
replaces make-filter macro with fn and swaps multimethods with map
2010-10-29 19:13:35 -06:00
Ben Mabey
c100184e72
switches out custom code with doto macro
2010-10-29 18:00:27 -06:00
Ben Mabey
dfe20783af
more filters DRYing
2010-10-29 17:58:07 -06:00
Ben Mabey
10a57b8224
swap arg order for check-options
2010-10-29 17:10:47 -06:00
Ben Mabey
51fc383c15
general filters refactoring- mostly DRYing stuff up
2010-10-29 16:57:20 -06:00
Ben Mabey
497d65dffc
adds RemoveUseless filter wrapper and does some test refactoring
2010-10-29 15:55:29 -06:00
Ben Mabey
37e8ca05d7
removes clusterers dep on incanter- doesn't appear to need it
2010-10-27 16:35:48 -06:00
Ben Mabey
42b1fed112
adds missing require in clj-ml.io so it can load
2010-10-27 16:34:44 -06:00
Antonio Garrote
47e78f5fb4
Documentation for clusterers and small fixes in the clusterer options for :expectation-maximization clusterer
2010-03-16 09:22:49 +01:00
Antonio Garrote
06adde17e9
documentation for classifiers mostly finished
2010-03-08 16:32:38 +01:00
Antonio Garrote
ecd2e3579f
Documentation and comments for filter.clj. Not finished yet.
2010-03-08 09:20:37 +01:00
Antonio Garrote
1e8d1d24ec
Autodoc documentation support
2010-03-07 23:25:47 +01:00
Antonio Garrote
f91328f5e4
SMO Support Vector Machines and different kernel functions: polynomic, radia-basis and string kernels
2010-03-07 18:22:04 +01:00
Antonio Garrote
c1c7ba0ac5
Support for persisting data sets into MongoDB
2010-03-06 12:10:26 +01:00
Antonio Garrote
df2fe1913f
select-append and project-attributes filters
2010-03-04 17:05:33 +01:00
Antonio Garrote
8facf1dfe2
EM clusterer added, remove-attributes filter added, improved syntax for filters and make-apply-filter form
2010-03-04 13:56:50 +01:00
Antonio Garrote
b13fd3b4b4
Updateable clusterers
2010-03-03 16:37:06 +01:00
Antonio Garrote
2670b037b2
Small improvements
2010-03-03 14:46:08 +01:00
Antonio Garrote
c37d36e5d3
Distance functions and distance function parameters for clusterers
2010-03-02 22:47:35 +01:00
Antonio Garrote
00ff6b1bc4
Visualization of more objects, clustering of collections and good clusterers evaluation
2010-03-02 21:24:11 +01:00
Antonio Garrote
fba9f97845
More functions for working with clusters and display of cluster and k-means centroids
2010-03-02 09:41:25 +01:00
Antonio Garrote
5d6d85fc0e
Chart drawing support
2010-02-28 23:54:19 +01:00
Antonio Garrote
a7990c5373
Initial support for clustering algorithms
2010-02-28 20:29:51 +01:00
Antonio Garrote
360e507bd2
First commit
2010-02-28 13:14:17 +01:00