Commit graph

66 commits

Author SHA1 Message Date
Joshua Eckroth
8aeed64130 Changed classifier-classify to produce the class label; updated classifier-label to avoid using classifier-classify. 2013-08-06 03:41:12 -04:00
Joshua Eckroth
db70ee980f Fixed indentation. 2013-07-31 06:50:59 -04:00
Joshua Eckroth
3ead98c527 Added k-nearest neighbor classifier (:lazy :ibk) 2013-07-16 23:29:45 -04:00
Joshua Eckroth
26a9d69c05 Fixed saving/loading csv instances. 2013-07-11 07:40:32 -04:00
Joshua Eckroth
b8cc877c05 Bugfix and added empty test case to force loading/compiling of clj-ml.public-datasets. 2013-07-04 09:42:03 -04:00
Joshua Eckroth
a8ccca6d53 Support stemmers (use Porter/English/etc. rather than lowercase form).
Bumped to version 0.3.7.
2013-07-01 14:07:34 -04:00
Joshua Eckroth
4aa08595a1 More fixes for svm grid operation; added libsvm to project deps. 2013-06-21 01:00:55 -04:00
Joshua Eckroth
76fb70c602 Fixed broken test case. 2013-06-21 00:22:48 -04:00
Joshua Eckroth
e0d17445f0 Formatting. 2013-03-30 18:16:08 -04:00
Joshua Eckroth
05cf721ec5 Changed resample filter to resample-unsupervised, and added resample-supervised. 2013-03-22 12:35:24 -04:00
Joshua Eckroth
54eb9cf78b Added resample filter. 2013-03-22 10:58:03 -04:00
Joshua Eckroth
30d2c1192f Added an instance-loading test. 2013-03-22 10:57:49 -04:00
Joshua Eckroth
7004dd87cd Added :reorder-attributes filter. 2013-03-22 10:09:38 -04:00
Joshua Eckroth
ed3bf1d71d Added StringToWordVector filter. 2013-03-22 07:55:01 -04:00
Joshua Eckroth
436d6701cd Bumped version, updated dependencies (new stable Weka), fixed a test. 2013-03-21 13:49:27 -04:00
Ben Mabey
dc12081b4a attaches instance weight info as metadata when converting to a map 2012-03-26 12:20:45 -06:00
Ben Mabey
042de1895a updates project to clojure 1.3, removes contrib and autodoc deps 2012-01-30 15:27:54 -07:00
Ben Mabey
c25d39521b removes mongodb store.. I never used it and won't be supporting it 2012-01-05 13:29:45 -07:00
Ben Mabey
fa2d39ffe5 change the dataset's name when randomized (like the filters do) 2011-12-13 20:10:11 -07:00
Ben Mabey
8a01361512 adds take-dataset 2011-12-08 15:28:27 -07:00
Ben Mabey
d313f621d1 adds remove-range filter so split-dataset can take a number to split on 2011-12-08 15:13:34 -07:00
Ben Mabey
0c32c318af adds split-dataset via new remove-percentage filter wrapper 2011-12-07 11:15:19 -07:00
Ben Mabey
84b1903b63 adds simple wrappers for Weka's attribute (feature) selection package 2011-10-04 13:10:01 -06:00
Ben Mabey
7357a720e9 makes representation of NaNs (no values) symmetric with creation- nil 2011-04-05 13:55:21 -06:00
Ben Mabey
a83e21d72b allows for creation of instances (and datasets) with nil values 2011-04-05 13:34:23 -06:00
Ben Mabey
44df9744c1 adds dataset-as-vecs and dataset-as-lists 2011-01-04 10:02:48 -07:00
Ben Mabey
8cb6620b6f adds dataset-class-name 2010-12-30 10:14:57 -07:00
Ben Mabey
7cd8e08446 makes attribute-labels fn that returns set of labels and renames old one 2010-12-29 15:08:05 -07:00
Ben Mabey
228c58f74c adds headers-only fn to extract the headers of a weka dataset 2010-12-22 14:23:44 -07:00
Ben Mabey
68cf83b712 fixes dataset-format to return nominal labels in correct order
Also made it more idiomatic and concise by using reduce...
2010-12-16 16:58:53 -07:00
Ronny Bjarnason
9829c0eea4 fix typo decission->decision in tests 2010-12-14 15:16:39 -07:00
Ben Mabey
4d18af461e not keywording the instance nominal values by default due to performance
On large datasets if you use dataset-as-maps too much time is taken up
in interning the nominal values time and time again.  For the vast
majority of use cases string values should be fine.  If we were in
clojure-land 100% then keywords would be the better option.. but we
aren't.
2010-12-08 16:10:45 -07:00
Ben Mabey
81cda11ed2 DRYs up some fns and replaces custom code with enumeration-seq 2010-12-08 16:05:41 -07:00
Ben Mabey
652c54aaf7 standardizes how attributes can be asked for (by index or name) 2010-12-07 14:14:20 -07:00
Ben Mabey
99048715f1 adds tests for dataset label helper fns 2010-12-07 13:48:52 -07:00
Ben Mabey
138ee7f016 adds dataset-class-values 2010-12-06 17:36:36 -07:00
Ben Mabey
b6deb0421e improves filters API by making filter fns that create and apply filters 2010-12-02 11:44:38 -07:00
Ben Mabey
3a393ebcf9 adds make-apply-filters 2010-12-01 16:22:19 -07:00
Ben Mabey
ca7281d214 allows attributes in filters to be specified by name in addition to index 2010-12-01 14:36:05 -07:00
Ben Mabey
78fc93c2a4 allow for dataset-set-class to take the name of attribute 2010-12-01 11:37:27 -07:00
Ben Mabey
1f3900a628 replaces some loop/recurs with more conciser idiomatic clojure 2010-11-24 12:05:30 -07:00
Ben Mabey
ccc12c2d5c adds dataset-replace-attribute! 2010-11-08 10:37:42 -07:00
Ben Mabey
2d274d2598 adds into-fast-vector util fn 2010-11-08 10:37:42 -07:00
Ben Mabey
eabb2acca5 adds helper fns in data and fixes some docstring misplacements 2010-11-08 10:37:42 -07:00
Ben Mabey
55ff30fcfa fixes is-dataset? to report correctly 2010-11-04 11:50:42 -06:00
Ben Mabey
249424ad81 adds ClojureBatchFilter 2010-11-03 11:10:41 -06:00
Ben Mabey
1412e9a86f allows ClojureStreamFilter to use a fn for updating the dataset-format 2010-11-02 17:42:03 -06:00
Ben Mabey
78a99eaa35 adds basic ClojureStreamFilter which accepts a clojure fn for processing 2010-11-02 16:51:15 -06:00
Ben Mabey
d87640e464 adds add-attribute filter 2010-11-01 17:13:35 -06:00
Ben Mabey
b09a59025f adds numeric to nominal filter 2010-11-01 14:05:11 -06:00