Commit graph

31 commits

Author SHA1 Message Date
Joel 'Aaron' Cohen
72e21f6e7a Minimal changes to compile against Weka 3.7.10
There is a weird junit dependency being pulled in that I haven't
figured out.
2014-03-19 19:18:32 -04:00
Joshua Eckroth
19ce704093 Fixed some bugs in new docs-to-dataset, added test. 2013-12-24 08:21:17 -05:00
Joshua Eckroth
937b9bf87d Fixed some tests that broke when nominal attributes changed from string to keyword representations. 2013-08-07 10:32:50 -04:00
Joshua Eckroth
db70ee980f Fixed indentation. 2013-07-31 06:50:59 -04:00
Ben Mabey
dc12081b4a attaches instance weight info as metadata when converting to a map 2012-03-26 12:20:45 -06:00
Ben Mabey
042de1895a updates project to clojure 1.3, removes contrib and autodoc deps 2012-01-30 15:27:54 -07:00
Ben Mabey
fa2d39ffe5 change the dataset's name when randomized (like the filters do) 2011-12-13 20:10:11 -07:00
Ben Mabey
8a01361512 adds take-dataset 2011-12-08 15:28:27 -07:00
Ben Mabey
d313f621d1 adds remove-range filter so split-dataset can take a number to split on 2011-12-08 15:13:34 -07:00
Ben Mabey
0c32c318af adds split-dataset via new remove-percentage filter wrapper 2011-12-07 11:15:19 -07:00
Ben Mabey
7357a720e9 makes representation of NaNs (no values) symmetric with creation- nil 2011-04-05 13:55:21 -06:00
Ben Mabey
a83e21d72b allows for creation of instances (and datasets) with nil values 2011-04-05 13:34:23 -06:00
Ben Mabey
44df9744c1 adds dataset-as-vecs and dataset-as-lists 2011-01-04 10:02:48 -07:00
Ben Mabey
8cb6620b6f adds dataset-class-name 2010-12-30 10:14:57 -07:00
Ben Mabey
7cd8e08446 makes attribute-labels fn that returns set of labels and renames old one 2010-12-29 15:08:05 -07:00
Ben Mabey
228c58f74c adds headers-only fn to extract the headers of a weka dataset 2010-12-22 14:23:44 -07:00
Ben Mabey
68cf83b712 fixes dataset-format to return nominal labels in correct order
Also made it more idiomatic and concise by using reduce...
2010-12-16 16:58:53 -07:00
Ben Mabey
4d18af461e not keywording the instance nominal values by default due to performance
On large datasets if you use dataset-as-maps too much time is taken up
in interning the nominal values time and time again.  For the vast
majority of use cases string values should be fine.  If we were in
clojure-land 100% then keywords would be the better option.. but we
aren't.
2010-12-08 16:10:45 -07:00
Ben Mabey
81cda11ed2 DRYs up some fns and replaces custom code with enumeration-seq 2010-12-08 16:05:41 -07:00
Ben Mabey
652c54aaf7 standardizes how attributes can be asked for (by index or name) 2010-12-07 14:14:20 -07:00
Ben Mabey
99048715f1 adds tests for dataset label helper fns 2010-12-07 13:48:52 -07:00
Ben Mabey
138ee7f016 adds dataset-class-values 2010-12-06 17:36:36 -07:00
Ben Mabey
ca7281d214 allows attributes in filters to be specified by name in addition to index 2010-12-01 14:36:05 -07:00
Ben Mabey
78fc93c2a4 allow for dataset-set-class to take the name of attribute 2010-12-01 11:37:27 -07:00
Ben Mabey
1f3900a628 replaces some loop/recurs with more conciser idiomatic clojure 2010-11-24 12:05:30 -07:00
Ben Mabey
ccc12c2d5c adds dataset-replace-attribute! 2010-11-08 10:37:42 -07:00
Ben Mabey
eabb2acca5 adds helper fns in data and fixes some docstring misplacements 2010-11-08 10:37:42 -07:00
Ben Mabey
55ff30fcfa fixes is-dataset? to report correctly 2010-11-04 11:50:42 -06:00
Antonio Garrote
2670b037b2 Small improvements 2010-03-03 14:46:08 +01:00
Antonio Garrote
eb60fe6d31 Some tests for clusterers 2010-02-28 20:51:17 +01:00
Antonio Garrote
360e507bd2 First commit 2010-02-28 13:14:17 +01:00