How can I discretize a set of values in WEKA?

How can I discretize a set of values in WEKA? - weka

I have a set of integer values from 1 to 50 and I´d like to discretize this set in order to work with ranges instead of individual values (they are a huge amount of numbers for a decision tree). Let´s say I want a range from 1 to 25 and another range from 26 to 50. I´ ve tried with the Discretize filter but I don´t get anything to work.
Do you know how to do this in WEKA?
Thank you in advance

Try to set them as I did in this screenshot:
Here I used the ionosphere.arff dataset. AttributeIndecices "1" means "column 1".

Related

How to use proc means in SAS to calculate the mean of a value and divide it by 12 (so it would show up in a table)

I know how to find the mean but haven't found a way to divide it by some kind of number.
what it looks like: (how to divide all the number values by 12?)
inserted a pic of how i got it:

Is it possible to create a field well with the numbers 1-100 in?

I am currently creating a pivot table in Quicksight which I am looking to create a Field Well with a series of numbers from 1 to 100. They should be static and not effect the data in the table though. I have managed to do this in PowerBI before using the GENERATESERIES function.

Assuming you have 100 rows in your table, you could create a calculated field that uses denseRank() to assign every row a number from 1 to 100. You would need to make sure you rank on field(s) that do(es) not have duplicates. An index-based rank could work, for example.

Amazon Quicksight Calculated Fields

I'm having a problem with what should be a very simple calculated field. I have two conversion numbers, let's say a "7 day click" number and a "1 day view" number.
I'm trying to create a calculated field that simply adds these two numbers together.
Yet, this is the result I'm getting:
As we can see, the all conv value is not simply adding the other two columns. What am I doing wrong in the calculated field?
Thanks in advance, as I've spent way too much time trying to figure out such a simple thing
EDIT:
It appears that, when I add a new grouping, that the problem consists of null values in one of the two columns. example:
How can I change the calculated field to account for this? In the data warehouse, these blank values are just null values.
Thanks

I tried out what You did and it works fine for me:
QuickSight Table
Calculated field
I suppose there is something wrong with Your dataset.
Probably the granularity of the date field is not day but something else like hour and the calculated field tries to add eg. 11 Jun 10:00 + 12 Jun 11:00.
This is of course wrong. You need to add 11 Jun + 12 Jun.
Probably You need to extract the day of Yours date field.

to my question about the calculated field, this was what I used that worked.
ifelse( isNull({conv_1dv}), 0, {conv_1dv}) + ifelse(isNull({conv_7dc}),0,{conv_7dc})

In DataPrep, sum a set of many columns or values in an object

I have a DataPrep dataset which contains a series of ~10 columns, each of which indicates whether or not a particular brochure was selected:
BRO_AF BRO_SAF BRO_SE ...
1 1
1 1
1
I'd like to sum/count these values into a BrochuresSelected column.
I was hoping to use ADD with a column range (ie BRO_AF~BRO_ITA), but ADD only takes two numbers.
I can't use COUNT, as it counts rows not columns.
I can use NEST to create a column storing a map or array of brochures, but there doesn't seem to be a function for adding these. I can't use ARRAYLEN on this column, as even empty columns are represented in the column (eg ["1","","","","",""] would have an array length of six, not one).
Has anyone solved a similar issue?

If you know the column names, you can use the + operator in a derive transform. For example:

How to sum entries that have same ID OpenOffice - Calc

I have a spreadsheet similar to the one in the screenshot.
From this I want to sum all the entries in Data 2 which have the same Data 1 ID and store it in another column. So something like this:
I am not able to figure out the formula which would do this. I figured out how to get a column with unique entries I just need to figure out how to get the sum of the values which have the same data 1 id.
Can someone point me in the right direction?

You can use SUMIF, e.g. if I'm reading your sheet right, =SUMIF(A$2:A$7, A11, B$2:B$7), and then copy down. This sums the values from B2-B7 whenever the corresponding value in A2-A7 matches A11.
You can find more on SUMIF here.

you may use subtotals function under the Data tab, although it gives you the answer in the row below all the cell matching your id; then you have to click the - and + buttons that appear on the left...you may see these in this picture of my data
this is a nice resource when you have non-English characters(á, ó, ß,...), spaces, dashes and points, so it becomes difficult to process with sql

We Keep Coding

c++ django amazon-web-services regex python-2.7 google-cloud-platform list unit-testing opengl ember.js

How can I discretize a set of values in WEKA? - weka

Try to set them as I did in this screenshot: Here I used the ionosphere.arff dataset. AttributeIndecices "1" means "column 1".

Related

How to use proc means in SAS to calculate the mean of a value and divide it by 12 (so it would show up in a table)

Is it possible to create a field well with the numbers 1-100 in?

Amazon Quicksight Calculated Fields

In DataPrep, sum a set of many columns or values in an object

How to sum entries that have same ID OpenOffice - Calc

Categories

Resources