Converting mixed data set to numerical data set - data-mining

In my project I have to work with a mixed dataset (i.e., it has both categorical and numerical data). Is there an algorithm or method for converting categorical values to numerical values, so that my dataset finally contains only numerical values? Can anyone please help me out?
(I'm doing my project in MATLAB.)

Use one-hot encoding: replace each categorical attribute with one binary (0/1) column per category.
But don't expect the results to be very good. A lot of meaning is lost this way.
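The question concerns MATLAB; the sketch below uses plain Python only to illustrate the idea of one-hot encoding, and the same logic carries over to any language. The function name `one_hot_encode`, the column names and the sample rows are made up for illustration, and the sketch assumes every row has the same columns in the same order.

```python
def one_hot_encode(rows, categorical_columns):
    """Expand each categorical column into one 0/1 indicator column per category."""
    # Collect the sorted set of categories seen in each categorical column.
    categories = {
        col: sorted({row[col] for row in rows}) for col in categorical_columns
    }

    # Build the new header: one "column=value" entry per category.
    header = []
    for col in rows[0]:
        if col in categories:
            header.extend(f"{col}={value}" for value in categories[col])
        else:
            header.append(col)

    # Encode every row; numerical columns are passed through as floats.
    encoded = []
    for row in rows:
        out = []
        for col, value in row.items():
            if col in categories:
                out.extend(1.0 if value == c else 0.0 for c in categories[col])
            else:
                out.append(float(value))
        encoded.append(out)
    return header, encoded


# Hypothetical mixed dataset: one numerical and two categorical attributes.
rows = [
    {"height": 1.8, "colour": "red", "size": "L"},
    {"height": 1.6, "colour": "blue", "size": "S"},
    {"height": 1.7, "colour": "red", "size": "M"},
]
header, encoded = one_hot_encode(rows, ["colour", "size"])
print(header)
for encoded_row in encoded:
    print(encoded_row)
```

In this toy data the `size` attribute shows one kind of meaning that gets lost: after encoding, S, M and L are three unrelated columns, and their natural order is no longer visible to the algorithm.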

Related

Why isn't any algorithm working in WEKA when my dataset is fully loaded?

I am trying to analyze a dataset in WEKA with a nominal class. The other attributes are a mix of numeric and nominal values. Only a very few of the algorithm options are available. Can you please tell me why this is happening?

Random Effects Model - Proc HPMIXED vs MIXED Dropped Intercept?

We have a large dataset, and when we run proc mixed or hpmixed on a sample of the data, we get the same coefficients for the continuous variables. However, for HPMIXED, SAS ignores the reference values we give it for categorical variables, and the intercept becomes 0.
Is there an option to force HPMIXED to use an intercept and to 'not ignore' the ref="" values we choose?
Thank you in advance for your help!

Is there a way to replace existing values with NaN?

I'm experimenting with algorithms in IPython Notebooks and would like to know whether I can replace existing values in a dataset with NaN (about 50% or more) at random positions, with each column having a different proportion of NaN values.
I'm using the Iris dataset for this experimentation to see how the algorithms work and which one works the best.
Thanks in advance for the help.
There is a replace function in Python.
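As a rough illustration of what the question asks for, here is a minimal sketch using only the standard library. It is not the replace function mentioned above: it picks random row positions per column and overwrites them with NaN. The function name `mask_with_nan`, the per-column fractions and the small stand-in table are assumptions made for this example; the real Iris data would be loaded separately.

```python
import math
import random


def mask_with_nan(rows, fractions, seed=0):
    """Return a copy of rows where a fraction fractions[j] of column j is NaN."""
    rng = random.Random(seed)  # fixed seed so the experiment is repeatable
    n_rows = len(rows)
    masked = [list(row) for row in rows]  # copy so the input stays untouched
    for j, fraction in enumerate(fractions):
        n_missing = round(fraction * n_rows)
        # Choose distinct random row positions for this column.
        for i in rng.sample(range(n_rows), n_missing):
            masked[i][j] = math.nan
    return masked


# Made-up numeric table standing in for the four Iris measurements.
data = [[i + 0.1, i + 0.2, i + 0.3, i + 0.4] for i in range(10)]

# A different proportion of missing values in each column.
fractions = [0.5, 0.6, 0.7, 0.8]
masked = mask_with_nan(data, fractions)

# Count the NaN values per column to confirm the proportions.
nan_counts = [sum(math.isnan(row[j]) for row in masked) for j in range(4)]
print(nan_counts)
```

Sampling positions without replacement gives each column exactly the requested share of missing values, which makes comparisons between algorithms easier to reproduce than flipping an independent coin per cell.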

SAS - PUT function used to convert bigint to char - generating incorrect output

I need to access bigint data for processing, so I am trying to convert it to character and process it. However, I noticed that some bigint numbers were not appearing in the output table. The sample program below replicates the issue.
data m;
  /* The numeric literal is stored as an 8-byte double before PUT formats it */
  x = put(57259186099807233, 20.);
run;
proc print data=m;
run;
Why does this give me output “57259186099807232”?
Thanks in advance.
NV
See http://support.sas.com/documentation/...
The largest integer value that can be 'safely' stored in SAS (on Windows systems) is 9,007,199,254,740,992 (2^53), which is smaller than the value in your example.
With bigger values you will inevitably lose precision, a fact which your example illustrates quite well.
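The same rounding can be reproduced outside SAS, because it comes from the 8-byte floating-point format rather than from the PUT function. The short Python sketch below is only an illustration of that arithmetic, with made-up variable names; it is not SAS code.

```python
value = 57259186099807233

# Converting to an 8-byte double rounds to the nearest representable value.
as_double = float(value)
print(int(as_double))  # 57259186099807232, the value from the question

# Above 2**53, consecutive integers can no longer all be told apart.
limit = 2 ** 53
print(limit)  # 9007199254740992
print(float(limit + 1) == float(limit))  # True

# Kept as text, every digit of the original value survives.
as_text = str(value)
print(as_text)
```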

Salvaging binary code in SAS

I am working with a large SAS database, and it appears that a few of the columns were derived from binary sources and forced into a character format. I believe this is the case because all sorts of crazy characters are appearing in the column: ##, ??, and the "female" symbol, to name a few. Is there a way to salvage this data and convert it into a usable format, or do I need to correct the way the table is populated from the original data source?