Clear Filters
Clear Filters

Not able to make dummy variables

1 view (last 30 days)
Malene Brautaset
Malene Brautaset on 19 Mar 2022
Commented: Voss on 25 Mar 2022
Hello.
We have a problem making dummy variables since our variables has ' ' around them ('M'). We have a column with 'M' and 'F' and have to make dummy variables in order to use it in our regression.
How do we fix this?

Answers (1)

Voss
Voss on 19 Mar 2022
Edited: Voss on 19 Mar 2022
C = readcell('data.csv')
C = 4×6 cell array
{'pcid' } {'year'} {'currency'} {'gender'} {'revenue'} {'netIncome'} {[887700702]} {[2010]} {''NOK'' } {''F'' } {[9837000]} {[ 533000]} {[887704902]} {[2010]} {''NOK'' } {''M'' } {[ 757000]} {[ -224000]} {[887705992]} {[2010]} {''NOK'' } {''M'' } {[6631000]} {[ 681000]}
dummy_gender = strcmp(C(2:end,4),'''M''')
dummy_gender = 3×1 logical array
0 1 1
  2 Comments
Malene Brautaset
Malene Brautaset on 25 Mar 2022
Thank you! This worked. But however, we want to keep our dataset in a table; is there a code we can use for the original dataset in a table?
Voss
Voss on 25 Mar 2022
You're welcome!
Sure, here is a table version:
T = readtable('data.csv')
T = 3×6 table
pcid year currency gender revenue netIncome __________ ____ _________ _______ _________ _________ 8.877e+08 2010 {''NOK''} {''F''} 9.837e+06 5.33e+05 8.877e+08 2010 {''NOK''} {''M''} 7.57e+05 -2.24e+05 8.8771e+08 2010 {''NOK''} {''M''} 6.631e+06 6.81e+05
dummy_gender = strcmp(T{:,'gender'},'''M''')
dummy_gender = 3×1 logical array
0 1 1

Sign in to comment.

Products

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!