Splitting Characters in A Cell Array
3 views (last 30 days)
Show older comments
Hi All,
I am trying to split some content in a cell array into separate portions. I've tried converting to a string and using strsplit, but I am not getting the results I want because of the datatype syntax.
Came across the cellfun command, but not really sure how to implement it.
Here is what I have
'P245/65R17 105S'
'P265/70R16 111S'
'P275/55R20 111H'
'285/60R18 120H'
'P235/70R17 108S'
What I need:
'P245/' '65' 'R' '17' '105' 'S'
'P265/' '70' 'R' '16' '111' 'S'
'P275/' '55' 'R' '20' '111' 'H'
'285/' '60' 'R' '18' '120' 'H'
'P235/' '70' 'R' '17' '108' 'S'
Thanks in advance!
0 Comments
Accepted Answer
Jan
on 11 Nov 2015
Data = {'P245/65R17 105S'; ...
'P265/70R16 111S'; ...
'P275/55R20 111H'; ...
'285/60R18 120H'; ...
'P235/70R17 108S'};
n = numel(Data);
Result = cell(n, 6);
for k = 1:n
S = Data{k};
p = strfind(S, '/');
% 'P245/65R17 105S'
% 'P245/' '65' 'R' '17' '105' 'S'
Result(k, :) = {S(1:p), S(p+1:p+2), S(p+3), S(p+4:p+5), S(p+7:p+9), S(p+10)};
end
Does this help already? Or do strings appear, which do not match this pattern? If so, you can search for the space also, use the length of the strings or whatever.
5 Comments
Jan
on 13 Nov 2015
@Aldrich: The shown result cannot be represented in Matlab. If it is stored as a cell string, the missing elements must be at least [], because an array must have the same number of elements per row.
More Answers (1)
Guillaume
on 11 Nov 2015
Edited: Guillaume
on 11 Nov 2015
data = {'P245/65R17 105S';
'P265/70R16 111S';
'P275/55R20 111H';
'285/60R18 120H';
'P235/70R17 108S'};
splitdata = regexp(data, '(.+/)(\d+)([A-Z])(\d+) (\d+)([A-Z])', 'tokens', 'once');
splitdata = vertcat(splitdata{:})
The regular expression is divided into tokens (the () in the regex)
- the 1st token is one or more (the +) character (the .) followed by '/'
- the 2nd token is one or more (the +) digit (the \d)
- the 3rd token is a single character between A and Z (the [A-Z])
- 4th token, see 2nd
- it then matches a space which is not part of any token
- 5th token, see 2nd
- 6th token, see 3rd
3 Comments
Guillaume
on 13 Nov 2015
A regex that would most likely work with all your cases would be
regexp(data, '([A-Z]*)(\d+)(/)(\d+)([A-Z])(\d+) (\d+(/\d+)?)?([A-Z])', 'tokens')
See Also
Categories
Find more on Characters and Strings in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!