How to parse a file with value pairs

Hi, I was looking for ideas on how best to parse a text file whose lines are in the following format:
!FIX.BODY=xxxxx|2125=aaaaaa|1067=bbbbb|150=ccccc|329=ddddd|
Each line is delimited by '|', and each field is a name-value pair: the column name, followed by an equals sign (=), followed by the value.
So for the example above I would like to end up with a table like this:
FIX.BODY    2125      1067     150      329
________    ______    _____    _____    _____
xxxxx       aaaaaa    bbbbb    ccccc    ddddd
  3 Comments
Ruben Ruiloba on 25 Aug 2020
Hi, thanks for the reply. Yes, the file has multiple lines, and the lines don't all have the same number of variable names.
The lines can differ. I can't upload the file as it has confidential data, but it would look something like this:
!FIX.BODY=xxxxxx|2125=aaaaaa|1067=bbbbb|150=ccccc|329=dddddd|
!FIX.BODY=xxxxxx|2125=akcklsd|150=cchscc|329=ddddsd|
!FIX.BODY=xxxxxx|2125=ajfalkjfa|4555=ndlsuel|908=akncld|123=hdeudnc|
Thanks
Stephen23 on 25 Aug 2020
Edited: Stephen23 on 25 Aug 2020
"I can't upload the file as it has confidential data but it would look something like this."
A sample file does not have to contain confidential data, because you can write it with random, invented, made-up data. However, it should include all of the salient features of the actual data files, such as the EOL character/s, the file encoding (especially if the file is generated by some other application), representative character strings, number encodings, etc.
Providing a file gives us an agreed reference with which we can test our code.
Not providing a file delays the solution you want.
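As an illustration of the point above, a sample file can be written from invented data in a few lines. This is only a sketch: it reuses the three made-up records quoted earlier in the thread, the variable names `rec` and `fname` are hypothetical, and the file name `temp0.txt` is an assumption. It also uses the platform's default text-mode line endings, which may not match the real file.

```matlab
% Invented records copied from the thread (no confidential data).
rec = {
    '!FIX.BODY=xxxxxx|2125=aaaaaa|1067=bbbbb|150=ccccc|329=dddddd|'
    '!FIX.BODY=xxxxxx|2125=akcklsd|150=cchscc|329=ddddsd|'
    '!FIX.BODY=xxxxxx|2125=ajfalkjfa|4555=ndlsuel|908=akncld|123=hdeudnc|'
    };
fname = 'temp0.txt';              % assumed file name
[fid,msg] = fopen(fname,'wt');    % open for writing in text mode
assert(fid>=3,msg)                % stop with the FOPEN message on failure
fprintf(fid,'%s\n',rec{:});       % the format is reused for each record
fclose(fid);
```

If the real file comes from another application, its line endings and encoding would need to be reproduced separately; this sketch does not attempt that.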

Accepted Answer

Stephen23 on 25 Aug 2020
Edited: Stephen23 on 25 Aug 2020
This works with the attached file (which I had to create myself):
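T = table();
[fid,msg] = fopen('temp0.txt','rt');
assert(fid>=3,msg) % stop with the FOPEN message if the file could not be opened
while ~feof(fid)
    str = fgetl(fid); % read one line, without the newline
    spl = regexp(str,'[^!|=]+','match'); % name,value,name,value,...
    vnm = genvarname(spl(1:2:end)); % GENVARNAME is not required for R2019b or later
    % Append one row; names not yet in T become new columns.
    T{end+1,vnm} = spl(2:2:end); %#ok<SAGROW>
end
fclose(fid);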
Giving:
>> T
T =
    FIX0x2EBODY    x2125          x1067      x150        x329        x4555        x908        x123
    ___________    ___________    _______    ________    ________    _________    ________    _________
    'xxxxxx'       'aaaaaa'       'bbbbb'    'ccccc'     'dddddd'    []           []          []
    'xxxxxx'       'akcklsd'      []         'cchscc'    'ddddsd'    []           []          []
    'xxxxxx'       'ajfalkjfa'    []         []          []          'ndlsuel'    'akncld'    'hdeudnc'
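To see what the loop does with a single record, the splitting step can be run on one of the sample lines. The sketch below is illustrative only: the variable names `names`, `values` and `valid` are hypothetical, and `matlab.lang.makeValidName` is shown as one possible alternative to GENVARNAME, not as part of the answer's code. It assumes that no value is empty and that values never contain '!', '|' or '='.

```matlab
str = '!FIX.BODY=xxxxxx|2125=aaaaaa|1067=bbbbb|150=ccccc|329=dddddd|';

% Match every run of characters that is not '!', '|' or '='.
% The matches alternate: name, value, name, value, ...
spl = regexp(str,'[^!|=]+','match');

names  = spl(1:2:end);   % {'FIX.BODY','2125','1067','150','329'}
values = spl(2:2:end);   % {'xxxxxx','aaaaaa','bbbbb','ccccc','dddddd'}

% Optional: turn the names into valid MATLAB identifiers.
% Invalid characters become '_' and a leading digit gets the prefix 'x'.
valid = matlab.lang.makeValidName(names); % {'FIX_BODY','x2125','x1067','x150','x329'}
```

Because the pattern skips empty runs, a field with an empty value (for example `150=|`) would yield a name with no matching value and shift the odd/even pairing for the rest of the line. If that can occur in the real data, splitting on '|' first and then on the first '=' of each field would be a safer approach.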
  1 Comment
Ruben Ruiloba on 25 Aug 2020
Thanks, this has given me ideas and is exactly what I was looking for. Thanks again.


Release: R2019b
