Hi there
I'm starting to have a look at .ttx files created with Trados. To start with, they have the same internal structure: a null character before (or is it after?) each ASCII character. I've found a better way of getting rid of them (better than the sed script I posted in another thread):
Code:
pabloa:~/Development$ hexdump -C test_all.xml.ttx | head
00000000  ff fe 3c 00 3f 00 78 00  6d 00 6c 00 20 00 76 00  |..<.?.x.m.l. .v.|
00000010  65 00 72 00 73 00 69 00  6f 00 6e 00 3d 00 27 00  |e.r.s.i.o.n.=.'.|
00000020  31 00 2e 00 30 00 27 00  3f 00 3e 00 0d 00 0a 00  |1...0.'.?.>.....|
00000030  3c 00 54 00 52 00 41 00  44 00 4f 00 53 00 74 00  |<.T.R.A.D.O.S.t.|
00000040  61 00 67 00 20 00 56 00  65 00 72 00 73 00 69 00  |a.g. .V.e.r.s.i.|
00000050  6f 00 6e 00 3d 00 22 00  32 00 2e 00 30 00 22 00  |o.n.=.".2...0.".|
00000060  3e 00 3c 00 46 00 72 00  6f 00 6e 00 74 00 4d 00  |>.<.F.r.o.n.t.M.|
00000070  61 00 74 00 74 00 65 00  72 00 3e 00 3c 00 54 00  |a.t.t.e.r.>.<.T.|
00000080  6f 00 6f 00 6c 00 53 00  65 00 74 00 74 00 69 00  |o.o.l.S.e.t.t.i.|
00000090  6e 00 67 00 73 00 20 00  43 00 72 00 65 00 61 00  |n.g.s. .C.r.e.a.|
pabloa:~/Development$ cat test_all.xml.ttx | tr -d "\0" >test.clean
pabloa:~/Development$ hexdump -C test.clean | head
00000000  ff fe 3c 3f 78 6d 6c 20  76 65 72 73 69 6f 6e 3d  |..<?xml version=|
00000010  27 31 2e 30 27 3f 3e 0d  0a 3c 54 52 41 44 4f 53  |'1.0'?>..<TRADOS|
00000020  74 61 67 20 56 65 72 73  69 6f 6e 3d 22 32 2e 30  |tag Version="2.0|
00000030  22 3e 3c 46 72 6f 6e 74  4d 61 74 74 65 72 3e 3c  |"><FrontMatter><|
00000040  54 6f 6f 6c 53 65 74 74  69 6e 67 73 20 43 72 65  |ToolSettings Cre|
00000050  61 74 69 6f 6e 44 61 74  65 3d 22 32 30 31 31 30  |ationDate="20110|
00000060  36 31 30 54 31 38 34 33  35 38 5a 22 20 43 72 65  |610T184358Z" Cre|
00000070  61 74 69 6f 6e 54 6f 6f  6c 3d 22 53 44 4c 20 54  |ationTool="SDL T|
00000080  52 41 44 4f 53 20 54 61  67 45 64 69 74 6f 72 22  |RADOS TagEditor"|
00000090  20 43 72 65 61 74 69 6f  6e 54 6f 6f 6c 56 65 72  | CreationToolVer|

Apart from the first two bytes (ff fe), which should be stripped off, the cleaned file is plain XML. Does anyone have insights into the inner structure of this XML? In the meantime I'll keep posting my findings.
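A possible alternative to stripping the nulls, offered as a sketch rather than a tested recipe for real Trados files: the leading ff fe is the byte-order mark of little-endian UTF-16, and in that encoding every ASCII character is followed by a null byte, which matches the dump. If that assumption holds, converting the encoding with iconv should remove both the nulls and the two leading bytes in one step, and unlike `tr -d "\0"` it should not mangle non-ASCII characters. The file names `sample.ttx` and `sample.clean` below are made up for illustration, and the sample holds only the first five characters of the header.

```shell
# Build a tiny stand-in for a .ttx file: the UTF-16LE byte-order mark (ff fe)
# followed by "<?xml", with one null byte after each ASCII character.
printf '\377\376<\000?\000x\000m\000l\000' > sample.ttx

# Convert to UTF-8. With a plain "UTF-16" source encoding, iconv uses the
# byte-order mark to choose the byte order and does not copy it to the output.
iconv -f UTF-16 -t UTF-8 sample.ttx > sample.clean

# Dump the result as hex bytes to check that the mark and the nulls are gone.
od -An -tx1 sample.clean
```

On a real file the same conversion would be `iconv -f UTF-16 -t UTF-8 test_all.xml.ttx > test.clean`. If the XML declaration in the file names an encoding, it may need adjusting after the conversion.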
Cheers.
P.