Codename: cxsmiles,cxsmarts
SMILES_String |<feature1>,<feature2>,...|
In extended smiles export the following additional features are exported:
The relative stereoconfiguration is stored as "r". The absolute stereoconfiguration is the default, which is not marked. (Absolute stereoconfiguration known also as "Chiral flag" in MDL molfiles.)
The following stereochemical group types are stored:
Atom labels / aliases are written between "$" characters each label is separated by ";" characters.
Atom indexes relating to wiggly bonds are written after "w:" followed by a dot character and the wiggly bond index. The wiggly bonds are separated by commas.
Bond indexes of the double bonds in SSSR are written.
The bond stereo information is generated as the following:
the double bond has the representation a1-a2=a3-a4, where
Atom indexes with local ODD parity are written after "@:", while atom indexes with local EVEN parity are written after "@@:" characters separated by commas.
Atom indexes with
The indexes of the atoms having lone electron pairs are written after "LP:".
The multicenter atom indexes written after "m:" followed by a colon character and the indexes of the atoms which forms the given SGroup separated by ".". The SGroups are separated by commas.
Atom indexes relating to coordinate bond indexes are written after "COORDINATE:" followed by a dot character and the coordinate bond index. The coordinate bonds are separated by commas.
Example: "m:0:7.6.5.4.3,2:12.11.10.9.8,COORDINATE:0.0,2.1"The link node atom indexes are written after "LN:" followed by a colon character, the minimum repetitions, maximum repetitions, the node first and second outer atom indexes separated by ".". If the link node has only two connections, then the first and second outer atom indexes are obvious, so they are omitted. The link nodes separated by commas.
Example: "LN:1:1.5.3.0,6:1.2.7.5,9:1.10.10.8"
The atomic coordinates are written between parentheses.
Each atomic coordinate triplet (x, y, z) is separated by semicolon, and the
x y z coordinates are separated by commas. Zero coordinates are omitted.
Note: The CIS/TRANS information is rednundant in this case. It is specified
in the SMILES string and also in the atomic coordinates. The
atomic coordinates has priority over the SMILES string.
Export options can be specified in the format string. The format descriptor
and the options are separated by a colon.
All options have default values (see below).
Using the "+" or "-" sign the default export values
can be changed to "true" or "false" respectively. If the option is given without "+" or "-" modifier than the
default values are not used and only the specific feature is exported.
Examples:
"cxsmiles:" writes all default features
(absolute stereoconfiguration, enhanced
stereo features, atom labels, wiggly bond indexes, ring stereo bond info and
reaction fragment level grouping),
"cxsmiles:lc" writes the atom labels and the atomic coordinates only,
"cxsmiles:+c" writes writes all default features and the atomic coordinates,
"cxsmiles:-le" writes absolute stereoconfiguration, enhanced
stereo features, ring stereo bond info and
reaction fragment level grouping but not atom labels and
wiggly bond indexes.
See also SMILES export options and basic export options.