Toxtree User Manual
Toxtree User Manual
Volume
IDEACONSULT
ToxtreeUserManual
I D E ACONS UL T
Ideaconsult Ltd.
4 Angel Kanchev St.
1000 Sofia, Bulgaria
Phone +359 886802011 • Email nina@acad.bg
Introduction
Toxtree is a full-featured and flexible user-friendly open source application, which is
able to estimate toxic hazard by applying a decision tree approach. Currently it includes
the following plug-ins:
• Verhaar scheme for predicting toxicity mode of actions (Verhaar HJM, van
Leeuwen CJ and Hermens JLM (1992) Classifying environmental pollutants. 1.
Structure-activity relationships for prediction of aquatic toxicity. Chemosphere
25, 471-491);
• A decision tree for estimating skin irritation and corrosion potential, based on
rules published in “The Skin Irritation Corrosion Rules Estimation Tool
(SICRET), John D. Walker, Ingrid Gerner, Etje Hulzebos, Kerstin Schlegel,
QSAR Comb. Sci. 2005, 24, pp. 378-384”;
• A decision tree for estimating eye irritation and corrosion potential, based on
rules published in “Assessment of the eye irritating properties of chemicals by
applying alternatives to the Draize rabbit eye test: the use of QSARs and in
vitro tests for the classification of eye irritation, Ingrid Gerner, Manfred
Liebsch & Horst Spielmann, Alternatives to Laboratory Animals, 2005, 33, pp.
215-237”;
1 http://ecb.jrc.it/documents/QSAR/EUR_23241_EN.pdf
1
T O X T R E E U S E R M A N U A L
• Structure Alerts for the in vivo micronucleus assay in rodents, based on the
rules, published in the document “Development of structural alerts for the in
vivo micronucleus assay in rodents”, by Romualdo Benigni, Cecilia Bossa,
Olga Tcheremenskaia and Andrew Worth3, European Commission report
EUR 23844 EN;
• Cramer rules with extensions: This plug-in is a copy of the original plug-in,
plus minor extensions. Like the Cramer plug-in, this plug-in works by assigning
compounds to Class I, II, or III, according to the rules from Cramer, and
some extra ones. Several compounds were classified by Munro in 19964 as
Class I or Class II compounds according to the Cramer rules, even though
Munro reported low NOEL values upon oral administration (indicating
relatively high toxicity). To overcome such misclassifications, five rules have
been introduced to capture the possible toxicity of these compounds;
Toxtree could be applied to datasets from various compatible file types. User-defined
molecular structures are also supported - they could be entered by SMILES, or by
using the built-in 2D structure diagram editor.
The Toxtree application is suitable for a standalone PC. It has been designed with
flexible capabilities for future extensions in mind (e.g. other classification schemes that
could be developed at a future date). New decision trees with arbitrary rules can be
built with the help of graphical user interface or by developing new plug-ins.
2 http://ecb.jrc.ec.europa.eu/DOCUMENTS/QSAR/QSAR_TOOLS/Toxtree_start_manual.pdf
3 http://ecb.jrc.ec.europa.eu/DOCUMENTS/QSAR/EUR_23844_EN.pdf
4 I.C. Munro, R.A. Ford, E. Kennepohl, and J.G. Sprenger, Correlation of structural class with No-Observed-
Effect Levels: A proposal for establishing a threshold of concern, Food Chem. Toxicol.34 (1996), pp. 829–
867.
2
T O X T R E E U S E R M A N U A L
Background
Cramer rules
The threshold of toxicological concern (TTC) is an open research topic with significant
practical implications. Two principal approaches exist in the thresholds developed to
date; a general threshold or a threshold developed in relation to toxicological data or
structural information. Thresholds based on structural information have typically been
developed by the principles established by Cramer. Chemicals are divided into three
structural classes based on a decision tree. This comprises some 33 structural rules and
places evaluated compounds into one of three classes:
• Class III substances are those that permit no strong initial presumption of
safety, or may even suggest significant toxicity or have reactive functional
groups;
The Cramer classification scheme was the first decision tree scheme, implemented in
Toxtree. If it is not possible to reduce the rule to a specific substructure search, an
extendable list of compounds is used as the input, for making a decision.
This plug-in is a copy of the original plug-in, plus minor extensions. Like the Cramer
plug-in, this plug-in works by assigning compounds to Class I, II, or III, according to
the rules from Cramer, and some extra ones. Several compounds were classified by
Munro in 1996 as Class I or Class II compounds according to the Cramer rules, even
though Munro reported low NOEL values upon oral administration (indicating
relatively high toxicity). To overcome such misclassifications, five rules have been
introduced to capture the possible toxicity of these compounds.
Verhaar scheme
This plugin implements the Verhaar scheme for predicting toxicity mode of action,
according to:
3
T O X T R E E U S E R M A N U A L
This plug-in is developed by Ideaconsult Ltd (Sofia, Bulgaria) within the AMBIT
project, funded by CEFIC LRI.
1. Ingrid Gerner, Kerstin Schlegel, John D. Walker, and Etje Hulzebosc, Use of
Physicochemical Property Limits to Develop Rules for Identifying Chemical
Substances with no Skin Irritation or Corrosion Potential, QSAR Comb. Sci.
2004, 23, pp.726-733
2. John D. Walker, Ingrid Gerner, Etje Hulzebos, Kerstin Schlegel, The Skin
Irritation Corrosion Rules Estimation Tool (SICRET), QSAR Comb. Sci.
2005, 24, pp.378-384
3. Etje Hulzebos, John D. Walker, Ingrid Gerner, and Kerstin Schlegel, Use of
structural alerts to develop rules for identifying chemical substances with skin
irritation or skin corrosion potential, QSAR Comb. Sci. 2005, 24, pp.332-342
Estimates eye irritation and corrosion potential by physicochemical property limits and
structural rules, according to:
Ingrid Gerner, Manfred Liebsch & Horst Spielmann, Assessment of the eye irritating
properties of chemicals by applying alternatives to the Draize rabbit eye test: the use of
QSARs and in vitro tests for the classification of eye irritation, Alternatives to
Laboratory Animals, 2005, 33, pp. 215-237.
This plug-in is developed by Ivanka Tsakovska and Nina Jeliazkova on behalf of JRC.
4
T O X T R E E U S E R M A N U A L
This plug-in is developed by Ideaconsult Ltd (Sofia, Bulgaria) within the AMBIT XT
project, funded by CEFIC LRI.
5
T O X T R E E U S E R M A N U A L
the classification results are displayed in a graphical user interface and can be
optionally saved as a file (CSV, SDF or TXT) file, together with the
classification path explanation (list of applied rules). This provides a
transparent audit of the decision path followed;
users may modify the tree by adding their own structural rules and save a
modified tree for future re-use;
the software provides a flexible and documented (in the doc/src application
subfolder) plug-in interface, allowing the integration of different classification
schemes developed at a future date;
the source code of the application is located in the src application subfolder;
Development tools
The Toxtree application is implemented in Java ™ . The basic cheminformatics
functionality relies on the open source LGPL licensed Java ™ library Chemistry
Development Kit (CDK). The Integrated Development Environment (IDE) Eclipse,
in conjunction with Apache Ant is the main development tool. Some of the Toxtree
capabilities are provided through the following open source libraries:
6
T O X T R E E U S E R M A N U A L
Launching Toxtree
In Windows ™ platforms, Toxtree can be launched either by using the “Start” menu
(Figure 1), or by double clicking on the Toxtree-X.YZ.jar file (the full path name is
“C:\Ideaconsult\Toxtree-vX.YZ\Toxtree\Toxtree-X.YZ.jar”).
5 http://openbabel.sourceforge.net/wiki/Main_Page
6 http://www.chembiogrid.org/cheminfo/smi23d/
7 http://www.l2fprod.com/
8 http://prefuse.org/
7
T O X T R E E U S E R M A N U A L
In all platforms (having Java ™ 2 Runtime, Standard Edition 1.5 or newer installed),
Toxtree can be launched by executing the following command (after decompressing
the ZIP archive distribution of Toxtree):
Please, note that in the above mentioned command “java” and Toxtree-
X.YZ.jar” should be eventually prefixed with the full path to java and Toxtree
on the destination platform.
The SMILES INPUT AREA can be used for entering a SMILES string.
Pressing the Go! button draws the structure diagram of the corresponding compound
in the COMPOUND STRUCTURE DIAGRAM AREA.
8
T O X T R E E U S E R M A N U A L
The << and >> buttons on the left hand side of the SMILES INPUT AREA are used for
navigating the history of entered SMILES’. This is also accessible also by means of a
drop-down list.
The FILE BROWSING AREA provides an easy way to navigate through the list of
compounds in the current open file.
The CLASSIFICATION AREA provides access to the Cramer classification results for
the current compound.
Pressing the Estimate button starts the classification routines for the current
compound (shown both in the COMPOUNDS PROPERTIES AREA and the
COMPOUND STRUCTURE DIAGRAM AREA).
The classification result is shown in graphical form (green highlight for class I, yellow
highlight for class II and red highlight for class III), as well as in text form.
The Verbose explanation checkbox determines the level of detail of the text-based
classification results.
9
T O X T R E E U S E R M A N U A L
CLASSIFICATION AREA
Opening/saving a file
Toxtree can open CML, CSV, HIN, ICHI, INCHI, MDL MOL, MDL SDF, MOL2,
PDB, SMI, TXT and XYZ file types.
You can open a supported type of file by using the “File ► Open” menu, as shown on
Figure 3. Molecules can be written to SDF, CSV or TXT files, together with their
classification data (class & path), by using the “File ► Save” menu.
10
T O X T R E E U S E R M A N U A L
11
T O X T R E E U S E R M A N U A L
12
T O X T R E E U S E R M A N U A L
13
T O X T R E E U S E R M A N U A L
o Default: Empty
o Default: Checked
o Default: Checked
o Default: Checked
9 http://openbabel.org/
14
T O X T R E E U S E R M A N U A L
o Default: helper/openbabel/win/babel.exe
o Default: helper/openbabel/linux/babel
o Default: helper
o Default: MOPAC_7.1.exe10
o Default: helper/smi23d/win/mengine.exe
o Default: helper/smi23d/linux/mengine
o Default: helper/smi23d/win/smi2sdf.exe
10 OpenMopac, http://openmopac.net/
11 mengine, http://www.chembiogrid.org/cheminfo/smi23d/
12 smi2sdf, http://www.chembiogrid.org/cheminfo/smi23d/
15
T O X T R E E U S E R M A N U A L
o Default: helper/linux/win/smi2sdf.exe
These general options are automatically saved in the toxtree.pref file, located in the
same directory as Toxtree-X.YZ.jar.
16
T O X T R E E U S E R M A N U A L
When the “Verhaar scheme” option is selected, the Verhaar scheme for predicting
toxicity mode of actions is loaded. The tree consists of 5 classes, as shown on Figure 8.
17
T O X T R E E U S E R M A N U A L
All of the functionality explained for the “Cramer rules” decision tree is valid for any
other tree selected.
The “Skin irritation/corrosion” decision tree for estimating skin irritation and
corrosion potential by physicochemical property limits and structural rules can be
loaded by selecting the corresponding line from the decision tree selection dialog
(Figure 7). The result of this operation is displayed in Figure 9.
18
T O X T R E E U S E R M A N U A L
For more details about skin irritation prediction, please consult the Skin
irritation/corrosion rules specifics section of the manual.
The “Eye irritation and corrosion” decision tree for estimating eye irritation and
corrosion potential by physicochemical property limits and structural rules can be
loaded by selecting the corresponding line from the decision tree selection dialog
(Figure 7). For more details about eye irritation prediction, please consult the Skin
irritation/corrosion rules specifics section of the manual.
Selecting the “Benigni / Bossa rulebase (for mutagenicity and carcinogenicity)” option
loads the corresponding decision tree as displayed in Figure 10.
19
T O X T R E E U S E R M A N U A L
For more details about mutagenicity and carcinogenicity prediction, please consult the
Benigni / Bossa rulebase (for mutagenicity and carcinogenicity) specifics
section of the manual.
the “New decision tree” submenu can be used for constructing a new decision
tree from scratch;
20
T O X T R E E U S E R M A N U A L
the “Select from list” submenu provides a list of known decision trees, which
can be loaded and subsequently edited in the “Decision tree editor”;
the “Load from file” submenu can be used for loading a user-supplied decision
tree in the “Decision tree editor”.
A copy of the chosen decision tree is loaded in memory for editing and can be
subsequently saved for future use.
You should always try to give a suitable (self-explaining) name, when saving
newly constructed from scratch or modified decision trees.
The “Decision tree editor” is loaded in a separate (child) window (Figure 12) and
provides tools for decision tree editing, as suggested by its title.
21
T O X T R E E U S E R M A N U A L
Decision tree area – placing the mouse on a tree node and right-clicking on it
provides access to a context menu. This menu can be used for rule editing, as
well as for modifying the “Yes” and/or “No” branches coming out of the
current node (highlighted in orange);
Decision node area – displays details about the currently selected node from
the Decision tree area. Left-clicking on a node in the Decision tree area
changes the current node.
Before exiting from the Decision tree editor, users should save the edited decision tree
by using the “File► Save” menu on the upper left corner of the main Decision tree
editor window. A reminder is displayed if the user tries to exit the Decision tree editor
without having saved his work.
22
T O X T R E E U S E R M A N U A L
Detailed instructions how to create and edit a decision tree are provided in section
“Decision tree editing - typical tasks”.
1. Load a file with chemical compounds (the following examples use the
DSSTox EPA Fathead Minnow file
EPAFHM_v3b_617_10Apr2006.sdf, downloaded from
http://www.epa.gov/ncct/dsstox/sdf_epafhm.html).
2. Run “Toxic Hazard► Estimate All” to apply Cramer rules for all
compounds.
23
T O X T R E E U S E R M A N U A L
The subsets selection dialog (Figure 13) displays three subsets, corresponding to
Cramer toxicity classes, and the number of compounds in each subset. Select the
second row (Intermediate (Class II)) and click OK. The subset of 22 compounds will
be loaded into Toxtree main window (Figure 14).
24
T O X T R E E U S E R M A N U A L
From this point on, all actions (e.g. File Save, Estimate All) are applied on the subset
loaded in the main window. Use the same menu to select another subset or to return to
the entire file. For the latest, select “All” options and then click OK (Figure 15).
25
T O X T R E E U S E R M A N U A L
Figure 15: Select “All” option to load the entire file into the main Toxtree window
The structure diagram editor provides a convenient way to edit the current compound
in Toxtree.
If needed, users could consult the structure diagram editor help and/or tutorial,
accessible through the “Help” menu of the editor.
26
T O X T R E E U S E R M A N U A L
Batch processing
When dealing with large datasets (more than 1000 molecules), Toxtree should be used
in batch processing mode. It can be accessed through the “File► Batch processing”
menu. Users are invited to select the input and output files (supported types are CSV,
TXT and SDF), before starting the batch processing. The batch can be paused or
aborted upon user request. Batch configuration/state can be saved and loaded at a later
time (Figure 17). In case of unexpected interruption (e.g. power failure, hardware
failure, operating system failure, etc…), the batch job can be continued from the last
previously fully processed record. In order to achieve this, the interrupted batch
configuration should be loaded (either from a user-specified file, or from the system
TEMP folder, where it is automatically stored if the user has not specified a file).
27
T O X T R E E U S E R M A N U A L
2. enter a molecular structure in the SMILES field (or open a file, containing
molecular structures - compatible file types are: CML, CSV, HIN, ICHI,
INCHI, MDL MOL, MDL SDF, MOL2, PDB, SMI, TXT and XYZ). Please,
note that CSV files can be read/written by MS Excel ™ ;
3. press "Estimate" (on the right upper part of the application window);
5. read the classification explanation (which Cramer rules have been applied);
6. go through the list of structures, loaded from a compatible file type (CML,
CSV, HIN, ICHI, INCHI, MDL MOL, MDL SDF, MOL2, PDB, SMI, TXT
and XYZ) by using the navigation links at the bottom of the left side of the
application window (note: CSV files can be read/written by MS Excel ™);
28
T O X T R E E U S E R M A N U A L
7. repeat steps from (3) to (5) for any chosen (in step (6)) molecular structure;
8. consult the decision tree (accessible through the Toxtree "Method ► View
decision tree" menu);
9. classify all the molecules, loaded from a file, by using the batch processing
facility;
10. save the processed molecules, together with classification data (class & path) in
a file (compatible types are CSV, SDF & TXT). Please, note that CSV files can
be read/written by MS Excel ™;
11. create (or edit existing) decision trees through the Toxtree "Method ► Edit
decision tree" menu;
12. change the decision tree used in the estimation through the Toxtree
"Method ► Select decision tree" menu;
13. modify the current molecule by using the integrated structure diagram editor
through the “Chemical Compounds ► Edit compound” menu.
The verbose console output could be redirected to a log file, by using the following
command:
29
T O X T R E E U S E R M A N U A L
You can check if the files bodymol.sdf and foodmol.sdf are found by Toxtree by using
the “Help► Files info” menu ( Figure 18).
Figure 18: Files information (bodymol.sdf & foodmol.sdf found and used by Toxtree)
If bodymol.sdf and/or foodmol.sdf are not found by Toxtree, the “Files information”
box reflects this, as shown on Figure 19.
Figure 19: Files information (bodymol.sdf & foodmol.sdf not found by Toxtree)
Please, note that the bodymol.sdf and foodmol.sdf files are provided mainly
as an example. They contain currently only a very limited number of
“Normal constituents of the body” and “Common components of food”
respectively, following an expert advice. Users should consider expanding
these files with appropriate molecules.
30
T O X T R E E U S E R M A N U A L
Hydrolysis/metabolic reactions
A limited number of hydrolysis (Cramer rules # 15, # 17, # 29, # 30 and # 31) and
metabolic (Cramer rule # 33) reactions are implemented, based on an expert advice.
The reactions are stored and read as CML files.
The implemented six hydrolysis reactions are shown on the following figures.
31
T O X T R E E U S E R M A N U A L
The implemented four metabolic reactions are show on the following figures.
32
T O X T R E E U S E R M A N U A L
33
T O X T R E E U S E R M A N U A L
The second rule verifies whether the LogP of a compound is within the [0,6] range.
LogP is calculated on the fly through the XlogP procedure13,14, implemented by the
CDK library. The implementation was validated by comparison with other LogP
implementations15.
The Verhaar scheme does not define structural rules for Class 4 compounds, but only
examples. The implementation includes only a limited set of examples.
• Not Corrosive
• Not Irritating
• Irritating
• Corrosive
• Irritating Or Corrosive
• Unknown
13R. Wang, Y. Fu, and L. Lai. A New Atom-Additive Method for Calculating Partition Coefficients. J. Chem.
Inf. Comput. Sci., 37:615–621, 1997.
R. Wang, Y. Gao, and L. Lai. Calculating partition coefficient by atom-additive method. Perspectives in
14
15 Uli Fechner, Kristina Grabowski, QA of the XlogP Descriptor, CDK News - The Newsletter of CDK
project, Volume 3/1, March 2004, ISBN 1614-7553, pp. 12-14, http://wiki.cubic.uni-
koeln.de/cdknews/index.php/CDKNews
34
T O X T R E E U S E R M A N U A L
Physicochemical properties
The Toxtree software calculates on the fly only LogP and Molecular weight, for the
rest it expects the values to be read from the file or manually entered by the user.
The file should contain properties with exactly the following names (column names in
CSV file of SDF properties in SDF file):
“Vapour Pressure”
“Water Solubility”
“Lipid Solubility”
“Melting Point”
“Surface Tension”
If fields with exactly these names are missing from the file, the software shows a dialog,
asking properties to be entered manually for each compound (Figure 30).
35
T O X T R E E U S E R M A N U A L
If the values are unknown, you might select the checkbox “Silent”. In this case, the
result of a silent rule asking for missing properties will always be “No” and therefore
the left branch of the tree will be followed. This effectively means that physicochemical
rules will be skipped and only structural rules will be applied. Please note that this may
result in a low quality prediction.
36
T O X T R E E U S E R M A N U A L
R41: Risk of serious damage to eyes (moderate but persistent eye lesions, eye
corrosion)
R35: Causes severe burns (skin corrosion caused by a 3-minute skin contact)
37
T O X T R E E U S E R M A N U A L
• Unknown category 11
CN CxHyOzNa
CNHal CxHyOzNaHalb
CNS CxHyOzNaSb
CHal CxHyOzHalb
38
T O X T R E E U S E R M A N U A L
a
m.p. – melting point; b l.s. – lipid solubility; c a.s. – aqueous solubility; d m.w. – molecular weight
The Toxtree software calculates on the fly only LogP and molecular weight, for the
rest it expects to be read from the file or manually entered by the user similarly to skin
irritation/corrosion rules.
If the values are unknown, you might select the checkbox “Silent”. In this case, the
physicochemical rules will be skipped and only the structural rules will be applied.
Please note that this may result in a low quality prediction.
The decision rules depend on the structural inclusion rules, given in Table 3, Table 4
and Table 5.
Table 3: Structural inclusion rules for predicting serious local lesions to the eye as implemented in Toxtree
(category 8)
39
T O X T R E E U S E R M A N U A L
R = H or
21 R N O Pyrrolidones aliphatic
chain
R
R
R=H or OH
R1=H or
R2
22 Substituted indoles aliphatic
N ketone
R
R2= any
R1
R R R R
N R N R
N R = H, NH2
N or aliphatic
23 H Substituted pyrazoles
chain
R1 R1 R1 = any
R1
a) b)
40
T O X T R E E U S E R M A N U A L
R1
+ +
NH3 N R2 R,R3=any; if
containing
R3 halogen, thio-
a) b) or sulpho-
Aromatic ammonium
24 groups then
salts
R R1 R R1 R41
R1,2 = H or
C N+ R2 C N+ R2 aliphatic
chain
R R3 R R3
c) d)
R = any
R SO 3H Organic sulphonic
25 (aliphatic or
salts
aryl group)
R R R2
N N
R S R1 R R1 R = any
S
R1 = H or -C-
a) b)
Thiazoles and any
26 R2
thiazolidines R2 = H or
N N aliphatic
chain
R1 R1
S S
c) d)
R
S S
N R
N R = aliphatic
27 R Thiazolones
chain
O O
a) b)
+ Triphenylphosphonium
28 P CH2 R R = any
salts
O R = aryl or
Organic phosphinic aliphatic
29 R P R1 acids and their chain
derivatives R1 = H or -
OH CH2-any
41
T O X T R E E U S E R M A N U A L
Table 4: Structural inclusion rules for predicting moderate reversible irritation to the eye as implemented in Toxtree
(category 9)
R1
+ +
NH3 N R2
R3 R,R3=any; if
a) b) containing
R R1
halogen, thio-
Aromatic
or sulpho
33 ammonium
C N +
R2 groups then
salts
R41
R R3 R1,2=H or
R R R1 aliphatic chain
C C N+ R2
R R R3
c) d)
42
T O X T R E E U S E R M A N U A L
Table 5: Structural inclusion rules for predicting skin corrosion as implemented in Toxtree (category 10)
C R2
O
Substituted benzoic acid Hal = Cl or F
34
halides R1-4 = any
R3
R4
O C N CH2 R
a)
35 Aliphatic iso(thio)cyanates R=aliphatic chain
S C N CH2 R
b)
R2
R1
R1-3 = any (e.g., further halogen)
36 Si Chlorosilanes
R3
Cl
H3C
O
37 H3C Si O C Mixed oxy-carboxysilanes R1,2 = any
R1
O R2
R1
R = aliphatic chain which may
38 R N Aliphatic amines contain ether functions
R2
R1,2 = H or aliphatic chain
R1
- + Alkali salts of aliphatic R1 = H or aliphatic chain
39 CH O Li
alcohols R2 = aliphatic chain
R2
43
T O X T R E E U S E R M A N U A L
All molecular descriptors for the QSARs are calculated on the fly by the software.
LogP is calculated through the XlogP procedure16,17, implemented by the CDK library.
The implementation has been validated by comparison with other LogP
implementations18. EHOMO and ELUMO are calculated by launching OpenMopac
16 R. Wang, Y. Fu, and L. Lai. A New Atom-Additive Method for Calculating Partition Coefficients. J. Chem.
R. Wang, Y. Gao, and L. Lai. Calculating partition coefficient by atom-additive method. Perspectives in
17
18 Uli Fechner, Kristina Grabowski, QA of the XlogP Descriptor, CDK News - The Newsletter of CDK
project, Volume 3/1, March 2004, ISBN 1614-7553, pp. 12-14, http://wiki.cubic.uni-
koeln.de/cdknews/index.php/CDKNews
44
T O X T R E E U S E R M A N U A L
7.119. An additional validation step was performed by comparing the results with the
original values presented by the authors of the QSARs.
In order to reproduce the values of the descriptors present in the original papers, the
following correction factors are applied to the calculated values (the corrected values
are used for predictions):
QSAR13
QSAR6
QSAR8
19 Available at http://openmopac.net/Downloads/MOPAC_7.1executable.zip
45
T O X T R E E U S E R M A N U A L
• Yes: The QSAR will be calculated for the current chemical, and the question
dialog will appear for each subsequent chemical
46
T O X T R E E U S E R M A N U A L
• No: The QSAR will not be calculated for the current chemical, and the
question dialog will appear for each subsequent chemical
• Yes to all: The QSAR will be calculated for the current chemical and for all
subsequently processed chemicals, for which it is applicable. The question
dialog will not appear anymore, unless the option is changed via the
“Method ► Decision Tree Options” menu.
• No to all: The QSAR will be not calculated for the current chemical and for
all subsequently processed chemicals, for which it is applicable. The question
dialog will not appear anymore, unless the option is changed via the
“Method ► Decision Tree Options” menu.
The rule options can also be accessed through the Rules section of the
“Method► Decision Tree Options ” menu (Figure 33).
47
T O X T R E E U S E R M A N U A L
1. Use the “Method ► Edit Decision tree ► New Decision Tree” menu option to
create a new (empty) decision tree. This will launch the decision tree editor, as
shown on Figure 34.
2. Click the “Decision rules ► New Rule” menu option to create a new rule. This
will launch the decision rule wizard, as shown on Figure 35.
48
T O X T R E E U S E R M A N U A L
3. Click on the “Aromatic” option to create a rule that will verify if the substance
is aromatic. Then click OK. The next page of the wizard will appear, as shown
on Figure 36.
49
T O X T R E E U S E R M A N U A L
4. Click on the “Rule ID” field, and type in “Q1”. This will be used as a rule
identifier. You could also modify the rule’s title, explanation and examples.
When finished, click OK. The rule will be added as a top node of the tree as
shown on Figure 37. Clicking on the rule will highlight it with orange colour
and will display decision node details on the left and decision rule details at the
bottom.
50
T O X T R E E U S E R M A N U A L
Figure 37: Decision tree editor – new rule added as a top node
5. Click on the “Categories” tab. This will show a list of categories used so far in
the tree. Click on “Default class 2” (second row), as shown on Figure 38.
51
T O X T R E E U S E R M A N U A L
6. Then click on “Categories ►E dit category” menu option on the left. This will
launch the category editor, as shown on Figure 39. Type in “Not aromatic” as
category title and similar text in the explanation field. Click OK when ready.
52
T O X T R E E U S E R M A N U A L
7. Right click on the decision node in order to invoke a popup menu, as shown
on Figure 40. These menu options allow editing the rule and modifying the left
and right branches. The left branch is followed when the answer of the rule is
“No” and the right branch is followed if the answer is “Yes”.
8. Proceed with modifying the right branch, which in the current tree will be
followed when the structure is aromatic. Click on the “Modify <YES>
53
T O X T R E E U S E R M A N U A L
branch” menu option. This will launch a wizard, as shown on Figure 41. Select
the “Rule” option and click “Next”.
9. The next wizard page shows the same options as in Figure 35. Click on the
“Heteroaromatic” option and follow the wizard’s instructions. At the end a
new node will be added to the tree, as shown on Figure 42.
54
T O X T R E E U S E R M A N U A L
Figure 42: Decision tree editor – decision tree with two nodes
10. Proceed with modifying the left branch, which in the current tree will be
followed when the structure is aromatic but not heteroaromatic. Right click on
the new node and invoke the “Modify <NO>branch” menu option. The
same wizard as in Figure 41 will appear. Select the “Category” option and click
“Next”.
55
T O X T R E E U S E R M A N U A L
11. Select the “Create new category” from the three options, shown on Figure 43,
and click “Next”. A category editor will appear, as in Figure 39. Type in
“Aromatic, but not heteroaromatic” as a title and “The structure is not
heteroaromatic” in the explanation field. Then click OK. This will change the
left branch of the second node.
12. Finally, we will modify the result that will be assigned if the structure is
heteroaromatic. Right click on the second (Heteroaromatic) node and invoke
the “Modify <YES>branch” menu option. The same wizard as in Figure 41
will appear. Select the “Category” option and click “Next”.
13. From the category wizard (Figure 43) select the first option – “Select from
categories used in this tree” and click “Next”. The next page displays the
categories used in the tree. Select “Default class 1”, as shown on Figure 44, and
follow the wizard’s instructions.
Figure 44: Decision tree editor – list of categories used in the tree
The next page will be a category editor as in Figure 39. Type in “Heteroaromatic” in the
title field. Click OK when ready. The resulting tree should look like as shown on Figure
45 (2 nodes, 3 categories).
56
T O X T R E E U S E R M A N U A L
Figure 45: Decision tree editor – the resulting tree (2 nodes, 3 categories)
14. Use the “Decision Tree ► Modify tree caption” menu option on the left. This
will ask you to enter the tree’s title. Type in “Aromatic substances” and click
OK.
15. Save the file as *.tree or *.tml file and close the decision editor. You might then
load the tree by “Method ► Select Decision Tree” menu option Figure 46.
The *.tree file is a binary file and normally .tree files are incompatible
between releases. A new XML based format *.tml was introduced, in order to
avoid incompatibility of .tree files. The *.tml file is the preferred format,
while *.tree format is likely to be abandoned in subsequent releases.
57
T O X T R E E U S E R M A N U A L
16. The decision tree will be loaded as a default decision method in Toxtree. The
classification area will reflect the categories, defined in the tree (Figure 47).
58
T O X T R E E U S E R M A N U A L
Figure 47: The new “Aromatic substances” decision tree displayed on the main Toxtree screen
1. Apply the decision tree, created in the previous section, to the default structure
(hexane) by clicking on the “Estimate” button. This applies the decision tree to
the current structure. The result says that the compound is not aromatic, as
shown on Figure 48. The result is also assigned as a property of the compound
and will be saved along with other properties when using the “File► Save”
menu option.
59
T O X T R E E U S E R M A N U A L
60
T O X T R E E U S E R M A N U A L
3. Click again on the “Estimate” button. This will apply the decision tree to the
new structure. The result is shown on Figure 50.
61
T O X T R E E U S E R M A N U A L
Figure 50: The result of applying the new tree to the new structure
62