ProtoPRED®

Q: What is a QSAR model?

A: A Quantitative Structure-Activity Relationship (QSAR) model is a predictive model that relates the structure of a chemical (encoded in a series of numerical descriptors) to one of its properties. QSAR models use pre-existing experimental data, analysed through machine-learning algorithms, to establish this relationship. These models are statistically validated with separate training and validation datasets to ensure predictability.

Q: Where to learn more about QSAR?

A: There is extensive literature on the technique. More information can be found on our website. Additionally, in-house courses on developing and applying QSAR and other computational techniques can be found at ChemoPredictionAcademy.

Q: Are QSAR models accepted for regulatory purposes?

A: Yes. Most regulatory frameworks not only accept QSAR models but also encourage their use as an alternative to animal-based tests.

Q: Can I predict any substance in ProtoPRED^®?

A: No, except for ProtoNANO, the models are developed to work with organic discrete chemicals. Thus, any chemical containing metals or other elements not common in organic molecules is not accepted. Additionally, mixtures and salts are not directly usable in ProtoPRED^®. In such cases, the user can predict the properties separately for individual organic ions or components.

Q: There is a QSAR model in two different modules. Are they different?

A: Some QSAR models are available across multiple modules, as they fall under different classifications. For example, the boiling point is a REACH requirement, but also a physico-chemical property; therefore, the corresponding model is included in both the ProtoREACH and ProtoPHYSCHEM modules. The QSAR model is the same, so the results are the same, but some information included in the PDF reports may be slightly different, as those are tailored for the particular application and/or include combined reports which consider additional information.

Cost / Licenses

Q: Is ProtoPRED^® free of charge?

A: ProtoPRED^® is a professional platform with commercial models. However, registration is completely free of charge. Once registered, users receive an initial number of free tokens, which allows access to all platform features.

Q: What are the tokens?

A: ProtoPRED^® tokens are the units used by the server to manage the credit. Each time a prediction is made or a PDF report is generated, a specific number of tokens is deducted from the user's account.

Q: What happens when the free tokens are used up?

A: Once the free tokens are used up, additional token packages can be purchased directly from the online store. Simply visit our website and select the package that best suits your needs. For academic use or higher-volume prediction requirements, please contact us at bd@protoqsar.com.

Data privacy

Q: Which information does ProtoPRED^® store?

A: ProtoPRED^® stores information of the user in 4 different forms:

Personal information provided during the registration. This information is stored by ProtoQSAR SL, but it is not used for any other purpose than provide the service (unless the permission is explicitly given during the registration process).
Historic of the movements. All the movements done in the server that affect the number of tokens (such as predictions, generation of reports, credit purchases) can be consulted in the user page to ensure proper record of the use of the server. By default, this historic includes data such as the SMILES and the predicted value to facilitate user revision, but this can be changed in the user section.
Last prediction. Complete results of the last prediction of each module are also stored internally, to allow the user to recover the data in case of accidental closure, connectivity issues or any other interruption. This can also be changed in the user section to ensure data privacy.
QPRF data. To simplify the filling of the author data section in the QPRF (obligatory), default values are offered for the user name, address, phone and email. Those fields can be edited in the user page. For each PDF request, the current values can be stored as new defaults.

Q: What can be done if a user does not want ProtoPRED^® to store any information related to their molecules?

A: In the user page, there is a Reports data section to modify the QPRF author data. This form includes a check box for Data Privacy. If this is selected, all the information about the results will be removed, except for the model used and the date. Note that the data is really erased and cannot be recovered. This also will deactivate the option to recover the results of the last prediction for each module.

Q: Can personal data be modified or eliminated?

A: Yes. For queries about personal data, ProtoQSAR data representatives can be contacted at gdpr@protoqsar.com.

Reporting of results

Q: How are the results of a model reported in ProtoPRED^®?

A: ProtoPRED^® produces four different types of output in the results page: a summary table with all the predictions performed; a model-specific table with additional details for each different model; a downloadable Excel spreadsheet (with more information such as the values in the internal model units); and downloadable standardised PDF reports (this could have an additional cost) such as QPRFs or specific combined reports, such as in ProtoICH and GenoITS.

Q: Why is a predicted value N/A?

A: ProtoPRED^® uses calculated molecular descriptors to characterise your substance. In some cases, these descriptors cannot be calculated due to numerical issues or incompatibilities between the structure and the descriptors. When this occurs, ProtoPRED^® uses imputation to estimate the values from our database. However, this is only permitted for a limited number of descriptors. If an excessive number of descriptors require imputation, the prediction will be considered unreliable and will not be provided. In such cases, neither the prediction nor the QPRF would be charged.

Q: What is the Applicability Domain?

A: QSAR models are trained using a database of molecules with experimental values, so their validity is restricted to the chemical space defined by this dataset. ProtoPRED^® uses four different techniques to evaluate if a target molecule falls inside the chemical space of a model. Although a prediction will be provided, those outside the applicability domain are considered unreliable. In such cases, neither the prediction nor the QPRF would be charged.

Q: How is the reliability score calculated?

A: Assessing the reliability of a calculation implies factors such as the overall predictivity of the model, the relationship of the target molecule with the training set, or the behaviour of the model with similar substances. An explanation of the parameters used and their values are provided in the QPRF. Note that this score is presented as a guidance and does not substitute a tailored expert assessment considering the details of each factor included in the QPRF.

Q: The QPRF includes information of analogues. Are they used for the prediction?

A: No, analogues do not contribute directly to the prediction. However, they do influence the reliability score of the prediction.

Q: Can the QPRF be modified?

A: Content for various sections of the QPRF can be provided, such as those regarding personal data, the identity of the substance, and additional comments in different sections. However, it is not possible to modify the existing content directly.

Support/Technical issues

Q: What should be done if a calculation does not finish?

A: Retrying the operation may solve the issue. If it persists, please contact the support team at protopred@protoqsar.com.

Q: The screen has been closed or the connection has been lost before saving the results. Can they be recovered?

A: Yes. Using the Retrieve the results from the last calculation button of each module, the results can be recovered (located above the INPUT MOLECULE(S) section). Note that this option will not work if the Data privacy option is selected in the reports data section of the user page.

Q: What should be done if purchased tokens are not added to the account?

A: The tokens should be added to the account name provided in the purchase form within 24 hours. A confirmation email should be received once the account has been updated. If the tokens do not appear, contact protopred@protoqsar.com with a copy of the purchase confirmation.

ProtoICH

Q: Why does ProtoICH apply 3 different models?

A: The ICH M7 guideline requires an integrated computational approach, where a statistical and an expert-rule method are combined. Additionally, a profiler is applied to detect the species considered part of the carcinogenicity cohort of concern, including aflatoxins, N-nitrosamines and azoxy compounds.

ProtoREACH

Q: Can endpoints be selected according to the REACH annex?

A: ProtoPRED^® includes buttons to automatically select the endpoints required for each of the Annexes in REACH, based on the substance's tonnage. However, this serves as general guidance, as the exact requirements may vary depending on existing data for these parameters or other properties.

GenoITS

Q: What is an ITS?

A: An Integrated Testing Strategy (ITS) is a substance assessment approach that involves the selection and prioritisation of assays. This strategy aims to reduce or replace unnecessary experiments by making use of all available information during the assessment process.

Q: Does GenoITS provide a prediction of a compound's genotoxicity?

A: No. GenoITS does not provide a prediction of genotoxicity, but rather a conclusion based on the results of the five studies required under REACH: in bacteria mutagenicity, in vitro and in vivo mutagenicity, and in vitro and in vivo cytogenicity. This information may be supplied by the user or derived from computational models, such as the QSAR models available through the ProtoPRED® platform.

Q: Do the results generated by the GenoITS module have regulatory validity?

A: Yes. The QPRF is the format recommended under REACH for reporting QSAR predictions, and it carries regulatory validity. GenoITS generates QPRFs for all models used within the module. Additionally, GenoITS provides a supplementary 'GenoITS report', which does not hold regulatory validity on its own.

Q: Should the REACH annex be specified?

A: Yes. The genotoxicity ITS recommended under REACH is annex-dependent; therefore, specifying the annex is mandatory in order to run the GenoITS module and obtain a conclusion.

Q: The GenoITS module has been executed, but no conclusion has been reached. Why might this occur?

A: In some cases, the compound under evaluation may fall outside the applicability domain of one or more of the models used, preventing a conclusion from being reached. In such instances, the report indicates which alternative study should be conducted by the user, along with the potential final conclusion depending on the outcome of that study.

Q: How many tokens are required to evaluate a compound?

A: The cost for performing the genotoxicity assessment of a compound is fixed at 20 tokens, regardless of the REACH annex selected. Additionally, the results and QPRFs for all five models included in the module are provided, irrespective of whether they are used to reach the final conclusion.