Daniel Lowe

28 posts

Daniel Lowe

Daniel Lowe

@dan2097

Katılım Ocak 2010
3 Takip Edilen27 Takipçiler
Daniel Lowe
Daniel Lowe@dan2097·
@cdsouthan No, there is not. I think the most common cause of "Too many radicals for esters" would be erroneous additional whitespace
English
0
0
0
17
Christopher Southan
Christopher Southan@cdsouthan·
. @dan2097 (or anyone) Is there an index of #opsin IUPAC failure codes with possible fixes? I can correct obvious typos/whitespace issues but things like "locant 7 unsaturated" or "Too many radicals for esters" not obvious to fix
English
1
0
0
118
Daniel Lowe
Daniel Lowe@dan2097·
@iChemLabs Incidentally (Optical) Chemical Structure Recognition, abbreviated OCSR or CSR, is another common way of describing this without calling the process "OCR"
English
0
0
1
0
Daniel Lowe
Daniel Lowe@dan2097·
@houndcl In a real-world setting you could potentially also get improvements by processing all images from a given document and preferring the interpretation that gave more groups in common with the other images, as often a series of related compounds is synthesized
English
0
0
0
0
Daniel Lowe
Daniel Lowe@dan2097·
@houndcl Many compounds will be in PubChem, so there are real-world benefits, but I still wonder whether a fragment-based approach could be more general e.g. for documents with novel compounds; likely no advantage for the Kaggle comp, but probably within their new restrictions.
English
1
0
0
0
Daniel Lowe
Daniel Lowe@dan2097·
@jwmay @uspto @ReedTechLifeSci They appear to now have switched their naming scheme of _oldest.tar _old.tar, no suffix to no suffix, _r1.tar and _r2.tar. In the case I looked at the re-released files had a supplementary information zip that was missing from the original
English
0
0
2
0
Daniel Lowe
Daniel Lowe@dan2097·
@ChemConnector @markussitzmann @baoilleach I'm sure @baoilleach will correct me if I'm wrong, but I think LeadMine's current code for NMR extraction for the most part comes from that project. It's unfortunate that the next step with MestreLab, to do the NMR prediction to verify the spectra, didn't come to fruition.
English
0
0
0
0
ChemConnector
ChemConnector@ChemConnector·
@markussitzmann @baoilleach Your hypothesis may be proven correct, or incorrect, don't know until the work is done. ANd that's exactly the work I wanted to do but we never made progress. However, I STILL want to do it if there is interest.
English
1
0
0
0
Noel O'Boyle (@baoilleach@mstdn.science)
Just been counting up the number of molecule/NMR spectrum pairs LeadMine extracts from US patent applications: ~500K. Entries like: 1H NMR (500 MHz, DMSO-d6) δ 8.94 (d, J = 7.3 Hz, 1H), 8.39 (s, 1H), 8.32-8.23 (m, 2H), 7.73 (t, J = 7.9 Hz, 1H), 7.10 (dd, J = 7.3, 5.2 Hz, 1H),...
English
1
2
5
0
Daniel Lowe
Daniel Lowe@dan2097·
@egonwillighagen @baoilleach Are you eluding to the Experimental Data Checker which tried to check whether a textual spectrum was plausible by comparison with the structure?
English
0
0
1
0
Daniel Lowe
Daniel Lowe@dan2097·
@cdsouthan The issue is actually the forward slash, quite a few web servers (including this one) don't correct handle encoded slashes, ostensibly for security reasons. I should probably send requests in a different way to workaround this issue.
English
0
0
0
0
Daniel Lowe
Daniel Lowe@dan2097·
@baoilleach You can also encounter mentions of Tyr(O-Me), which possibly makes the reason for the O clearer, I think it's to indicate which atom on the tyrosine is the substitution point, but given the normal meaning of an O in a formula this just ends up being even more unclear
English
0
0
1
0
Noel O'Boyle (@baoilleach@mstdn.science)
Peptide scientists, please decide: is it Phe(4-OMe), Tyr(Me) or my favourite, Tyr(OMe)? Fortunately, Tyr(Me) appears to be much preferred in PubMed Abstracts. This follows the rule of minimising the substitutent size.
English
1
0
0
0
Daniel Lowe
Daniel Lowe@dan2097·
@rmcgibbo @baoilleach If there are specific cases where ChemAxon works, but OPSIN doesn't I'm happy to look into them.
English
0
0
1
0
Daniel Lowe
Daniel Lowe@dan2097·
@i_vishalll @cdsouthan Apologies for the issue, OPSIN was moved to a new server earlier this week that was not configured quite right for JNI-InChI. This is now resolved.
English
1
0
0
0
Daniel Lowe
Daniel Lowe@dan2097·
@eawRDM InChI is the answer to the question...but if I wanted to use the chemical structure I would also want a format that more precisely captured the structure. As @baoilleach commented, the InChI/InChIKey can be generated precisely from the SMILES, but not vice versa.
English
1
0
0
0
Daniel Lowe
Daniel Lowe@dan2097·
@cdsouthan Will try to fix as it's not even an experimental section! Tricky in general as you don't want want to find fragments of name
English
1
0
0
0