mastodon.world is one of the many independent Mastodon servers you can use to participate in the fediverse.
Generic Mastodon server for anyone to use.

Server stats:

8.1K
active users

#Cheminformatics

0 posts0 participants0 posts today
Charles Tapley Hoyt<p>currently preparing slides to present at the sixth <span class="h-card" translate="no"><a href="https://nfdi.social/@NFDI4Chem" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>NFDI4Chem</span></a></span> consortium meeting on using the <span class="h-card" translate="no"><a href="https://hachyderm.io/@bioregistry" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>bioregistry</span></a></span> for finding ontologies, standardizing cross-references, and supporting RDM</p><p>also thinking about how to automate generating slides that summarize the state of the database for future talks and reminded of <a href="https://xkcd.com/1205" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">xkcd.com/1205</span><span class="invisible"></span></a></p><p><a href="https://scholar.social/tags/bioinformatics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>bioinformatics</span></a> <a href="https://scholar.social/tags/cheminformatics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>cheminformatics</span></a> <a href="https://scholar.social/tags/semanticweb" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>semanticweb</span></a> <a href="https://scholar.social/tags/linkeddata" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>linkeddata</span></a> <a href="https://scholar.social/tags/FAIR" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FAIR</span></a> <a href="https://scholar.social/tags/rdm" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>rdm</span></a> <a href="https://scholar.social/tags/fdm" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>fdm</span></a> <a href="https://scholar.social/tags/openscience" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>openscience</span></a></p>
Andrew Dalke<p>Any idea how to get the (<a href="https://toots.nu/tags/cheminformatics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>cheminformatics</span></a>) structures for a <a href="https://toots.nu/tags/ChEBI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ChEBI</span></a> ontology term? Like, how do I get all the terpenoids, CHEBI:26873?</p><p><a href="https://www.ebi.ac.uk/ols4/ontologies/chebi/classes/http%253A%252F%252Fpurl.obolibrary.org%252Fobo%252FCHEBI_26873?lang=en" rel="nofollow noopener" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">ebi.ac.uk/ols4/ontologies/cheb</span><span class="invisible">i/classes/http%253A%252F%252Fpurl.obolibrary.org%252Fobo%252FCHEBI_26873?lang=en</span></a> says there are 11,119 terpenoids but I see no way to download them.</p><p>Trying the web API, <a href="https://www.ebi.ac.uk/chebi/backend/api/public/ontology/all_children_in_path/?relation=has_functional_parent&amp;entity=CHEBI%3A26873&amp;three_star_only=false&amp;page=1&amp;size=15&amp;download=false" rel="nofollow noopener" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">ebi.ac.uk/chebi/backend/api/pu</span><span class="invisible">blic/ontology/all_children_in_path/?relation=has_functional_parent&amp;entity=CHEBI%3A26873&amp;three_star_only=false&amp;page=1&amp;size=15&amp;download=false</span></a> says there are 547 (482 if three_star_only=true).</p><p>With 'download=true' I get the data in full (unpaged) data in tsv format, with the same counts.</p><p>What am I missing?</p>
Andrew Dalke<p>Given two <a href="https://toots.nu/tags/cheminformatics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>cheminformatics</span></a> count vectors A={aᵢ} and B={bᵢ}, what is Tanimoto(A, B)? Followup with your preferred reference/literature citation if you have one.</p><p>Also, if you prefer another name for either of the two equations, what is it?</p>
Charles Tapley Hoyt<p>I used chembl-downloader to create some nice charts on how the number of compounds, assays, activities, and other entities in ChEMBL have grown over time</p><p>📖 <a href="https://cthoyt.com/2025/08/26/chembl-history.html" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">cthoyt.com/2025/08/26/chembl-h</span><span class="invisible">istory.html</span></a></p><p><a href="https://scholar.social/tags/chembl" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>chembl</span></a> <a href="https://scholar.social/tags/chemistry" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>chemistry</span></a> <a href="https://scholar.social/tags/chemometrics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>chemometrics</span></a> <a href="https://scholar.social/tags/chemoinformatics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>chemoinformatics</span></a> <a href="https://scholar.social/tags/cheminformatics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>cheminformatics</span></a> <a href="https://scholar.social/tags/rdkit" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>rdkit</span></a> <a href="https://scholar.social/tags/cdk" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>cdk</span></a> <a href="https://scholar.social/tags/proteochemometrics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>proteochemometrics</span></a></p>
Andrew Dalke<p>chemfp 5.0b2 is out. Get it while it's hot! For Linux:</p><p>python -m pip install chemfp==5.0b2 -i <a href="https://chemfp.com/packages/" rel="nofollow noopener" target="_blank"><span class="invisible">https://</span><span class="">chemfp.com/packages/</span><span class="invisible"></span></a></p><p>I'm still updating the documentation. See 'What's new in 5.0' at <a href="https://chemfp.com/docs/whats_new_in_50.html" rel="nofollow noopener" target="_blank"><span class="invisible">https://</span><span class="ellipsis">chemfp.com/docs/whats_new_in_5</span><span class="invisible">0.html</span></a></p><p>* shardsearch - search many target files</p><p>* simhistogram - histogram all the scores</p><p>* FPB file now handles 1B+ records</p><p>* sparse count fingerprints<br> - new FPC format<br> - rdkit2fpc to make them with <a href="https://toots.nu/tags/RDKit" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>RDKit</span></a><br> - fpc2fps to convert to binary fps<br> - fps2fpc for the other way</p><p><a href="https://toots.nu/tags/cheminformatics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>cheminformatics</span></a></p>
Andrew Dalke<p>It's official - the upcoming chemfp 5.0 release will have limited support sparse count <a href="https://toots.nu/tags/cheminformatics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>cheminformatics</span></a> fingerprints, in addition to the normal binary fingerprints.</p><p>The new format is "FPC", a variant of the FPS format. Details at <a href="https://chemfp.com/fpc_format/" rel="nofollow noopener" target="_blank"><span class="invisible">https://</span><span class="">chemfp.com/fpc_format/</span><span class="invisible"></span></a>.</p><p>There will also be "rdkit2fpc" for the four <a href="https://toots.nu/tags/RDKit" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>RDKit</span></a> count fingerprint generators.</p><p>Plus "fpc2fps" with several methods to convert sparse count features -&gt; binary.</p><p>And "fps2fpc" for the reverse (it's just a list of on-bit indices.)</p>
Egon Willighagen<p>porting more content of the old Groovy <a href="https://social.edu.nl/tags/Cheminformatics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Cheminformatics</span></a> with the <a href="https://social.edu.nl/tags/Chemistry" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Chemistry</span></a> Development Kit book to Markdown, here about substructure searching: <a href="https://cdk.github.io/cdkbook/substructure.html" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">cdk.github.io/cdkbook/substruc</span><span class="invisible">ture.html</span></a></p>
Egon Willighagen<p>Fwd: "Unleashing 4 million IUPAC names into the wild" <a href="https://chembl.blogspot.com/2025/08/unleashing-4-million-iupac-names-into.html" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">chembl.blogspot.com/2025/08/un</span><span class="invisible">leashing-4-million-iupac-names-into.html</span></a></p><p>"In support of Egon Willighagen's 'One Million IUPAC Names' project, we have just released more than 4 million IUPAC names text-mined from patents. Here are the details as listed on Zenodo:"</p><p><a href="https://social.edu.nl/tags/cheminformatics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>cheminformatics</span></a> <a href="https://social.edu.nl/tags/chemistry" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>chemistry</span></a> <a href="https://social.edu.nl/tags/iupac" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>iupac</span></a> <a href="https://social.edu.nl/tags/openscience" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>openscience</span></a></p>
Blue Obelisk<p>Oscar4 5.3.0 has been released: <a href="https://github.com/BlueObelisk/oscar4/releases/tag/5.3.0" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">github.com/BlueObelisk/oscar4/</span><span class="invisible">releases/tag/5.3.0</span></a></p><p>"OSCAR (Open Source Chemistry Analysis Routines) is an open source extensible system for the automated annotation of chemistry in scientific articles. It can be used to identify chemical names, reaction names, ontology terms, enzymes and chemical prefixes and adjectives, and chemical data such as state, yield, IR, NMR and mass spectra and elemental analyses."</p><p><a href="https://fosstodon.org/tags/cheminformatics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>cheminformatics</span></a> <a href="https://fosstodon.org/tags/chemistry" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>chemistry</span></a></p>
Noel O'Boyle<p>Blog post on "A new job, a postdoc opportunity, an open biological curator role, and a user group meeting " </p><p><a href="https://baoilleach.blogspot.com/2025/07/a-new-job-postdoc-opportunity-open.html" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">baoilleach.blogspot.com/2025/0</span><span class="invisible">7/a-new-job-postdoc-opportunity-open.html</span></a></p><p><a href="https://mstdn.science/tags/cheminformatics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>cheminformatics</span></a> <a href="https://mstdn.science/tags/chemjobs" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>chemjobs</span></a> <a href="https://mstdn.science/tags/chempostdocs" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>chempostdocs</span></a></p>
Egon Willigh☮gen 🟥<p>the preprint argues that a flood of mediocre articles is (partly) caused by open data making it easy to do all sorts of analyses of these.</p><p>It seems to me that the observation is correct. I have had this impression in <a href="https://mastodon.social/tags/cheminformatics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>cheminformatics</span></a> too, and, indeed, running an analysis on a data set like Tox21, ChEMBL, etc, had been getting increasingly easy.</p><p>But I do not think that <a href="https://mastodon.social/tags/openscience" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>openscience</span></a> and <a href="https://mastodon.social/tags/FAIR" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FAIR</span></a> are the problem, nor restricting access the solution.</p><p>I have some ideas about the underlying problem:</p>
Egon Willighagen<p>as Marshall College Professor Jones could have said: "This belongs in a database!" <a href="https://doi.org/10.1021/acsomega.4c10413" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">doi.org/10.1021/acsomega.4c104</span><span class="invisible">13</span></a></p><p>Well, the first pKa column and all arsonic acids are now in <span class="h-card" translate="no"><a href="https://wikis.world/@wikidata" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>wikidata</span></a></span> :)</p><p><a href="https://social.edu.nl/tags/openscience" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>openscience</span></a> <a href="https://social.edu.nl/tags/chemistry" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>chemistry</span></a> <a href="https://social.edu.nl/tags/cheminformatics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>cheminformatics</span></a></p>
The BridgeDb Project<p>new BridgeDb Datasources release: <a href="https://github.com/bridgedb/datasources/releases/tag/20270728" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">github.com/bridgedb/datasource</span><span class="invisible">s/releases/tag/20270728</span></a></p><p>"Uses UniProtKB as name, removes EcoGene, and multiple small updates"</p><p>BridgeDb Datasources is a dataset with metadata about data sources and organisms used by BridgeDb Java and downstream tools like <span class="h-card" translate="no"><a href="https://fosstodon.org/@wikipathways" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>wikipathways</span></a></span>, PathVisio, and others</p><p><a href="https://fosstodon.org/tags/bioinformatics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>bioinformatics</span></a> <a href="https://fosstodon.org/tags/cheminformatics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>cheminformatics</span></a></p>
Egon Willighagen<p>chemical compound identifiers in <span class="h-card" translate="no"><a href="https://wikis.world/@wikidata" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>wikidata</span></a></span>: <a href="https://edu.nl/xb8y7" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">edu.nl/xb8y7</span><span class="invisible"></span></a></p><p><a href="https://social.edu.nl/tags/chemistry" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>chemistry</span></a> <a href="https://social.edu.nl/tags/wikidata" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>wikidata</span></a> <a href="https://social.edu.nl/tags/cheminformatics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>cheminformatics</span></a></p>
Charles Tapley Hoyt<p>Most cheminformatics code that queries ChEMBL struggles with reproducibility.</p><p>chembl-downloader can help:</p><p>&gt;&gt;&gt; import chembl_downloader as cd<br>&gt;&gt;&gt; df = cd.query("""<br> SELECT chembl_id, pref_name<br> FROM molecule_dictionary<br> WHERE pref_name IS NOT NULL<br>""")</p><p>It's even sneaking its way into <span class="h-card" translate="no"><a href="https://masto.ai/@wpwalters" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>wpwalters</span></a></span> and <span class="h-card" translate="no"><a href="https://mastodon.social/@dr_greg_landrum" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>dr_greg_landrum</span></a></span> blogs :)</p><p>Code/Docs: <a href="https://github.com/cthoyt/chembl-downloader" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">github.com/cthoyt/chembl-downl</span><span class="invisible">oader</span></a></p><p>Preprint: <a href="https://arxiv.org/pdf/2507.17783" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">arxiv.org/pdf/2507.17783</span><span class="invisible"></span></a></p><p><a href="https://scholar.social/tags/cheminformatics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>cheminformatics</span></a> <a href="https://scholar.social/tags/chemoinformatics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>chemoinformatics</span></a> <a href="https://scholar.social/tags/chembl" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>chembl</span></a> <a href="https://scholar.social/tags/reproducibility" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>reproducibility</span></a> <a href="https://scholar.social/tags/chemistry" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>chemistry</span></a> <a href="https://scholar.social/tags/openscience" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>openscience</span></a></p>
Kohulan Rajan<p>New Preprint Alert!</p><p>We're excited to share our latest work on <a href="https://mastodon.social/tags/ChemRxiv" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ChemRxiv</span></a>! MARCUS (Molecular Annotation and Recognition for Curating Unravelled Structures) is a web-based platform for extracting chemical information from scientific papers.</p><p>📄 Preprint: <a href="https://doi.org/10.26434/chemrxiv-2025-9p1q1" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">doi.org/10.26434/chemrxiv-2025</span><span class="invisible">-9p1q1</span></a></p><p>🔗 Try it out: <a href="https://marcus.decimer.ai" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">marcus.decimer.ai</span><span class="invisible"></span></a></p><p><a href="https://mastodon.social/tags/Cheminformatics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Cheminformatics</span></a> <a href="https://mastodon.social/tags/OpenScience" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenScience</span></a> <a href="https://mastodon.social/tags/ChemicalDatabases" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ChemicalDatabases</span></a> <a href="https://mastodon.social/tags/AIinScience" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AIinScience</span></a> <a href="https://mastodon.social/tags/ScientificSoftware" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ScientificSoftware</span></a> <a href="https://mastodon.social/tags/ResearchTools" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ResearchTools</span></a></p>
Egon Willighagen<p>ha, I managed to convert a CXSMILES in @wikidata via the MDL V2000 molfile and the <a href="https://social.edu.nl/tags/inchi" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>inchi</span></a> webtool into an <a href="https://social.edu.nl/tags/InChI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>InChI</span></a> (!B) and InChIKey :) <a href="https://www.wikidata.org/wiki/Q66421202#P117" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">wikidata.org/wiki/Q66421202#P1</span><span class="invisible">17</span></a></p><p><a href="https://social.edu.nl/tags/cheminformatics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>cheminformatics</span></a> <a href="https://social.edu.nl/tags/polymers" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>polymers</span></a></p>
Egon Willighagen<p>heading to the 2nd day of the Technical InChI Meeting in Aachen/Germany</p><p><a href="https://social.edu.nl/tags/cheminformatics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>cheminformatics</span></a> <a href="https://social.edu.nl/tags/inchi" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>inchi</span></a></p>
Egon Willighagen<p>heading tomorrow to the InChI Technical Exchange Meeting Summer 2025 in Aachen/DE</p><p>Looking forward to it, and particularly talking about the InChI for inorganics and trying that in <span class="h-card" translate="no"><a href="https://wikis.world/@wikidata" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>wikidata</span></a></span> :) See <a href="https://doi.org/10.26434/chemrxiv-2025-53n0w" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">doi.org/10.26434/chemrxiv-2025</span><span class="invisible">-53n0w</span></a></p><p>And also the nano InChI, see <a href="https://doi.org/10.3390/nano10122493" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">doi.org/10.3390/nano10122493</span><span class="invisible"></span></a></p><p><a href="https://social.edu.nl/tags/nanoinformatics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>nanoinformatics</span></a> <a href="https://social.edu.nl/tags/cheminformatics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>cheminformatics</span></a> <a href="https://social.edu.nl/tags/inchi" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>inchi</span></a> <a href="https://social.edu.nl/tags/wikidata" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>wikidata</span></a> <a href="https://social.edu.nl/tags/chemistry" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>chemistry</span></a></p>
Egon Willighagen<p>updating some java <a href="https://social.edu.nl/tags/cheminformatics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>cheminformatics</span></a> libraries... Euclid and CMLXOM, and possibly making a Bacting release... but need to continue working on the CDK Depiction book chapter too...</p>