Imperial College London : Machine learning techniques from Imperial and BASF advance experimental design

November 01, 2022 at 06:14 am EDT

Imperial and chemical company BASF will reveal new techniques for optimising experimental design at leading machine learning conference NeurIPS.

Three papers outlining new machine learning techniques that address important needs in the chemical industry have been judged ground-breaking enough to win acceptance at the NeurIPS conference, one of the most competitive international venues for research in machine learning.

At BASF we see digitalisation as key for strengthening our role as a leader in R&D in the chemical industry, and addressing the industry's pressing needs for sustainability and resilience.

The techniques, developed as part a wide-ranging partnership between Imperial and BASF, are designed to help research and development (R&D) scientists in chemistry and other fields improve industrial processes with minimal trial and error by predicting which experiments will return the most useful results. They could also help automate the R&D process. These advances are expected to help accelerate the development of innovative new chemical products and more efficient and sustainable methods of manufacturing.

Professor Ruth Misener, BASF/RAEng Research Chair in Data-Driven Optimization in Imperial's Department of Computing, said: 'Our research with BASF is helping digitally transform the chemical industries by adapting cutting-edge experimental design techniques to sector-specific requirements. This will help create better and more sustainable chemistry. The research is also making important theoretical advances in machine learning. The acceptance into NeurIPS is a marker of the recognition our work is receiving from the wider machine learning community.'

Dr Christian Holtze, Academic Partnership Developer and Senior Research Engineer at BASF, said: 'At BASF we see digitalisation as key for strengthening our role as a leader in R&D in the chemical industry, and addressing the industry's pressing needs for sustainability and resilience. We have long valued Imperial's cutting-edge expertise in this area. An outstanding example is our work with Professor Ruth Misener and colleagues, who share our vision of developing disruptive digital capabilities for tomorrow's chemical industry.'

Detlef Kratz and Ian Walmsley signing an agreement

Earlier this year Dr Detlef Kratz (President, Group Research at BASF) and Professor Ian Walmsley (Imperial's Provost) signed a new framework agreement, marking a new phase in the strategic partnership. Photo: Fergus Burnett

Optimising R&D

Industrial chemists routinely conduct experiments to develop high performing products such as coatings and batteries, maximise the purity of the chemicals, and minimise the material and energy costs of producing them. This requires testing numerous combinations of ingredients and reactor settings such as flow rates and temperatures.

Since the process of experimentation is itself costly, chemical companies aim to design these experiments to do the best possible job of optimising a manufacturing process in a finite number of experimental iterations.

While motivated by practical needs in chemistry, the research has translated these into mathematical techniques with general applications in machine learning.

With this in mind, it does not make sense to decide in advance which parameter values to test in each iteration of an experiment. Instead, chemists typically carry out a handful of initial iterations with differing values and use the results to form a rough prediction about which settings - for example, which temperatures - will provide the best performance. Further iterations are then designed on-the-fly to progressively improve on the precision and accuracy of the first prediction.

A more sophisticated version of this iterative process relies not on human intuition but on optimisation algorithms. In Bayesian optimisation, an algorithm combines experimental data with a statistical background assumption known as a Gaussian process to estimate the mathematical function that links the experimental variables to the manufacturing performance. This estimate, which starts out highly approximate, is not represented as a certainty but as a probability distribution over a range of possible functions.

The algorithm's aim is to find the settings that yield the best manufacturing performance, and the probability distribution helps it do this by enabling the algorithm to predict which experimental settings have the greatest chance of yielding better performances. Experimental scientists in a range of academic and industrial fields are increasingly using machine learning algorithms that employ this method.

'The algorithms are often superior to human intuition as it's really hard for people to see what's going on when there are many variables,' says Dr Robert M Lee, a machine learning specialist at BASF who carried out the research with colleagues at Imperial and BASF colleagues Dr Behrang Shafei and Dr David Walz.

'The other advantage they offer is automation. In most cases we still have a human in the loop, but we have a couple of cases where we can close that loop, meaning you can hit go, focus your attention on something else, and come back to some good results.'

Chemistry advances

While optimisation of experimental design is already a successful field, industrial chemistry is still grappling with practical constraints that are hard for optimisation algorithms to account for. These include:

-- The cost of changing physical variables - for example temperature, which is easier to change a little than a lot.

The fact that some types of experimental data come back faster than others and chemists need to make experimental design decisions before receiving all of them, known as asynchronous batch.

Multiple objectives, for example the need to minimise the cost of manufacturing chemicals while also maximising quality and sustainability.

The combination of continuous variables like temperature with categorical variables like on and off.

Multi-fidelity, or the fact that some data sources are more trustworthy than others.

Input constraints, such as facts known independent of experimentation, for example that chemical components expressed as percentages must add up to 100%.

A lot of the notoriously successful applications of machine learning are cases where you can just throw as much data as you want at the algorithms to make them work - but this is not the case in all fields.

Alexander Thebelt

The new techniques developed by researchers from Imperial and BASF accommodate many of these constraints. One paper provides an optimisation technique that compromises between multiple objectives, while another accounts for input constraints and varied types of variables. A third accounts for the costs of changing variables, multi-fidelity and asynchronous batch.

The papers were authored by PhD students and academics in Imperial's Departments of Computing and Mathematics with R&D scientists at BASF. The research was funded by BASF and the Engineering and Physical Sciences Research Council via the Centre for Doctoral Training in Statistics and Machine Learning.

New frontiers in machine learning

Professor Ruth Misener in the Department of Computing researches solutions to optimisation challenges in industry. She is the BASF/RAEng Research Chair in Data-Driven Optimisation.

While motivated by practical needs in chemistry, the research has translated these into mathematical techniques with general applications in machine learning and the potential to make a large impact outside the chemical industries themselves.

Mr Alexander Thebelt, a PhD student in Computing and lead author of one paper said: 'The machine learning community is getting more and more interested in these kinds of problems. A lot of the applications that are notoriously successful in machine learning are cases where you have loads of data, and you can just throw as much data as you want at the algorithms to make them work. This is not the case in all fields. If we can use our techniques to discover new materials that could have a huge impact on humanity and industry. It could be as big as any of the successes we've seen in machine learning so far.'

Dr Lee said it is not unusual for the application of problems from industrial chemistry to open up new theoretical approaches in machine learning research. 'People from machine learning traditionally think of data as either in a table, or maybe a picture or text. As soon as you come and say, my inputs are not these things, they're actually molecules, then people say, oh, interesting, I can represent a molecule as a graph with nodes and edges and do all sorts of clever stuff.'

'Our partnership with Ruth [Misener] is extremely valuable because her group has one foot in chemical engineering and one in AI, which is essential in this kind of work. It's refreshing for us to collaborate with academia; it gives you a different way of looking at things. It's stuff we couldn't develop in-house.'

Jose Pablo Folch, Shiqiang Zhang, Robert M. Lee, Behrang Shafei, David Walz, Calvin Tsay, Mark van der Wilk, and Ruth Misener. (2022) 'SnAKe: Bayesian Optimization with Pathwise Exploration.'

Alexander Thebelt, Calvin Tsay, Robert M. Lee, Nathan Sudermann-Merx, David Walz, Behrang Shafei, and Ruth Misener. (2022) 'Tree ensemble kernels for Bayesian optimization with known constraints over mixed-feature spaces.'

Ben Tu, Axel Gandy, Nikolas Kantas, Behrang Shafei (2022) 'Joint Entropy Search for Multi-Objective Bayesian Optimization'

	1st Jan change	Capi.
BASF SE	+4.53%	48.95B
SHIN-ETSU CHEMICAL CO., LTD.	+6.61%	83.41B
DUPONT DE NEMOURS, INC.	-4.46%	30.92B
PIDILITE INDUSTRIES LIMITED	+7.53%	18.01B
BRENNTAG SE	-9.08%	11.74B
EASTMAN CHEMICAL COMPANY	+7.03%	11.33B
SYENSQO SA/NV	-9.19%	9.67B
ASAHI KASEI CORPORATION	+2.79%	9.6B
SRF LIMITED	+6.41%	9.3B
MITSUBISHI CHEMICAL GROUP CORPORATION	+3.75%	8.34B

1st Jan change

Capi.

BASF SE

+4.53%

48.95B

SHIN-ETSU CHEMICAL CO., LTD.

+6.61%

83.41B

DUPONT DE NEMOURS, INC.

-4.46%

30.92B

PIDILITE INDUSTRIES LIMITED

+7.53%

18.01B

BRENNTAG SE

-9.08%

11.74B

EASTMAN CHEMICAL COMPANY

+7.03%

11.33B

SYENSQO SA/NV

-9.19%

9.67B

ASAHI KASEI CORPORATION

+2.79%

9.6B

SRF LIMITED

+6.41%

9.3B

MITSUBISHI CHEMICAL GROUP CORPORATION

+3.75%

8.34B

Market Closed - Xetra Other stock markets 11:39:34 2024-04-25 am EDT			5-day change	1st Jan Change
50.99 ^EUR	-0.64%		-0.16%	+4.53%

Apr. 25	BASF SE : Buy rating from Baader Bank	ZD
Apr. 25	BASF SE : UBS reiterates its Buy rating	ZD

BASF SE : Buy rating from Baader Bank	Apr. 25	ZD
BASF SE : UBS reiterates its Buy rating	Apr. 25	ZD
Dpa-AFX Overview: COMPANIES from 25.04.2024 - 15:15	Apr. 25	DP
BASF CEO defends cost-cutting measures and Asia strategy	Apr. 25	DP
BASF SE : Goldman Sachs reiterates its Neutral rating	Apr. 25	ZD
BASF SE : Warburg Research reiterates its Neutral rating	Apr. 25	ZD
BASF: adjusted EPS down 12.9% in Q1	Apr. 25	CF
BASF expects improvement after losses at the start of the year	Apr. 25	DP
BASF SE : Jefferies reaffirms its Sell rating	Apr. 25	ZD
Transcript : BASF SE, Q1 2024 Earnings Call, Apr 25, 2024	Apr. 25
BASF Posts Lower Q1 Attributable Income, Sales Revenue	Apr. 25	MT
BASF Backs 2024 Guidance Despite Lower Sales	Apr. 25	DJ
New BASF CEO greeted with drop in Q1 earnings	Apr. 25	RE
BASF post slight Q1 earnings drop but beats forecast	Apr. 25	RE
EMEA Morning Briefing : Shares Seen Lower; Earnings, U.S. GDP Data in Focus	Apr. 25	DJ
BASF CEO Brudermüller steps down - Chemical company presents figures	Apr. 24	DP
Due to partial sale: Board of Wintershall Dea subsidiary to be reorganized	Apr. 24	DP
Wintershall Dea management board to step down after Harbour Energy deal closed	Apr. 24	RE
German Shares Advance as Eurozone Consumer Confidence Edges Up	Apr. 22	MT
BASF: buys 49% of Nordlicht wind farms from Vattenfall	Apr. 22	CF
BASF to Buy Stake in German Offshore Wind Farms From Vattenfall -- Update	Apr. 22	DJ
BASF to Buy Stake in German Offshore Wind Farms From Vattenfall	Apr. 22	DJ
BASF to Buy 49% Stake in Vattenfall’s Nordlicht 1, 2 Wind Farms in Germany	Apr. 22	MT
BASF SE : Warburg Research remains Neutral	Apr. 22	ZD
Russia's Nornickel to move copper plant to China, CEO tells Interfax	Apr. 22	RE

BASF SE

Equities

BAS

DE000BASF111

Diversified Chemicals

Imperial College London : Machine learning techniques from Imperial and BASF advance experimental design

Latest news about BASF SE

Chart BASF SE

Company Profile

Income Statement Evolution

Ratings for BASF SE

Analysts' Consensus

EPS Revisions

Quarterly earnings - Rate of surprise

Sector Diversified Chemicals