Finding and fixing bugs in code is a time-consuming, and often frustrating, part of everyday work for software developers. Can deep learning address this problem and help developers deliver better software, faster? In a new paper, Self-Supervised Bug Detection and Repair, presented at the 2021 Conference on Neural Information Processing Systems (NeurIPS 2021), we present a promising deep learning model, which we call BugLab. BugLab can be taught to detect and fix bugs, without using labeled data, through a "hide and seek" game.

Finding and fixing bugs in code requires not only reasoning over the code's structure but also understanding the ambiguous natural language hints that software developers leave in code comments, variable names, and more. For example, the code snippet below fixes a bug in an open-source project on GitHub.
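
For illustration, a bug of this kind might look like the following hypothetical snippet (our own example, not the actual code from the project):

    def pop_oldest(queue, max_size):
        # Remove entries until the queue is within its size limit.
        while len(queue) >= max_size:   # bug: ">=" should be ">"
            queue.pop(0)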

Here the developer's intent is clear through the natural language comment as well as the high-level structure of the code. However, a bug slipped through, and the wrong comparison operator was used. Our deep learning model was able to correctly identify this bug and alert the developer.

Similarly, in another open-source project, the code (below) incorrectly checked whether the wrong variable was empty instead of the correct variable, "on".
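
A hypothetical sketch of this kind of variable-misuse bug (again our own example, not the project's code):

    def apply_settings(options, on):
        # Skip work when the feature is switched off.
        if not options:   # bug: the intended check is "if not on:"
            return
        ...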

The goal of our work is to develop better AI that can automatically find and repair bugs like the two shown above, which look simple but are often hard to spot. Freeing developers from this task gives them more time to work on the more critical (and interesting) elements of software development. However, finding bugs, even seemingly small ones, is challenging, as a piece of code typically does not come with a formal specification of its intended behavior. Training machines to automatically recognize bugs is further complicated by a lack of training data: while vast amounts of program source code are available through sites such as GitHub, only a few small datasets of explicitly annotated bugs exist.

To tackle this problem, we propose BugLab, which uses two competing models that learn by playing a "hide and seek" game broadly inspired by generative adversarial networks (GANs). Given some existing code, presumed to be correct, a bug selector model decides whether it should introduce a bug, where to introduce it, and its exact form (e.g., replace a specific "+" with a "-"). Given the selector's choice, the code is edited to introduce the bug. Then a second model, the bug detector, tries to determine whether a bug was introduced in the code and, if so, to locate and fix it.
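
To make the selector's rewrites concrete, here is a minimal, hypothetical sketch of applying one such rewrite to Python source with the standard ast module (in BugLab the selector chooses among many candidate rewrites; this toy transformer hard-codes a single one):

    import ast

    class SwapPlusMinus(ast.NodeTransformer):
        """Toy rewrite: turn the first "+" into a "-"."""
        def __init__(self):
            self.done = False

        def visit_BinOp(self, node):
            self.generic_visit(node)
            if not self.done and isinstance(node.op, ast.Add):
                node.op = ast.Sub()   # introduce the bug
                self.done = True
            return node

    src = "def total(xs, bias):\n    return sum(xs) + bias\n"
    buggy = ast.unparse(SwapPlusMinus().visit(ast.parse(src)))
    print(buggy)   # the "+" has become a "-": a planted arithmetic bug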

These two models are jointly trained without labeled data, i.e., in a self-supervised way, over millions of code snippets. The bug selector tries to learn to "hide" interesting bugs within each code snippet and the detector aims to beat the selector by finding and fixing them. Through this process, the detector becomes increasingly capable of detecting and fixing bugs, while the bug selector learns to generate increasingly challenging training examples.

This training process is conceptually similar to GANs. However, our bug selector does not generate a new code snippet from scratch; instead, it rewrites an existing piece of code (assumed to be correct). In addition, code rewrites are necessarily discrete, so gradients cannot be propagated from the detector to the selector. Note that, in contrast to GANs, we are interested in obtaining a good detector (akin to a GAN's discriminator) rather than a good selector (akin to a GAN's generator). Alternatively, the "hide and seek" game can be seen as a teacher-student setup, in which the selector tries to "teach" the detector to robustly locate and fix bugs.
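
Because the rewrites are discrete, the selector cannot be trained by backpropagating through the detector. The toy loop below sketches one plausible way to close that gap, using stand-in linear models and a REINFORCE-style selector update; it is our illustration of the idea, not the paper's exact objective:

    import torch

    # Stand-ins for the two models; the real ones operate on program graphs.
    selector = torch.nn.Linear(8, 3)   # scores 3 candidate rewrites
    detector = torch.nn.Linear(8, 3)   # scores 3 candidate bug locations
    opt_s = torch.optim.Adam(selector.parameters(), lr=1e-3)
    opt_d = torch.optim.Adam(detector.parameters(), lr=1e-3)

    for step in range(1000):
        code = torch.randn(8)                 # stand-in for a code snippet
        # The selector samples a discrete rewrite; no gradient flows through it.
        probs = torch.softmax(selector(code), dim=-1)
        rewrite = torch.multinomial(probs, 1).item()
        buggy = code + 0.1 * rewrite          # stand-in for the edited code

        # Train the detector to locate the planted bug.
        d_loss = torch.nn.functional.cross_entropy(
            detector(buggy).unsqueeze(0), torch.tensor([rewrite]))
        opt_d.zero_grad()
        d_loss.backward()
        opt_d.step()

        # Reward the selector when the detector struggles (REINFORCE-style).
        s_loss = -d_loss.detach() * torch.log(probs[rewrite])
        opt_s.zero_grad()
        s_loss.backward()
        opt_s.step()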

Results

In theory, we could apply the hide-and-seek game broadly, teaching a machine to identify arbitrarily complex bugs. However, such bugs are still outside the reach of modern AI methods. Instead, we are concentrating on a set of commonly appearing bugs. These include incorrect comparisons (e.g., using "<=" instead of "<" or ">"), incorrect Boolean operators (e.g., using "and" instead of "or" and vice versa), variable misuses (e.g., incorrectly using "i" instead of "j") and a few others. To test our system, we focus on Python code.
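
A toy catalogue of such rewrites might look like this (hypothetical structure and names; the paper's full set of rewrite rules is larger):

    # Toy catalogue of the bug classes above, expressed as rewrite rules.
    REWRITES = {
        "comparison": {"<": "<=", "<=": "<", ">": ">=", ">=": ">", "==": "!="},
        "boolean":    {"and": "or", "or": "and"},
        # Variable misuse replaces one in-scope identifier with another
        # (e.g., "i" with "j") and so depends on the snippet's symbols.
    }

    print(REWRITES["comparison"]["<="])   # "<=" may be rewritten to "<"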

Once our detector is trained, we use it to detect and repair bugs in real-life code. To measure performance, we manually annotate a small dataset of code snippets from packages in the Python Package Index with such bugs and show that models trained with our "hide-and-seek" method are up to 30% better than alternatives, e.g., detectors trained with randomly inserted bugs. The results are promising, showing that about 26% of bugs can be found and fixed automatically. Among the bugs our detector found were 19 previously unknown bugs in real-life open-source GitHub code. However, the results also showed many false-positive warnings, suggesting that further advancements are needed before such models can be practically deployed.

How machine learning "understands" code

We now dive a bit deeper into our detector and selector models. How can deep learning models "understand" what a snippet of code is doing? Past research has shown that representing code as a sequence of tokens (roughly, the "words" of code) yields suboptimal results. Instead, we need to exploit the rich structure of code, including its syntax, data flow, and control flow. To achieve this, inspired by our earlier work, we represent the entities within the code (syntax nodes, expressions, identifiers, symbols, etc.) as nodes in a graph and indicate their relationships with edges.
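
As a bare-bones sketch of the idea (ours, not the paper's actual graph construction, which adds many more entities and relations), Python's own ast module can turn a snippet into nodes and edges:

    import ast

    def code_to_graph(src):
        """Toy code-to-graph encoding: AST nodes become graph nodes,
        and parent-child relations become typed edges."""
        tree = ast.parse(src)
        walk = list(ast.walk(tree))
        index = {id(n): i for i, n in enumerate(walk)}
        nodes = [type(n).__name__ for n in walk]
        edges = [(index[id(p)], index[id(c)], "child")
                 for p in walk for c in ast.iter_child_nodes(p)]
        return nodes, edges

    nodes, edges = code_to_graph("x = a + b")
    print(nodes[:4])   # e.g., ['Module', 'Assign', 'Name', 'BinOp']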

Given such a representation, we can use a number of standard neural network architectures to train bug detectors and selectors. In practice, we experimented with both graph neural networks (GNNs) and relational transformers. Both architectures can leverage the rich structure of the graph and learn to reason over the entities and their relations. In our paper, we compare the two architectures and find that GNNs generally outperform relational transformers.
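
For intuition, the sketch below shows one round of message passing over typed edges, the core operation such a GNN performs on a program graph (a bare-bones illustration of ours, not the paper's architecture):

    import torch

    class SimpleGNNLayer(torch.nn.Module):
        """One message-passing step over typed edges, followed by a
        gated update of each node's state."""
        def __init__(self, dim, num_edge_types):
            super().__init__()
            self.msg = torch.nn.ModuleList(
                [torch.nn.Linear(dim, dim) for _ in range(num_edge_types)])
            self.update = torch.nn.GRUCell(dim, dim)

        def forward(self, h, edges):
            agg = torch.zeros_like(h)        # aggregated incoming messages
            for src, dst, etype in edges:    # edges: (source, target, type)
                agg[dst] += self.msg[etype](h[src])
            return self.update(agg, h)

    h = torch.randn(5, 16)                   # 5 graph nodes, 16-dim states
    layer = SimpleGNNLayer(16, num_edge_types=2)
    h = layer(h, [(0, 1, 0), (1, 2, 1), (3, 4, 0)])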

Conclusions

Creating deep learning models that learn to detect and repair bugs is a fundamental task in AI research, as a solution requires human-level understanding of program code and contextual cues from variable names and comments. In our BugLab work, we show that by jointly training two models to play a hide-and-seek game, we can teach computers to be promising bug detectors, although much more work is needed to make such detectors reliable for practical use.
