Log in
Show password
Forgot password ?
Become a member for free
Sign up
Sign up
New member
Sign up for FREE
New customer
Discover our services
Dynamic quotes 


SummaryMost relevantAll NewsAnalyst Reco.Other languagesPress ReleasesOfficial PublicationsSector newsMarketScreener Strategies

VMware : Measuring Constructiveness and Inclusivity in Open Source – Part 3

10/14/2021 | 04:32pm EST

Description of icon when needed5 Min Read

Part 1 of this blog series discussed the importance of identifying constructiveness and inclusivity in open source and Part 2 explained the data representation techniques used. In this last part of the series, we will focus on training models to predict constructiveness and inclusivity.

Prediction Models

With the data tagged and transformed, we were ready to train the model. We experimented with a variety of machine learning models along with different data representations to determine the best model to predict constructiveness and inclusivity from the input data. Based on our hypothesis of the importance of measuring back and forth communication, we concluded that a 3-dimensional matrix data representation discussed in Part 2 would yield the best results. Similarly, computer vision models also use 3D matrices to represent images to build machine learning models. Due to how the data and images are represented, we felt that Convolutional Neural Networks - state of the art for computer vision - would also perform well on our data.

Table 1: Predicting Performance - Comparing Results Between Algorithms (The table indicates the performance in predicting "constructive" and "inclusive" labels for all data representations and machine learning algorithms tested. The "No Fit" indicates that the model did not learn and predicts only one label.)

The results highlighted validate the utility of our novel data representation for conversations as the machine learning models trained on that data representation show significantly higher accuracy. Model metrics for the 2D and 3D matrices are very similar, so additional training is required to conclusively determine the best data representation. Overall, we were also able to train Convolutional Neural Networks that can predict constructive and inclusive labels with high accuracy (80% and 90% respectively).

Understanding the Prediction

Ultimately, we want to enable contributors to be more constructive and inclusive in their feedback. These insights can be obtained through an analysis of how and why the machine learning models make a prediction. Using Tensorflow Explain'sSmooth Gradient, we combine the node activations and gradients from each layer of the neural network for a particular input to obtain the output contribution of each input.

For example, the model predicts pull request #86 of Kubeflow KFServingto be constructive. Below is how the model analyzed a commentwithin a pull request:

"Currently ValidateUpdate ignores the old object and just calls Validate-which equates to: 'you validate yourself'… If we change that (and I agree it will be nice not to have to), we will have to add a method on the interface that checks if the fields that changed are mutable by passing in old and new to validate… so in that sense an extra fn.. so going to keep the method as it is for now."

- Comment on Pull Request #86 of Kfserving project by Kubeflow

Analysis of the comment for Constructiveness

Based on the analysis, the phrases "we changed that," "by passing in" and "that changed are" contributed positively to determining constructiveness. Similarly, we can also analyze phrases important to inclusivity.

Current Shortcomings

In this experiment, our above implementation seems to determine and analyze constructive and inclusive communication with high accuracy. However, we have only tested the framework and results for one small project due to resource constraints. In particular, labeling, tagging and transforming data is time consuming and labor-intensive. A broader sample size of projects and data labeled by people with diverse cultural backgrounds would reduce data bias and improve the models.

Future Work

In this three-part blog series, we discussed the importance of constructive and inclusive communication in open source, devised possible representations for conversational data, and developed models to predict and understand constructiveness and inclusivity. While the current implementation requires annotated data for each project, further work in transfer learning could make the models extensible to new projects without the need for newly annotated data specific to that project. The current framework can also be combined with generative models like GPT-3to build an application that nurtures constructiveness and inclusivity in real-time just like Grammarlynurtures proper grammar.

If you're interested in collaborating on this project, come join us atvmware-labs / ml-conversation-analytic toolon GitHub. We welcome all contributors!


VMware Inc. published this content on 14 October 2021 and is solely responsible for the information contained therein. Distributed by Public, unedited and unaltered, on 14 October 2021 20:31:08 UTC.

ę Publicnow 2021
All news about VMWARE, INC.
01:55aVMWARE : Leveraging Automation in VMware Skyline Advisor Pro
01/14VMWARE : Photon OS 4.0 Rev 2 is now available
01/14VMWARE ON VMWARE : an Important Partner Across the Entire Business
01/14VMWARE : Feature Friday Episode 77 – DevOps Service Opportunity
01/14VMWARE : Named One of America's Most JUST Companies for 5th Consecutive Year, Awarded Top ..
01/13VMWARE : Attends White House Summit on Open Source Software Security
01/13VMWARE : Announcing VMware vRealize Automation SaltStack SecOps Cloud
01/13VMWARE : Navigating Change & Uncertainty Early in Your Career
01/13VMWARE : Announcing VMware NSX Advanced Load Balancer (Avi) with Cloud Services and the Av..
01/13NOW AVAILABLE : vRealize Network Insight 6.5
More news
Analyst Recommendations on VMWARE, INC.
More recommendations
Financials (USD)
Sales 2022 12 843 M - -
Net income 2022 1 736 M - -
Net Debt 2022 9 405 M - -
P/E ratio 2022 30,5x
Yield 2022 -
Capitalization 52 621 M 52 621 M -
EV / Sales 2022 4,83x
EV / Sales 2023 4,12x
Nbr of Employees 32 300
Free-Float 28,6%
Duration : Period :
VMware, Inc. Technical Analysis Chart | MarketScreener
Full-screen chart
Technical analysis trends VMWARE, INC.
Short TermMid-TermLong Term
Income Statement Evolution
Mean consensus OUTPERFORM
Number of Analysts 31
Last Close Price 125,18 $
Average target price 152,45 $
Spread / Average Target 21,8%
EPS Revisions
Managers and Directors
Rangarajan Govind Raghuram Chief Executive Officer & Director
Sumit Dhawan President
Zane C. Rowe Chief Financial Officer & Executive Vice President
Michael Saul Dell Chairman
Jason Conyard Chief Information Officer
Sector and Competitors
1st jan.Capi. (M$)
VMWARE, INC.8.03%52 621
ACCENTURE PLC-14.76%223 324