Apply

First Name*

Last Name*

Email*

Choose Program*

Academic experience in:(Which of these: Probability & Statistics, Calculus, Linear Algebra or none)

Mobile (Type your number without dashes)*

Country of residence*

utm_campaign

I agree to receive information from Israel Tech ChallengeI agree to receive information from Israel Tech Challenge

First Name

Last Name

utm_campaign

Choose Program*

Preferred Specialization

Mobile (Type your number without dashes)*

Linkedin Address (URL)*

Country of origin*

Country of residence*

Academic Institution*

Academic Degree

Do you have programming knowledge?

How did you hear of US?*

utm_campaign

Qualitative Transcription Metric, Verbit.AI

Abstract

When comparing a given transcription to the “ground truth” of an audio, the simplest way to evaluate the transcription quality is by computing the fraction of words that are different. This is called Word Error Rate, or WER for short. The problem is, of course, that this includes cases which are irrelevant/insignificant for a human reader – not all diffs are created equal. Some matter more than others, and some are totally meaningless. This renders our existing quality metric (WER) very noisy. The project aim is to (start to) build a separate metric (or set of such metrics), which are “human-reader-centric”, i.e. designed specifically for the purpose of providing a quality score which is robust to the aforementioned complications.

Challenges (at least two)

The first challenge was to understand what was considered as irrelevant and finding how common this in the jobs has been done.
After overcoming the first challenge, I needed to think about methods to take care of the irrelevant changes/differences in the jobs.
Implement those methods in python code.
Thinking about how to improve the methods after receiving first results.

Achievements (according to KPIs)

Defining changes and evaluating their scope. (examples for changes: Capitalization, false starts speaking and inaudible cases)
Defining methods to deal with the changes (for example: ignoring X words and ignoring X seconds)
Writing functions to implement those methods.
Testing how those methods affect the WER metric.

Future project development

Test my methods on more jobs to compare the results and understand better which method is good for most of the cases.
Combine methods and test the results.
In the project I focused on a few changes, so it is necessary to develop more methods to encapsulate on more changes.