What you’re reading this the result of dictation from an M2 Max Macbook Pro, to which I upgraded after some time of pretty much just realizing that I needed a new computer.

I don’t know how good it is going to be, but I have expectations that it will be a little better compared to what we have on the iPhone and other devices that we can use as part of the Apple system. So far, it seems there are some issues with the system because it recognizes some words incorrectly – partly that could be because I am pronouncing those words in a way that is not really concordant with what the algorithm is able to imitate. Hence, it ends up transcribing the wrong things, because what it hears is, in fact, something incomprehensible.

When it comes to automatic speech recognition, the algorithms that process our speech face problems when the expected input is ambiguous and can possibly match several possible outputs. The automatic speech recognition function is, after all, based on making predictions, based on the likely input that would be expected from a user over time. In the event that the prediction made by the algorithm is incorrect, the software may be penalized as a result of the input not matching the prediction. The ultimate goal is to have a scenario where every prediction generated by the model, as the person speaks, matches what the person intended to say with the least possible corrections required.

Let’s start with the voice’s pitch and ambient noise.

For an algorithm to receive the correct input, the sound it receives should match the specific waveforms used in the training dataset that guided the development of the automatic speech recognition system. If there is a deviation in the sound pattern, either due to ambient noise or tone of voice, the generated text can be inaccurate because the system receives problematic inputs in the first place. This is natural, because if you feed garbage in, you will naturally receive garbage out. There is a sort of equivalent exchange at play.

Let’s now talk about something different altogether – the device’s processing power. I’m not sure if this is a factor with the automatic speech recognition system on Apple devices, but I suspect it is. Each device needs to perform complex calculations that allow it to make predictions at a relatively high rate of 100 to 130 words per minute. If you look into the memory usage of the computer as it performs this process, you may see that the memory does not get consumed at a high degree – that is something I plan to test in the coming days.

The last possibility is that there is a problem with the algorithm itself in recognizing certain patterns of waves. There can be some variation depending on the quality of the input, but it’s also possible that the algorithm used to process the data can make mistakes on occasion. I’m confident that Apple is making strides to improve the output quality of its algorithms, and for that reason, I am optimistic about the improvements they can bring about.

I give this much thought because it is one of the most important aspects of any generative artificial intelligence system. These systems rely on good input into the algorithms, and speech recognition systems are extremely important as sources of input. As we interact with natural language on a daily basis, which is often faster than typing or pressing buttons on our devices, I believe that accurate dictation and procedural proofreading of our daily writing will lead us to a new era of AI.

I can’t wait to see what the future holds, particularly as we approach the end of 2023 with developments like iOS 17, AI generators, and all the different forms of technology that are becoming more prevalent. Time passes, and it is somewhat sad to think that we are coming closer to the end of our lives before these things come to fruition – still, the show is not over until it is over, and I can’t wait to see what is going to come!

Leave A Comment

Recommended Posts

AUAM-NAMSA Corporate Pathways Networking Dinner – Some small reflections.

The journey has been pretty interesting in a whole bunch of different ways. Amongst other things, I’ve received a partnership with GerakBudaya, and also in conjunction with the American Universities Alumni Association of Malaysia and the National Assembly of Malaysian Students in the United States of America (NAMSA), we are organizing this event. Here’s the event PDF to showcase that this is quite real. Honestly, even the term Corporate Pathways is a bit of a misnomer. I don’t know how corporate this event is going to be, primarily because it’s mainly going to be focused upon experience sharing and how people thought about their lives in the course of GLC in relation to the education that they received while they were in the U.S. There is a whole backstory to this that goes back about a month or two months or so, but has led me to a place of networking, meeting different people, and establishing friendly chat after friendly chat, rather than transactional moment after transactional moment with a bunch of different people with whom I probably never imagined at the outset that I’d be on casual speaking terms with. Anyway, here are some of the people who will be on the panel. GLC Panel: Nick Khaw, Head of Research at Khazanah and alumnus of Harvard University. Aik Chong Phuah, previous CEO of Petronas Digital and alumnus of the University of Chicago Booth School of Business. Brendan Yap, Senior Executive at the Securities Commission and alumnus of NYU. Athirah Azmi, former Manager, Client Coverage at Maybank Investment Bank and alumna of the University of Chicago  Private Sector Panel: Audrey Ooi, co-founder of Colony Coworking Space and an alumna of Mount Holyoke College, also known as @fourfeetnine. Dato’ Vincent Choo, Founder, Urban Ground Group, Franchisee Subway; alumnus of Eastern Michigan University. Yen Ping Teh, APAC Product Partnerships at Google and an […]

The Most Moral Business?

From a very young age, I had always understood businesses as entities that bring value to society in meaningful ways that otherwise it would not receive – and that’s one of the many reasons why the bookstore had always stood out to me as one of the ultimate business forms; after all, what could deliver more value to society than the transmission of knowledge itself for the profit of mankind, far above and in excess the monetary value that is paid for them? There is a huge camp out there of people who think that knowledge should be free and fairly accessible – I don’t dispute this to be a valuable point of view, but I also consider it to not be tenable; sure, we can say that society is the core issue and that knowledge *should* all be made available to the universe, but the fact remains that creating and obtaining knowledge is a costly effort, and if nothing else, the process of assembling something together whether through study or physical organization is something that manifestly should be rewarded. A critic might very well say that that’s the capitalist in me speaking and fighting against all the tides of justice, but I believe that life is about managing circumstances, finding spaces in the interstices of ideals and reality where there is a happy balance, and pushing forward in the thrust for existence. In my ethical system and paradigm, that bookstore is the most moral of the businesses out there – maybe a little strange if you consider that Amazon had started out as an online bookstore perhaps, but that’s what we’re working with here x) Anyway, I’m very happy to announce that I have a brand new partnership with the online bookstore GerakBudaya, for whom I’ve begun writing reviews – most immediately, a review of Dr. Toh Kin Woon’s “Malaysia’s […]

Fighting Perfectionism

If there were a flaw that I would observe about myself, I think it would be this. That I am someone who thinks a little too much about what other people think. What this means sometimes is that I tend to not want to release things because I fear that they won’t be appreciated, people won’t like them, or anything else of that nature. And granted, that doesn’t happen across everything. If it’s just an Instagram story, for example, I just enjoy releasing dumb, interesting things that reflect the different random things that happen during the course of the day. But when it comes to more extended creative projects, I think that I am restrained in some ways by feeling that everything needs to be perfect. Everything needs to somehow just match up with the best. And in some ways, that is kind of negative, because if you were to just try for things that aren’t always good, if you’re a new perfectionist, then what ends up happening is that, sure, you might end up creating a good product, but what will probably happen also is that you’ll just not release anything. And believe me, that does happen quite a fair bit for me. I am the kind of person who tends to hem to haw, to just kind of let things go by because I think, “Oh, it’s not ready. Oh, I shouldn’t release this. Oh, more needs to be done.” And that’s just my nature. I tend to be pretty careful with a lot of different things. But at the same time, I’d like to try to get past that and I think that that can happen in at least two different ways. One is that I reach a level of ability whereby the things that I do end up matching what I consider to be a nice standard. […]

Time and Existential Risk

Time is the ultimate existential risk.  I know this not just from deferring to the vague idea of theory or of an arcane book somewhere. It’s something that I’ve experienced deeply and intimately from my own struggling with the realities of procrastination in a world that seems to tolerate it on the surface, but only because I wasn’t able to appreciate what that procrastination brought about, the end of many different things, on timescales that I did not appreciate and therefore could not apprehend.  Now the thing is, as a child, you maybe don’t appreciate that time is passing. Far from it. When you’re in the midst of school, it feels least like time is passing. In boring afternoon lectures, it can feel like the entire moment has lasted more than a lifetime plus some change. And still, the teacher is there yapping about something that you don’t really care too much about.  All of us understand in life that all things come to an end.  Human lifespans are finite, averaging 72.6 years according to the WHO in 2019, with exceptions like Japan at 84.5, Singapore at 83.9, and Monaco at 89.4. The average career is 40 years.  School concludes in no more than 6 years at elementary level, 5 years in secondary. It kinda depends what kind of schooling system you go for and where you were born, but that doesn’t really matter. From school, maybe you work, or if you’re lucky enough, you go on to university. Then poof, 4 years later, maybe you graduate, get a new degree, and so on so forth – but in the moment, it feels like you were engaging with a distant theoretical concept, and the temptation draws us in to believe a quixotic ideal: “This moment will last forever.” But it will not. Though you may feel that it will last forever, […]

Sleep In Progress

I lie in bed, my eyes are closed. My over-caffeinated heart is beating, “Thud, thud, thud, thud, thud” in the depths of a chest that will not rest. This is the fruit that has grown from the coffee after a joyous evening, reminding me that not all that is pleasant is desirable and not all that is enjoyable yields longer term joy. It doesn’t help that the sound of laughter repeats from outside my window, innocuous at best at 10am but odious at 1am – a one digit separation, but a universe apart. The judgmental part of me considers the inconsiderate neighbors and their inconsiderate family members just casually laughing away with no thought for anyone whom they may affect, and somehow in a flash, it dawns upon me: This is the nature of true evil. Evil, in all likelihood, is not callous nor is it particularly malicious. It’s just the result of doing what feels right at a particular time, feeling that one is justified, with no deeper meaning underpinning it. I don’t suppose that my neighbors have a particular wish for me to experience sleep deprivation. No, rather they wish merely to enjoy their weekend. Does it mean that I am unaffected by what they are doing? No, by no means. I certainly am affected. That is why I am lying in bed right now, talking to myself. I think this characterizes most evil in this world. The consequence of people simply just following their self-interest in ways that they justify. “I was so tired the entire week.” “I need to celebrate.” “We’re all together right now. Why shouldn’t we make as much noise as we want?” “It’s just for the weekend. Surely they will understand.” But I don’t. I don’t think that there is anything particularly noble, wonderful, or celebratory about this whole coincidence of things because it […]

GRE – A First Test.

Today, I did my very first GRE at home, and let me tell you – that’s a fascinating experience to have. It was my first time experiencing at-home proctoring, which the test agency, ETS, carried out by means of a proctor who did some of the following things: If you haven’t heard of any of this before, rest assured that you’re in good company – I hadn’t either! Either way, not too sure what actual score I’m going to receive, but the test was an interesting one (as interesting as your standard issue GRE can be); it was a little bit disappointing to see the unofficial score that I’d received, given that I’d done a bit of practice and had done quite well, but I was shocked to see the result that had appeared on the screen – an estimated 320, despite PowerPrep Plus practice test scores of 331, 334, and 330 previously. I have no idea how well ETS performed the estimation, but rather than reject it, I’ve decided to take it seriously and take action along the way, because what this result showed me was that maybe I had been a bit too presumptuous about my own strengths and abilities to the point that I thought that I’d be able to game the test. Still, I think it was a shock and a wake-up call to try a little harder and not to give up and, accordingly, to carry on with the articles of mental development that the GRE has brought about – because the GRE certainly has brought about many of these things in the week or so that I’ve spent preparing. #1: To become more efficient at writing. The GRE has definitely taught me to write much more efficiently and to structure my writing – to get much better at deciding what parts of my argument should […]