Once in a while, a technology comes along that just completely transforms the way that we think, we live, and we experience the entire world. 

​​Certainly the entire world has been captivated by the rise of AI in recent days – how could it not, when millions of influencers around the world endeavor on a day to day basis to showcase the 500th AI tool that you ABSOLUTELY NEED TO USE on a day to day basis?

Well, I don’t know much about technologies beyond ChatGPT, to be honest, but there is definitely one thing that has come out from it, which is probably the feature that I use the very most out of pretty much everything on the planet, and that is automated speech recognition, specifically, the OpenAI ChatGPT Whisper ASR Recognition System.

Automated speech recognition is how I’m communicating everything here to you. It is how I’m putting down my thoughts, word by word, by simply sitting down next to this open door on a rainy morning, narrating out the story as if I were talking to you. 

The Automated Speech Recognition Algorithm, which is in this case the ChatGPT Whisper app, is transcribing everything that I’m saying with an almost perfect accuracy, but perhaps with some small issues with punctuation that I will fix after the fact. It is incredible, tremendously accurate, and something that I could have never imagined just three to four months ago.

As a result of this technology, as you read, you’re actually listening, in a sense, to what I said on that morning when the air was cool and the rain was falling, it was 7.49am in the morning, and 48 seconds had passed on the clock. 

As I narrated these words then, eventually, the clock turned to 7.50, indicating a shift in time. 

I made a mental note to myself at that time that I would look at the total number of words that had transpired during this time, because it bears a significant meaning, which I would like to elaborate upon. 

Automated speech recognition is wonderful for me. It has done some of the following things:

  1. Dramatically sped up my rate of interactions,
  2. Reduced the strain on my body
  3. Given me extensive practice in public speaking and articulation. 

Let me go into all of these one after another.

Dramatically sped up my rate of interactions.

Every form of communication has its idiosyncrasies, and can be considered a skill in its own right. 

In terms of speed, handwriting is the slowest, clocking in at 12-20 words per minute. 

Typing comes next at around 40-80 words per minute, depending on the typist, with some people going far above that, assuming they’ve had professional training. 

And finally, speaking clocks in at around 130-150 words per minute. 

The clear corollary of all this, I think, is that if a person adopts speaking as their dominant mode of communication, that they will be able to get things done at a much faster rate than they otherwise would be able to by texting or writing to others. In fact, this is the reason why communicating by phone or meeting in person can be so much more efficient relative to just sending out messages and waiting for email conversations to proceed. 

As a user of automated speech recognition technology, I get to take advantage of the fact that I can speak quickly in order to create documents, which in turn helps me to very rapidly think of different things. 

In a sense, I am constantly on my toes and crafting different ways of dealing with problems, for the simple reason that I can now deal with more of them within a smaller amount of time than I used to. 

Rather than taking, say, 5-10 minutes to reply a text message, as I did before, I can now simply speak out the contents of what I want to say to others, very simply and very easily, without really thinking too much about typing down all the words, which itself is a long exercise. 

This has allowed me to take many more opportunities within shorter periods of time, and in turn to try faster and more frequently. This, for me, has been a game-changer in many different ways, and the consequences are something that I have yet to even fully understand, although they will need to be accompanied by developments in planning in the days to come.

Reduced the strain on my body

Texting is physically strenuous. 

It might not initially seem so, but it absolutely is, because whether you’re typing on a computer keyboard or on a phone, what is happening is that you are actuating your fingers and joints to hit keys over and over again for the purpose of communication, which requires you to move your fingers around in such a way that you can create the desired pattern of output on the screen. 

Having said that, these implications alone are far from the only problems that one could associate with texting for long periods of time. Here’s a helpful list created by ChatGPT.

Using automated speech recognition can reduce repetitive strain by reducing the repetitive movements associated with typing and even text claw, which is something that I discovered when I began using these technologies after a long period of time in which I had begun facing finger and wrist pain from texting too much and making use of devices too extensively. 

This has been a game changer for me because I was in so much pain on some days that I found it difficult to type but found it necessary to continue typing anyway. 

Being able to address this problem was truly incredible because it opened up possibilities of communicating without a situation of pain. It’s also worthwhile to note that typing via automated speech recognition allows a person to communicate with better posture and under more relaxed circumstances. Even as we speak right now, I am casually narrating all of this while sitting down on my secret lab chair and leaning back with my feet on the gate in front of me. Just communicating everything that I intend to say in a relatively free manner and dramatically faster than I otherwise would have been able to just a short while ago. This helps to prevent a variety of different problems associated with texting or writing which include text neck which occurs when a person’s neck is hunched over as they look at a device. And also the postural problems associated with maintaining one’s eyes upon a device in an attempt to look at the words that are being produced on a document. I am simply at the moment just holding my phone in my left hand watching the transcription seeing if it is going out properly and everything is just coming out easily. Even right now, for that matter, I am witnessing other benefits such as reduced eye strain. My eyes are closed as I am narrating all of this and it can seem as if I am speaking to myself but that is not exactly the case. Still, what is real though is that I am able to perform this entire task without looking at my phone screen even for a single moment which allows me in turn to go right ahead and just type out everything without fear or favour. It’s also worthwhile to note that this benefit offers significant advantages in accessibility to anyone out there who needs such access. It’s allowing a person to potentially communicate at an extremely quick rate even if they happen to suffer from a disability that would otherwise impede them from performing this type of communication. It’s also worthwhile to note that this allows for multitasking and allows me in turn to do different things and to look around me as well. Positioning my focus between different things rather than just looking at the screen and having my entire attention focus on the process of creating a single document. Which in turn leads into a lower cognitive load overall and in turn into a very natural communicative aspect which is manifested in the words that I am saying at this point in time.

Given me practice in public speaking and articulation.

Using an ASR system is a truly unique experience. 

It’s an experience that involves speaking to a device, which in turn involves thinking about what you’re going to say, thinking about how it’s going to come out, and arguably thinking more intuitively about what a listener on the other side might actually be hearing, feeling, or imagining. 

It’s not a complete substitute for speaking to an actual audience, or to actual people, of course, but the very act of articulating things through speech itself gives a person significant practice in understanding how to develop their manner of speech, the cadences of their voice, the structure of their thoughts, and the rises and falls of emotion along the way. 

This is tremendously good practice, I think, for situations in which a person might, at a later point, communicate via speech particularly as one can do it in a relatively relaxed manner, as I mentioned in the previous few points, while at the same time communicating at a much faster rate than they otherwise would if they were simply to go ahead and type things out. 

This type of practice, affirmed constantly and experimented with over time, is something that has dramatically improved my personal speaking skills. It has made me more articulate, not only because I have had to think about what to write about, and because I do so much more frequently now, but also because it constantly keeps me on my toes, forcing me in various ways to source things from my imagination and my thoughts in order to put them upon the page, which in turn reinforces a continual cycle of thought retrieval, building, structuring, and articulation that leads itself into a reinforcing cycle that develops, or at least I feel has developed, my process of thought formation in many different ways and will continue to be tremendously useful over the course of time for practice purposes, creation purposes, and in turn preparing me to speak on progressively larger stages in the days to come.

Using an ASR system is a very unique experience. It certainly is a brand new technology. But at the same time, it is something that allows a person to engage with his or her human abilities on a level that I have never truly encountered before, and that stands as unique to me within the history of humanity.

Concluding thoughts

I spoke extensively about the ways in which using ChatGPT’s Whisper ASR system has dramatically sped up my rate of interactions, reduced the strain on my body, and given me practice in public speaking and articulation. And I cannot emphasize more that this has been transformative, to say the very least.

 The last time I checked the word count of the piece, and prior to saying these words, it was already above 1,800, and I had started this project at 8am in the morning, which testifies to just how quick it is. I conducted the entire thing without straining my neck in any way, and in fact, in an ergonomically comfortable position, either while sitting down on a chair in a reclined position, or while standing up and just casually carrying my phone around, reducing the possibility of any incidence of text neck, and completely resolving the problem associated with text claw and repetitive strain injuries. 

Along the way, after having experienced both of these incredible benefits, I received some very extensive practice in public speaking and articulation, which admittedly was directed towards this device, but at the same time was also directed towards everyone who was capable of hearing me within a certain range. This in and of itself has been truly incredible, and the process of writing this piece has been a wonderful practice session. If it is not clear to those out there who haven’t used this before, I truly consider this to be a transformative technology, and one that has catalyzed a sea change within my own personal life. 

ChatGPT continues to hold the throne, of course, for technologies that have enabled the possibility of seemingly reasoning, AI systems that are capable of creating outputs that shock us and that, even now, I am continually learning from. In fact, it even houses the technology that has made it possible for me to make use of what I am making use of at the moment, possibly training its systems on the type of communication that I have chosen to initiate. Perhaps OpenAI’s engineers will keep track of this entire speech or conversation that I have initiated and that have in turn released onto their servers, but that for me is not ultimately a matter of concern, because I do believe in the idea that if one’s thoughts are sound and otherwise valuable, that they should be shared with the world anyway. As a matter of individual and collective responsibility, whether these thoughts are, of course, worthwhile, desirable, and may lead to a causal and beneficial impact upon the world, of course, is a matter of contention somehow or another. But one that I believe is being continually refined and created through the development of these technologies themselves. Of course, a person should repose self-awareness in the extent to which they truly are able to contribute, and should not overstate or over-inflate the extent of their capabilities. For what I can say, however, is that it feels a tremendous privilege to live in this day and age, and to be able to make use of something that has had such a profound impact on our ability to interact, to utilise our cognition, and to create in turn. It may seem like something trivial or otherwise small in the grand scheme of things, but this for me has been truly profound, and it is one of the many things that I cite and will continue to cite as my rationale for undertaking a journey of constant self-improvement as I move forward into the future. 

Thank you for reading (or was it listening?) and I will see you in the next piece.

Leave A Comment

Recommended Posts

The Body is the Hardware, The Mind is the Software

The analogy was interesting when I heard it first, and it remains interesting now because it resonates with me on at least a couple of different levels. Our bodies, the physical parts of us, are basically analogous to the hardware of a computer, running along with different parts here and there – upgradable, we can improve them by increasing the quality of the resources that go into them; improvable through good maintenance, we can exercise, sleep well, and do all sorts of other things to improve the hygiene on that front. Our minds, on the other hand, are the software – the programming that decides how we interact, think, solve problems in specific situations; the algorithms and little decisions that decide how we react to different scenarios and confronting different situations, whether it comes to talking to girls, investing, selling, marketing, or doing business with others. It is nice to think that the mind is upgradeable, and that somehow you can improve yourself through an act of willpower by learning certain things. Through sitting down and unlocking the secrets of the universe one after another, through a mixture of magic and also destiny. But who’s to say exactly how that should happen? Sorry, that’s a silly question. The answer is that it’s you.

A Small Change of Perception

I began this morning with the headline “How Kamala Harris Burned Through $1.5 Billion in 15 Weeks”, on NYT. It was an interesting head to a week of what was for me listening to, understanding, and better reckoning the world after Donald Trump was elected 47th President of the United States, and the first of many headlines I’d seen about this on New York Times. Some might view this as evidence that the media is cleaving towards the Trump administration as the chickens fall in line and loyalty becomes a Sine Qua Non in the era of an evil empire – but I think a little differently, because I feel like it’s teaching me something about reality. Look in the comments, and you will see how people have responded – people saying that the presidency is “deeply unserious”, highlighting any number of things that they disagree with even as they say that NYT’s “focus” is wrong, that Kamala “tried to save democracy”, and everything in between. If I really think about it, all of these seem about as valid as saying that Trump is secretly a genetically modified orange with a toupee made of cheese.  The entire idea of NYT is that it’s one of the most respected voices in journalism, that alongside publications like the Washington Post, it defines the Overton Window – the space of ideas that are acceptable to the public at any given point of time.  To the extent therefore that NYT’s function is valid for this purpose, I’m more likely to say that these critics are the ones who don’t make sense – That the calls against that validity are the true measure of what doesn’t make sense. I’ve often heard this idea that in fact a Trump presidency might be a situation where the inmates are running the asylum, but upon further inspection, I’m no […]

Perfectionism to eliminate

…And another has come. We are progressively moving towards the end of the year with each new beginning. This is I believe the 46th week of the year out of 52, and it’s leading towards the end of the year; Donald Trump is now president, filling up his cabinet with appointee after appointee as people contemplate things; you might believe that we’re at the start of something HUGE, as Donald might call it, world-shaking, incredible. But I think while that’s good, it’s good to look at something that I’ve wanted to get rid of for quite a while: Perfectionism. I am a victim of it, and I can’t deny that it follows me everywhere, making me question myself and whether what I’m putting out into the internet is either good or worth it – I second guess myself frequently, taking down blog posts that I think aren’t great or that aren’t well worded, thinking that perhaps I should rewrite or otherwise. I think that this is a very negative behavior, because frankly I don’t really care too much about what people think and secondly, it doesn’t really matter what they think – at least in relation to how I think about myself. So I would like to eliminate, therefore, the perfectionism that makes me rewrite things, redraft things, take wayyy too much time to release things. This is the next thing to change, and it’s a good thing to shift it in this year of 2024 – even if it is the only lesson that I will have learned by the end of this year, I think that it will have been a worthwhile one. Here’s to the next!

Creation

On an empty page, the pencil traces the dotted line, the circle, the shape.  From the movement of the pencil, a million universes appear, timelines splitting into their multiple component parts in a universe of endless possibility as millions more appear, each one a multiverse of possibilities as the pencil moves, tracing by movement, through which, across billions of possible environments, worlds, universes, colors, shapes, and relations. Look up from the page and perhaps you may see the created universe – Breathe in and you may appreciate its harmonies, the unity of physical constraints, of physical laws interweaving to make existence possible.  But is that universe truly greater?  Look down at the once empty page, no longer so empty. I claim that if you look further, there you will see it: Here lies the immortal beginning of every endeavor, the step not taken – a journey not yet made of a thousand miles as yet untraveled that you can begin, where you are, with a single stroke of your pen. Here, then, is the possibility of a universe even greater and even more intricate than you may have ever known — Whether you can reach that universe or not? That is a separate question — and none but experience can teach you its answer.

Your Teacher’s Thoughts towards the person he likes (but he doesn’t know?)

The feeling of love for someone is not something that you just go right ahead and deny. I don’t think it’s something that you should be shy about: That you like a person. Somehow after the years have come to pass it becomes true that somehow or another your ego doesn’t really take that much occupancy. You can admit honestly that you like a person even if you realize that there is no expectation of a return. On my part, I don’t know; even as a teacher, I really like a couple of different people here and there, but I realize clearly that there might not be really an expectation of a return, and that’s okay, that’s just how I am – it might be strange to think about, but even your teacher might have emotions, and so too might the rest of the world. It is a little unconventional to reveal your emotions, especially in a world like this, but to the person I do like, I like you, but I realize that I should not put you down, I should not bind you, I should not stop you from being who you want to be. If we come together, it is because somehow or another, through the millions of possible pathways, and somehow through the conversations, we liked each other, and that is enough for me and is an act of fortune, not of planning or otherwise. In the past, I would’ve been afraid of saying that I like a person or I wouldn’t have been so honest with my emotions. Nowadays, I don’t know if it’s because I’m old now, but I think it’s okay to say that I like people and I’m not too afraid of saying that I do because that’s just what it really is – an expression of emotion and a reality that I […]

The things we like but are not good at.

In this world, as we pass through, we may realize that there are some activities that we deeply and truly love – little skills, hobbies, and occupations that pique our minds, hearts, and souls when we participate in them as an act of pleasure.  As we pass through the tides of time, though, almost inevitably we come to realize that simply because someone we enjoy something, that doesn’t mean that we are going to be good at it. In fact, that’s an understatement.  Why are we talking about good when actually we can be horribly, devastatingly, and world-changingly catastrophic at it?  Here the realization inevitably comes, almost as if it were the common heritage of humanity: Just because you like something, that does not mean that you will be good at it.  One might argue that a true passion is such that even if one isn’t good at something, that the passion should stay.  Even if you are a horrible dancer, that does not mean that you should despise dancing.  The words of an eternal Malay proverb come to mind, “Tidak tahu menari, memarahkan lantai.” They resonate through the core of our beings and remind us:  If you dance horribly, that does not mean that you should blame the floor.  In other words, our lack of skill is no justification for our preferences, which are shown superficial if being bad at them is our grounds for casting them away.  After all, are we not like the fox, that declared the grapes sour, purely because we could not reach them?  In a way, this may be true, but a reality is that in this world, skills are not necessarily their own reward, and imagining that they are is to neglect the realities of our universe in lieu of something all too idealistic, rarefied, and divorced from both the world and the way […]