Monday 17 November 2008

Total Voice Recognition

I have firmly chosen my idea, it is to be the Total Voice Recognition.

Recent technologies in the area of voice activated computers are much better than they have been. In talk-to-type systems like 'Dragon Naturally Speaking 10' there is a much higher success rate than before, with now just 1% of words being mis-heard. There is also a limited increase in the Artificial Intelligence of the software; you can give instructions such as "bold that" or "left align that" which can tell the difference between dictation and instruction based on tone of voice.

There is also software available for Mac called Dictate which goes the extra step and is used across the operating system, to open software or even select tools in Photoshop. It is combined with the computer's own speech software, which means you can ask the time and be told by the machine, without using anything but a microphone.

My idea has been slightly covered already, and so I must think of the next stage in this speech control evolution. Multi-Touch systems combined with Total Voice Recognition would alleviate the need for a mouse or keyboard altogether, but it could go further and limit the number of software interfaces needed. The biggest and most obvious example would be the internet: If you need a quick piece of information such as a train time you could simply ask the computer, who would understand the question, search the internet and respond as quickly as possible. You could set up a system of favourites (e.g. railenquiries.com) so the search, and therefore the response you are after are instant.
You: "Computer, When is the next train from Brighton to Victoria?"
Computer: "The next train from Brighton to Victoria is at 3.40"
You: "Thanks"

The result of this service would mean individual websites of data could be avoided, rather people would get all of their information via one source. Potentially, if the software was intelligent enough, blind internet users could get information as quickly and easily as everyone else.

The progression from there would be that you have a separate machine, a computer with no display, that works only on a speech input and speech output system, searching the internet for information and relaying it back to you. It would have to work 100% of the time, or else not be worth having, so the intelligence of the searches would have to be outstanding. This is currently not likely to happen, but is certainly possible in the future.

Dissertation

This morning we ran through initial ideas for our dissertations. I hadn't thought about it until asked this morning, but I had two thoughts, both of which might be problematic: I thought maybe I'd write about mathematics, especially phi in design of all kinds, but this idea has been taken by somebody else, which could prove problematic. I also considered writing about TV, especially American TV series such as HBO's The Wire, The Sopranos and Deadwood. There are loads of ideas I have on this subject, but the problem is that I wrote an essay on this topic in the first year of the course, and I might plagiarise myself.

This means I either have to come up with a new question and set of research, or think of a new topic altogether.

Tuesday 11 November 2008

I haven't updated for a while, but here are my most recent activities.

There were 6 ideas:

1. Social TV - Chat whilst flicking through channels in case you find something good. See what your friends are watching.

2. Digital Odours - Smells can be captured by a device, then emailed and reproduced by others. Could also be used in cinema, TV or consoles to aid immesion in the media.

3. Total Voice Recognition - Operate the entire computer with muti-touch and voice, or voice alone.

4. 4D imagery - Impossible for humans to see, a computer will be able to generate 4 dimensional imagery.

5. Future Internet Dating - Virtual reality dates, using face-mapping to ease cyber-couples into real life meetings.

6. Retro Websites - A future movement by those tired of web 2.0's domination, bringing back cloud wallpapers, frames and animated gifs.


Of these 6, I have put the most thought into the Digital Odours and the Total Voice Recognition. I currently want to go with Total Voice Recognition as Digital Odours have already been launched and failed (see Digiscents iSmell and Smell-O-Vision). I also see Total Voice Recognition as both the most interesting and most likely of all my ideas.

I need to think about the AI that would be essential in getting the technology to work. This means I need to look at the way people type or write, and how that is different from how they speak, and how a system could work that speeds up rather than slows down the process.

I should not get caught up in the technical/electronic side of things, as this technlogy is currently unavailable and I am not an inventor. That being said, I have thought of a solution to the technical problem of many people in an office/class talking at once: Use vocoder technology, like in the Talk Box, which requires you only to mouth the words rather than speak them aloud.

It can also be used to create funky sounds.
http://uk.youtube.com/watch?v=RQfZRRRo_8A&feature=related

Monday 27 October 2008

Dystopia

In this project there is much potential to look at past visions of the future, and the often satirical or ridiculous inventions and ways of life portrayed in novels and films. The dystopian futures shown in many of these pieces are often reflections on the time that they were made or written, and often not genuine predictions. That being said, aspects of these fictional works have come true long after they were thought up.

The most obvious example of a dystopian future coming true is 1984. George Orwell's 1948 novel had grim visions of the mid eighties, most prevalently the idea of 'Big Brother'. This idea, at the time just speculation, was essentially that of modern day CCTV - the thought of constantly being watched. This was combined with other concepts made famous by socialist authors after the First World War, novels such as Aldous Huxley's 'Brave New World' and Yevgeny Zamyatin's 'We' use the concepts of '2 + 2 = 5', and people being numbered and entered into giant government databases. All of these concepts are beginning to come into play, with the idea of ID cards in the UK being the latest step towards a 1984 for this century.

One film that's imagery and ideas have survived into the future they were predicting is Terry Gilliam's 1985 comedy Brazil. Apparently Brazil was once under the working title of 'Monty Python's 1984' and you can see why. As well as the pollution, Brazil makes satire of beaurocracy, with the single nationwide corporation 'Ministry of Information'. There is also a big emphasis on terrorist bombings, which have been in the news every day since 2001.

With these dystopian works of fiction being some of the best predictions of the future, perhaps for my Design Futures I should look at what we would not like to see, or if all the things we don't like at the moment being continued for 10 years. That way I might actually make an accurate prediction.

Sunday 26 October 2008

PC World - The 25 Worst Tech Products of All Time

http://www.pcworld.com/article/125772/the_25_worst_tech_products_of_all_time.html

The idea of looking at this article was to see what makes a bad media-related invention. It emerged during a search on the subject of iSmell (which finished 24th), but includes software as well as physical creations. There are some features which are apparent across the list; most fundamentally is the concept itself, some of which are quite ridiculous. Priceline.com were selling cheap petrol online, it would arrive a few days after ordering in the form of a card which could only then be used in a very select few petrol stations. Another reason for many of the things being on this list is that they were badly made and would often break, which is not the sign of a bad idea, but of bad manufacturing and is not really to be considered when thinking about Design Futures.

There are two more reasons for concepts making this unfortunate quarter-century, both of which are through the tech being released too early. In the case of some software, such as (no. 4) Disney's The Lion King CD-ROM for Windows, there was a requirement for a cutting edge graphics card that was mostly unobtainable for everyday Windows users. Hardware technologies have also failed through being released too soon, like in the case of iSmell, which didn't catch on because the technology was ahead of it's time, and cost too much for the consumer. However, it is possible that people also just don't want to smell websites. I think if angled correctly, marketed well and made both available and cheap, the idea of digital smells could catch on.

Tuesday 21 October 2008

Digital Futures: iSmell

It has always been the case that people consuming media want to be immersed as much as possible in whatever form of entertainment they are using. In film, the addition of sound and colour made going to the cinema a much richer experience. These standards have improved with surround sound and sharper quality images, but there are still fundamental improvements that could be made. In 1960, Hans Laube's Smell-o-vision was the first attempt at including a third sense in the viewing experience, it didn't catch on. There was a much later attempt in 2001 called iSmell which used receptors and the equivilent of printers mixing 128 'Primary odours' to transform smells into data and then back into synthetic smells. It is not apparent why this technology has not progressed, perhaps it has limited appeal and the hardware is too expensive for domestic use. If it were still in play these days, perhaps users of Facebook might have their smell as well as their image available to download.

Monday 13 October 2008

Lev Manovich - The paradoxes of digital photography

PUBLISHED IN:
Photography After Photography.Exhibition catalog.
Germany, 1995.
Available http://www.manovich.net/TEXT/digital_photo.html
--------------------------------------------------------------------
Lev Manovich
THE PARADOXES OF DIGITAL PHOTOGRAPHY

In this text, written in 1994, Manovich discusses the differences between digital and analogue photography. He asks whether digital photography and film making are more or less 'real' than their analogue predecessors, as new methods enable easier manipulation and editing of image.
Much of the essay is about the then emerging technology of digital photography; most of his references (Jurassic Park is the height of CGI at the time of writing) and some points have been made obsolete by developments in both photographic software and hardware since 1994, but the theories are mostly still relevant, and the questions unanswered.

There are some sections of the essay that I found more interesting and relevant than others. Manovich's views on realism, and the ideas of 'photorealism' and 'hyperrealism' are still in play even today. Photorealism is the idea that 3D or digital imagery is aiming not to replicate real objects, but photos of those objects. Hyperrealism is when imagery is too clear, clean or even realistic to look like something the eyes are used to. Techniques to combat hyperrealism have been put in place, include 'faking' depth of field by adding pseudo focus, and adding motion blur to 3D animation and CGI to simulate film footage.

The idea of digital imagery's aspiration of realism or photorealism is, however, untrue in more recent entirely 3D animations such as Finding Nemo. As the 3D images are not on screen with film footage, the aim is no longer realism or photorealism, and the imagery becomes cartoon-like. That being said, conventions of film such as camera movement are unnecessarily in place, perhaps to justify the medium to film traditionalists.

The points made by Manovich seem very accurate of today considering it was written 14 years ago. I can perhaps use some of his theories to help predict what will happen next. Instead of thinking of gaps in the market, think of how technology has developed, and why society accept some advances and discard others. This could for practical reasons or the principles behind the new concepts. Maybe, rather than making things smaller, faster or combining technologies, look at aspects such as broader appeal, widespread availability, or the social factors within media.