US20140156045A1 - Digital audio communication system - Google Patents

Digital audio communication system Download PDF

Info

Publication number
US20140156045A1
US20140156045A1 US13/904,484 US201313904484A US2014156045A1 US 20140156045 A1 US20140156045 A1 US 20140156045A1 US 201313904484 A US201313904484 A US 201313904484A US 2014156045 A1 US2014156045 A1 US 2014156045A1
Authority
US
United States
Prior art keywords
user
transmissions
accessing
audio
button
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US13/904,484
Inventor
Yoshinari Yoshikawa
Kaoru Zeren
Justin Mayer
Toshiaki Takada
Noriyuki Okada
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Miselu Inc
Original Assignee
Miselu Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Miselu Inc filed Critical Miselu Inc
Priority to US13/904,484 priority Critical patent/US20140156045A1/en
Priority to US13/923,084 priority patent/US20140165010A1/en
Assigned to MISELU, INC. reassignment MISELU, INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: MAYER, JUSTIN, OKADA, NORIYUKI, TAKADA, TOSHIAKI, YOSHIKAWA, YOSHINARI, ZEREN, KAORU
Publication of US20140156045A1 publication Critical patent/US20140156045A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • G06F17/3074
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/48Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H04M1/7243User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages
    • H04M1/72433User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages for voice messaging, e.g. dictaphones
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/43Querying
    • G06F16/435Filtering based on additional data, e.g. user or group profiles
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0481Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
    • G06F3/04817Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance using icons
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
    • G06F3/04842Selection of displayed objects or displayed text elements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/64Automatic arrangements for answering calls; Automatic arrangements for recording messages for absent subscribers; Arrangements for recording conversations
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72469User interfaces specially adapted for cordless or mobile telephones for operating the device by selecting functions from two or more displayed items, e.g. menus or icons

Definitions

  • Embodiments of the present invention allow users to post or publish audio information to a destination on a digital network.
  • a particular embodiment provides a user interface for recording and uploading a short comment, remark, song segment, sound effect or any other audio portion.
  • the comment can be sent directly to another user's device or can be published or uploaded to a network site, web page, user group or other location.
  • a user interface allows organizing, reviewing, editing, tagging, transferring and other types of processing or manipulation in association with the audio portion to be transferred, or which has been received.
  • text, images, geographic location or other information or content can be tagged or otherwise associated with the audio portion to provide additional options or features.
  • Lists and timelines are used to help create and organize the audio portions.
  • One user interface for a portable computing device allows a user to record an audio portion of a limited duration. A progress bar indicates the time remaining during audio recording. Once the recording has been completed the user can associate additional content with the audio portion and transfer the audio portion to a destination.
  • FIG. 1A is a first screen display of a user interface of a particular embodiment
  • FIG. 1B is a second screen display of a user interface of a particular embodiment
  • FIG. 1C is a third screen display of a user interface of a particular embodiment
  • FIG. 1D is a fourth screen display of a user interface of a particular embodiment
  • FIG. 1E is a fifth screen display of a user interface of a particular embodiment
  • FIGS. 1A-E illustrate screen displays of a user interface of a particular embodiment.
  • Particular embodiments are manufactured and/or distributed by Miselu, Inc., of Mountain View, Calif. It should be apparent that many variations on specific controls, displays, audio processing, functional steps or other inputs/outputs and steps or actions are possible and may be within the scope of the invention.
  • a particular type of device e.g., a cell phone
  • any other suitable digital processing device may be used.
  • a particular input mode may be described, such as tapping a button or sliding a control, in general any type of user input device, control, sensing or activation mechanism may be employed unless otherwise noted.
  • alternative devices may use voice activation, gesture recognition, facial recognition, three dimensional or virtual reality input or output, etc.
  • FIG. 1A illustrates a selection icon 100 that corresponds to an audio recording interface.
  • selection icon 100 can reside in an operating system environment on a device such as the cell phone pictured in FIG. 1A that includes several other icons or selections which correspond to other functionality or applications.
  • Other devices may be suitable for use with features described herein.
  • any suitable computing system such as a desktop, laptop, notebook, sub-notebook, ultra-portable, tablet or other computer; personal digital assistant (PDA), music player, camera or other type of device may be used.
  • PDA personal digital assistant
  • a dedicated hardware system may be employed that is designed primarily or exclusively for audio recording and transfer.
  • Title bar 110 indicates that the screen in FIG. 1B is the home screen for the audio recording interface.
  • “Play New” button 120 When activated, this button initiates playback of the most recent 20 unheard recordings (number is user-configurable).
  • Record button 130 is prominently displayed as a large button since it is usually of primary interest to a user after the user has selected icon 100 .
  • a list 150 of previous transmissions is shown in reverse chronological order below the record button. Each row in the list corresponds to an entry that includes an audio portion that was transferred—either sent or received by the user of the device.
  • each entry in list 150 includes image 140 of the originator of the associated audio portion corresponding with the entry.
  • Text 142 includes the originator's name, date of transmission, and location (e.g., city/state) of transmission.
  • additional information can be added as “tags” such as image tag 144 , or geo-tag 146 .
  • geo-tags such as 146 can be automatically generated by the device by using a location sensing mechanism (e.g., Global Positioning System (GPS), cell tower triangulation, WiFi/hotspot triangulation, etc.).
  • GPS Global Positioning System
  • WiFi/hotspot triangulation etc.
  • Tags such as image, geo, text or other content can be automatically or manually generated, or they may be generated by a combination of automatic and manual steps such as where the device asks the user if the user wants to allow geo-tagging where the device's location has been automatically sensed and associated with a graphical map location.
  • the audio portion corresponding with an entry can be played back by tapping in the whitespace of the entry. It should be apparent that many variations on ways to display audio portion entries are possible.
  • Navigation buttons at the bottom of the screen include Home Page button 160 , News button 162 , Search button 164 and More button 166 . These provide the user with a quick way to jump to other pages or screens that may be associated with audio portion recording and transmissions.
  • Home Page button 160 brings the user to the page shown in FIG. 1B .
  • News button 162 opens a section where users can have recent news headlines played back to them via text-to-voice synthesis.
  • Search button 164 allows the user to search entries by using keywords, tags or metadata, or other options.
  • More button 166 displays additional navigation buttons. Any number and type of navigation buttons may be provided to allow quick access to one or more of the features described herein, or to allow the user to access other functions with the device.
  • FIG. 1C the screen display of FIG. 1C is presented.
  • Record button 130 has been replaced with Stop button 200 and recording has been initiated automatically.
  • Meter bar 210 shows the audio input level while recording. This meter can indicate, for example, a signal strength being received by a microphone in the device.
  • Progress bar 220 lights up successive dots from left to right as time elapses and the recording is underway.
  • a predetermined fixed interval of 5 seconds is used as the recording interval.
  • Other embodiments can vary the time interval.
  • a restricted time interval may be useful in order to simplify the interface and to prevent long (either intentional or unintentional) recordings from being created and sent.
  • an interval of 5 seconds has been found to allow a reasonable amount of voice comment without being too restrictive.
  • Other embodiments may allow different fixed intervals in the range of 3-60 seconds. Other intervals may be used.
  • interval duration can be used in different approaches. For example, service providers, device manufacturers, site operators, application developers (e.g., email, chat, etc.) can set the interval duration. Or the user can be allowed to change the duration. In some systems the user may be charged depending upon how long a duration each audio interval is set. An administrator or someone with group privileges may set the duration and other usage restrictions if, for example, the device is used in a company.
  • service providers e.g., device manufacturers, site operators, application developers (e.g., email, chat, etc.) can set the interval duration.
  • the user can be allowed to change the duration.
  • the user may be charged depending upon how long a duration each audio interval is set.
  • An administrator or someone with group privileges may set the duration and other usage restrictions if, for example, the device is used in a company.
  • the interval can be a “soft” interval such that if a user continues to talk past the end of the interval the audio is still recorded for a small amount of time.
  • the audio can be made to be cut off completely at the end of the interval or can fade out.
  • Other approaches are possible.
  • recording has ended as a result of the interval duration being reached as shown by progress bar 250 .
  • the Stop button has changed back to Record button 230 .
  • Meter bar 240 is turned off to show that no signal is being recorded. Alternatively, the meter bar can remain enabled so the user can see the signal level to know how close or far to position the device from their mouth or other sound source in order to make a recording of suitable volume.
  • Content buttons for adding a photo or tag are shown at 260 and 270 , respectively.
  • the recording can be played back so the user can check that they are satisfied with the recording. Then the user can select Done button 280 to save the recording or a “Send To” button (not shown) to send the audio portion just recorded to the last person from whom the user's device has received an audio portion.
  • the user can select a recipient from a list such as an address book or contacts list, from the entries list in FIG. 1B , by typing in all or a portion of a person's name, or by other means. If the user wishes to re-record the audio portion then Record button 230 can be selected and the actions described above in reference to FIG. 1C can be repeated to create another audio recording to be used in place of the last one, which is discarded.
  • each recording may be saved in a history or similar list so that the user can choose from among which one of several “takes” to select for transmission.
  • FIG. 1E shows the screen display after the user has selected Add Photo button 260 of FIG. 1D .
  • image 330 has been captured by the device's camera and is shown on the screen as it will appear as an item associated with the recorded audio portion when the audio portion is sent.
  • Other ways to associate a photo or image with the audio portion are possible.
  • the user can select the image from a collection of images in the devices file system, from a network location (e.g., web site), from a different application (e.g., email, photo organizer), etc.
  • Add Tags button Other types of content or metadata can be associated with the audio portion by using the Add Tags button.
  • text keywords can be associated with the audio portion's entry.
  • a message, words, characters or other symbols can be typed or drawn and included as part of the audio portion entry.
  • top-level navigation buttons such as Home, Archive, Replies, and Friends can be provided.
  • the Home button can return the user to the home page as described above.
  • the display on the home page can include a reverse chronological list of friends' recordings.
  • An Archive button can provide a page with a reverse chronological list of the user's own recordings. If the device is turned horizontally then a timeline view can be presented showing the occurrences of the user's recordings spaced according to when the recordings were made.
  • a Replies button can provide a page that shows threads of conversations between users.
  • a Friends button can provide a page that is used to manage friend and group lists.
  • a More button can be used to display additional pages or options such as a Preferences page to set user preferences or allow configuration of buttons.
  • all lists have 20 tracks per screen by default, with pagination. This number can vary among different applications, versions, in response to user preference setting, depending upon device screen size or orientation, etc.
  • Tapping on a user's photo icon can produce a reverse chronological list of recent recordings by the user.
  • Tapping on an area to the right-hand side of the screen can show a photo if one is associated with the audio, or a default map image showing the location of a user when the user made the audio recording.
  • a “Play New” button can be included in the top navigation for all lists (e.g., lists for Latest, Archive, User). Tapping the Play New button can play all the unheard tracks in the current list sequentially in a particular order such as reverse chronological order, or chronological order. For example, if viewing a list of 20 tracks, nine of which have not yet been heard by the user, tapping the Play New button can play the nine unheard tracks in reverse chronological order. If an audio track is spoken voice, a right-facing “play” arrow icon can appear at the right of a row in the list. If it is a music track, a musical note icon can be shown instead.
  • a user can bring up a list of the latest (e.g., most recently made or most recently received) recordings.
  • Each recording can be associated with a recording user's photo icon as described above. Tapping on the photo icon brings up a list of all recordings sent to the user by the recording user who is associated with the photo icon.
  • Video capture can be provided.
  • recordings can be cached to a local file system for later automatic upload.
  • Text included as meta-data associated with a recording can be searched.
  • a text-to-speech process can allow searching of words or phrases in recordings.
  • a user can create a group, such as a group of the user's grandchildren that allows listening to all audio clips from members of the group by selecting the group or pressing a button or icon associated with the group.
  • Playback of multiple new (i.e., not listened to yet) recordings from the group members can be in reverse chronological order of receipt. Or could playback can be according to each member so that all recordings form a group member are played back first and then the next member's recordings, and so on.
  • One feature can provide a way to either notify the user that the upload didn't happen or hold the recording until the user enters an area where there is coverage and then complete the upload.
  • the upload can have a time and date stamp so that the location of the user when the recording was made can be extrapolated by estimating rate of travel with the present location at upload, present time at upload and prior time stamp of the recording.
  • a map display can be used as the basis for the user interface.
  • a user can run a finger along interested areas of the map like a theme park's location. Voices captured from the area will be replayed as the finger runs over the recorded section allowing you to get a sense of how people are feeling about that location. No need to know the identity of the people speaking on the recordings.
  • recordings can be played at volumes that are proportional to the distance the recording was recorded from the user's current position. Voices that were recorded from farther away are softer while voices that were recorded from nearer locations are louder.
  • a filter can be used to select or block recordings with different types of moods. For example, “happy,” or “excited” types of recordings can be selected or blocked.
  • the classification of such voice attributes can be by using text metadata entered by the speakers, by using human or automated classification techniques, etc.
  • FIG. 2 shows basic hardware that can be used to practice embodiments of the invention.
  • device 400 includes processor 404 coupled to display 402 , storage 406 , audio input 408 , audio output 410 and user input 412 .
  • processor 404 coupled to display 402 , storage 406 , audio input 408 , audio output 410 and user input 412 .
  • any suitable types of present or future components can be used to achieve the functionality of the subsystems shown in FIG. 2 .
  • the interconnection of these subsystems can vary as a matter of design choice.
  • subsystems may be omitted from the device. For example, if a device is only being used to record and send then audio output 410 can be omitted. Additional subsystems or components can be included in the device.
  • Device 400 is in communication with other devices 432 , 434 , 436 having similar functionality via network 420 that can be a digital network such as the Internet, a LAN or other network or communication scheme.
  • network 420 can be a digital network such as the Internet, a LAN or other network or communication scheme.
  • any type of communication system can be used such as wired, wireless, computer network, phone system, etc. It should be apparent that many variations are possible without deviating from the scope of the claimed invention.
  • routines of particular embodiments including C, C++, Java, assembly language, etc.
  • Different programming techniques can be employed such as procedural or object oriented.
  • the routines can execute on a single processing device or multiple processors in one or more same or different locations.
  • steps, operations, or computations may be presented in a specific order, this order may be changed in different particular embodiments. In some particular embodiments, multiple steps shown as sequential in this specification can be performed at the same time.
  • Particular embodiments may be implemented in a computer-readable storage medium for use by or in connection with the instruction execution system, apparatus, system, or device.
  • Particular embodiments can be implemented in the form of control logic in software or hardware or a combination of both.
  • the control logic when executed by one or more processors, may be operable to perform that which is described in particular embodiments.
  • Particular embodiments may be implemented by using a programmed general purpose digital computer, by using application specific integrated circuits, programmable logic devices, field programmable gate arrays, optical, chemical, biological, quantum or nanoengineered systems, components and mechanisms may be used.
  • the functions of particular embodiments can be achieved by any means as is known in the art.
  • Distributed, networked systems, components, and/or circuits can be used.
  • Communication, or transfer, of data may be wired, wireless, or by any other means.

Abstract

Embodiments of the present invention allow users to post or publish audio information to a destination on a digital network. A particular embodiment provides a user interface for recording and uploading a short comment, remark, song segment, sound effect or any other audio portion. The comment can be sent directly to another user's device or can be published or uploaded to a network site, web page, user group or other location. A user interface allows organizing, reviewing, editing, tagging, transferring and other types of processing or manipulation in association with the audio portion to be transferred, or which has been received.

Description

    CLAIM OF PRIORITY
  • This application is a continuation of U.S. patent application Ser. No. 12/557,445 filed on Sep. 10, 2009, entitled “DIGITAL AUDIO COMMUNICATION SYSTEM” which claims priority from U.S. Provisional Patent Application No. 61/095,755 filed on Sep. 10, 2008, entitled “DIGITAL AUDIO COMMUNICATION SYSTEM” both of which are hereby incorporated by reference in their entirety.
  • BACKGROUND
  • The immense popularity and usefulness of digital networks such as the Internet, corporate and campus local area networks (LANs), home networks, wireless networks (e.g., Bluetooth, Fire Wire 802.11x, ad hoc (computer-to-computer)), etc. has resulted in many communication benefits. Digital transmission and processing systems allow users of networks to exchange information in many forms. For example, text and images have traditionally been highly used and there are many mechanisms in use today for users to exchange text such as email, documents, text messages, blog posting, etc. Images can also be readily exchanged in the form of graphics, photographs, slides, video, etc. However, the exchange of audio information has usually focused on transferring discrete files, such as songs, lectures, video, etc., or has been the subject of real-time exchanges such as with Internet Protocol (IP) phones or other digital conversation methods.
  • SUMMARY
  • Embodiments of the present invention allow users to post or publish audio information to a destination on a digital network. A particular embodiment provides a user interface for recording and uploading a short comment, remark, song segment, sound effect or any other audio portion. The comment can be sent directly to another user's device or can be published or uploaded to a network site, web page, user group or other location. A user interface allows organizing, reviewing, editing, tagging, transferring and other types of processing or manipulation in association with the audio portion to be transferred, or which has been received.
  • For example, in one embodiment text, images, geographic location or other information or content can be tagged or otherwise associated with the audio portion to provide additional options or features. Lists and timelines are used to help create and organize the audio portions. One user interface for a portable computing device allows a user to record an audio portion of a limited duration. A progress bar indicates the time remaining during audio recording. Once the recording has been completed the user can associate additional content with the audio portion and transfer the audio portion to a destination.
  • A further understanding of the nature and the advantages of particular embodiments disclosed herein may be realized by reference to the remaining portions of the specification and the attached drawings.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1A is a first screen display of a user interface of a particular embodiment;
  • FIG. 1B is a second screen display of a user interface of a particular embodiment;
  • FIG. 1C is a third screen display of a user interface of a particular embodiment;
  • FIG. 1D is a fourth screen display of a user interface of a particular embodiment;
  • FIG. 1E is a fifth screen display of a user interface of a particular embodiment;
  • DETAILED DESCRIPTION OF EMBODIMENTS
  • FIGS. 1A-E illustrate screen displays of a user interface of a particular embodiment. Particular embodiments are manufactured and/or distributed by Miselu, Inc., of Mountain View, Calif. It should be apparent that many variations on specific controls, displays, audio processing, functional steps or other inputs/outputs and steps or actions are possible and may be within the scope of the invention. Although a particular type of device, e.g., a cell phone, is used for purposes of illustration, it should be apparent that any other suitable digital processing device may be used. Although a particular input mode may be described, such as tapping a button or sliding a control, in general any type of user input device, control, sensing or activation mechanism may be employed unless otherwise noted. For example, alternative devices may use voice activation, gesture recognition, facial recognition, three dimensional or virtual reality input or output, etc.
  • FIG. 1A illustrates a selection icon 100 that corresponds to an audio recording interface. As is known in the art, selection icon 100 can reside in an operating system environment on a device such as the cell phone pictured in FIG. 1A that includes several other icons or selections which correspond to other functionality or applications. Other devices may be suitable for use with features described herein. For example, any suitable computing system such as a desktop, laptop, notebook, sub-notebook, ultra-portable, tablet or other computer; personal digital assistant (PDA), music player, camera or other type of device may be used. In other embodiments, a dedicated hardware system may be employed that is designed primarily or exclusively for audio recording and transfer.
  • Once a user has selected icon 100, the screen shown in FIG. 1B is displayed. Title bar 110 indicates that the screen in FIG. 1B is the home screen for the audio recording interface. Within title bar 110 is “Play New” button 120. When activated, this button initiates playback of the most recent 20 unheard recordings (number is user-configurable). Record button 130 is prominently displayed as a large button since it is usually of primary interest to a user after the user has selected icon 100. A list 150 of previous transmissions is shown in reverse chronological order below the record button. Each row in the list corresponds to an entry that includes an audio portion that was transferred—either sent or received by the user of the device.
  • In a particular embodiment, each entry in list 150 includes image 140 of the originator of the associated audio portion corresponding with the entry. Text 142 includes the originator's name, date of transmission, and location (e.g., city/state) of transmission. additional information can be added as “tags” such as image tag 144, or geo-tag 146. In a particular embodiment, geo-tags such as 146 can be automatically generated by the device by using a location sensing mechanism (e.g., Global Positioning System (GPS), cell tower triangulation, WiFi/hotspot triangulation, etc.). Tags such as image, geo, text or other content can be automatically or manually generated, or they may be generated by a combination of automatic and manual steps such as where the device asks the user if the user wants to allow geo-tagging where the device's location has been automatically sensed and associated with a graphical map location. The audio portion corresponding with an entry can be played back by tapping in the whitespace of the entry. It should be apparent that many variations on ways to display audio portion entries are possible.
  • Navigation buttons at the bottom of the screen include Home Page button 160, News button 162, Search button 164 and More button 166. These provide the user with a quick way to jump to other pages or screens that may be associated with audio portion recording and transmissions. For example, Home Page button 160 brings the user to the page shown in FIG. 1B. News button 162 opens a section where users can have recent news headlines played back to them via text-to-voice synthesis. Search button 164 allows the user to search entries by using keywords, tags or metadata, or other options. More button 166 displays additional navigation buttons. Any number and type of navigation buttons may be provided to allow quick access to one or more of the features described herein, or to allow the user to access other functions with the device.
  • Assuming the user has pressed Record button 130 of FIG. 1B, the screen display of FIG. 1C is presented. In FIG. 1C, Record button 130 has been replaced with Stop button 200 and recording has been initiated automatically. Meter bar 210 shows the audio input level while recording. This meter can indicate, for example, a signal strength being received by a microphone in the device. Progress bar 220 lights up successive dots from left to right as time elapses and the recording is underway. In a particular embodiment a predetermined fixed interval of 5 seconds is used as the recording interval. Other embodiments can vary the time interval. A restricted time interval may be useful in order to simplify the interface and to prevent long (either intentional or unintentional) recordings from being created and sent. In a particular embodiment, an interval of 5 seconds has been found to allow a reasonable amount of voice comment without being too restrictive. Other embodiments may allow different fixed intervals in the range of 3-60 seconds. Other intervals may be used.
  • Yet other embodiments can use different approaches to determine interval duration. For example, service providers, device manufacturers, site operators, application developers (e.g., email, chat, etc.) can set the interval duration. Or the user can be allowed to change the duration. In some systems the user may be charged depending upon how long a duration each audio interval is set. An administrator or someone with group privileges may set the duration and other usage restrictions if, for example, the device is used in a company.
  • Other variations on setting the interval are possible. For example, the interval can be a “soft” interval such that if a user continues to talk past the end of the interval the audio is still recorded for a small amount of time. The audio can be made to be cut off completely at the end of the interval or can fade out. Other approaches are possible.
  • In FIG. 1C, as progress bar 220 proceeds to illuminate dots until it reaches the rightmost dot the user can continue speaking or recording while they are provided with a clear indication of how much time is left to record. Once the progress bar reaches the end the screen display changes to that of FIG. 1D.
  • In FIG. 1D, recording has ended as a result of the interval duration being reached as shown by progress bar 250. The Stop button has changed back to Record button 230. Meter bar 240 is turned off to show that no signal is being recorded. Alternatively, the meter bar can remain enabled so the user can see the signal level to know how close or far to position the device from their mouth or other sound source in order to make a recording of suitable volume. Content buttons for adding a photo or tag are shown at 260 and 270, respectively.
  • The recording can be played back so the user can check that they are satisfied with the recording. Then the user can select Done button 280 to save the recording or a “Send To” button (not shown) to send the audio portion just recorded to the last person from whom the user's device has received an audio portion. Alternatively the user can select a recipient from a list such as an address book or contacts list, from the entries list in FIG. 1B, by typing in all or a portion of a person's name, or by other means. If the user wishes to re-record the audio portion then Record button 230 can be selected and the actions described above in reference to FIG. 1C can be repeated to create another audio recording to be used in place of the last one, which is discarded. In other embodiments, each recording may be saved in a history or similar list so that the user can choose from among which one of several “takes” to select for transmission.
  • FIG. 1E shows the screen display after the user has selected Add Photo button 260 of FIG. 1D. In FIG. 1E, image 330 has been captured by the device's camera and is shown on the screen as it will appear as an item associated with the recorded audio portion when the audio portion is sent. Other ways to associate a photo or image with the audio portion are possible. For example, the user can select the image from a collection of images in the devices file system, from a network location (e.g., web site), from a different application (e.g., email, photo organizer), etc.
  • Other types of content or metadata can be associated with the audio portion by using the Add Tags button. For example, text keywords can be associated with the audio portion's entry. A message, words, characters or other symbols can be typed or drawn and included as part of the audio portion entry.
  • Additional features can be included. For example, top-level navigation buttons such as Home, Archive, Replies, and Friends can be provided. The Home button can return the user to the home page as described above. The display on the home page can include a reverse chronological list of friends' recordings. An Archive button can provide a page with a reverse chronological list of the user's own recordings. If the device is turned horizontally then a timeline view can be presented showing the occurrences of the user's recordings spaced according to when the recordings were made.
  • A Replies button can provide a page that shows threads of conversations between users. A Friends button can provide a page that is used to manage friend and group lists. A More button can be used to display additional pages or options such as a Preferences page to set user preferences or allow configuration of buttons.
  • In a particular embodiment, all lists have 20 tracks per screen by default, with pagination. This number can vary among different applications, versions, in response to user preference setting, depending upon device screen size or orientation, etc. Tapping on a user's photo icon can produce a reverse chronological list of recent recordings by the user. Tapping on an area to the right-hand side of the screen can show a photo if one is associated with the audio, or a default map image showing the location of a user when the user made the audio recording.
  • A “Play New” button can be included in the top navigation for all lists (e.g., lists for Latest, Archive, User). Tapping the Play New button can play all the unheard tracks in the current list sequentially in a particular order such as reverse chronological order, or chronological order. For example, if viewing a list of 20 tracks, nine of which have not yet been heard by the user, tapping the Play New button can play the nine unheard tracks in reverse chronological order. If an audio track is spoken voice, a right-facing “play” arrow icon can appear at the right of a row in the list. If it is a music track, a musical note icon can be shown instead.
  • A user can bring up a list of the latest (e.g., most recently made or most recently received) recordings. Each recording can be associated with a recording user's photo icon as described above. Tapping on the photo icon brings up a list of all recordings sent to the user by the recording user who is associated with the photo icon.
  • Other possible features include allowing a user to select a pre-existing photo instead of shooting a new one. Video capture can be provided. When connectivity is poor or unavailable, recordings can be cached to a local file system for later automatic upload. Text included as meta-data associated with a recording can be searched. A text-to-speech process can allow searching of words or phrases in recordings.
  • A user can create a group, such as a group of the user's grandchildren that allows listening to all audio clips from members of the group by selecting the group or pressing a button or icon associated with the group. Playback of multiple new (i.e., not listened to yet) recordings from the group members can be in reverse chronological order of receipt. Or could playback can be according to each member so that all recordings form a group member are played back first and then the next member's recordings, and so on.
  • Users can change their setting on how the playback happens, as some people may want to listen to the latest first for contents such as news. One feature can provide a way to either notify the user that the upload didn't happen or hold the recording until the user enters an area where there is coverage and then complete the upload. The upload can have a time and date stamp so that the location of the user when the recording was made can be extrapolated by estimating rate of travel with the present location at upload, present time at upload and prior time stamp of the recording.
  • A map display can be used as the basis for the user interface. A user can run a finger along interested areas of the map like a theme park's location. Voices captured from the area will be replayed as the finger runs over the recorded section allowing you to get a sense of how people are feeling about that location. No need to know the identity of the people speaking on the recordings. Using position location information for the playback device, recordings can be played at volumes that are proportional to the distance the recording was recorded from the user's current position. Voices that were recorded from farther away are softer while voices that were recorded from nearer locations are louder.
  • A filter can be used to select or block recordings with different types of moods. For example, “happy,” or “excited” types of recordings can be selected or blocked. The classification of such voice attributes can be by using text metadata entered by the speakers, by using human or automated classification techniques, etc.
  • FIG. 2 shows basic hardware that can be used to practice embodiments of the invention. In FIG. 2, device 400 includes processor 404 coupled to display 402, storage 406, audio input 408, audio output 410 and user input 412. In general, any suitable types of present or future components can be used to achieve the functionality of the subsystems shown in FIG. 2. The interconnection of these subsystems can vary as a matter of design choice. In some applications, subsystems may be omitted from the device. For example, if a device is only being used to record and send then audio output 410 can be omitted. Additional subsystems or components can be included in the device.
  • Device 400 is in communication with other devices 432, 434, 436 having similar functionality via network 420 that can be a digital network such as the Internet, a LAN or other network or communication scheme. In general, any type of communication system can be used such as wired, wireless, computer network, phone system, etc. It should be apparent that many variations are possible without deviating from the scope of the claimed invention.
  • Although the description has been described with respect to particular embodiments thereof, these particular embodiments are merely illustrative, and not restrictive.
  • Any suitable programming language can be used to implement the routines of particular embodiments including C, C++, Java, assembly language, etc. Different programming techniques can be employed such as procedural or object oriented. The routines can execute on a single processing device or multiple processors in one or more same or different locations. Although the steps, operations, or computations may be presented in a specific order, this order may be changed in different particular embodiments. In some particular embodiments, multiple steps shown as sequential in this specification can be performed at the same time.
  • Particular embodiments may be implemented in a computer-readable storage medium for use by or in connection with the instruction execution system, apparatus, system, or device. Particular embodiments can be implemented in the form of control logic in software or hardware or a combination of both. The control logic, when executed by one or more processors, may be operable to perform that which is described in particular embodiments.
  • Particular embodiments may be implemented by using a programmed general purpose digital computer, by using application specific integrated circuits, programmable logic devices, field programmable gate arrays, optical, chemical, biological, quantum or nanoengineered systems, components and mechanisms may be used. In general, the functions of particular embodiments can be achieved by any means as is known in the art. Distributed, networked systems, components, and/or circuits can be used. Communication, or transfer, of data may be wired, wireless, or by any other means.
  • It will also be appreciated that one or more of the elements depicted in the drawings/figures can also be implemented in a more separated or integrated manner, or even removed or rendered as inoperable in certain cases, as is useful in accordance with a particular application. It is also within the spirit and scope to implement a program or code that can be stored in a machine-readable medium to permit a computer to perform any of the methods described above.
  • As used in the description herein and throughout the claims that follow, “a”, “an”, and “the” includes plural references unless the context clearly dictates otherwise. Also, as used in the description herein and throughout the claims that follow, the meaning of “in” includes “in” and “on” unless the context clearly dictates otherwise.
  • Thus, while particular embodiments have been described herein, latitudes of modification, various changes, and substitutions are intended in the foregoing disclosures, and it will be appreciated that in some instances some features of particular embodiments will be employed without a corresponding use of other features without departing from the scope and spirit as set forth. Therefore, many modifications may be made to adapt a particular situation or material to the essential scope and spirit.

Claims (11)

1-18. (canceled)
19. A method for sorting transmissions received by an electronic device, comprising:
providing an electronic device;
receiving with the electronic device a plurality of transmissions, each transmission having metadata associated therewith, wherein the metadata for each of the plurality of transmissions includes a user's relationship with an originator of the transmission; and
accessing the plurality of transmissions in an order based on the metadata, wherein the accessing includes displaying information about at least a portion of the transmissions;
wherein the transmissions can be accessed in groups based on the metadata and wherein the groups are based on a user's relationship with an originator of the transmission.
20. A method as defined in claim 19, wherein the accessing includes displaying information about at least a portion of the transmissions in a list.
21. A method as defined in claim 19, wherein the user's relationship includes the user's family relationship.
22. A method as defined in claim 19, wherein the plurality of transmissions are each audio recordings.
23. A method as defined in claim 19, wherein the accessing includes selecting a group of the transmissions based on the metadata.
24. A method as defined in claim 23, wherein the selecting includes selecting a button or icon associated with the group.
25. A method as defined in claim 19, wherein the accessing includes accessing transmissions not yet accessed by a user of the electronic device.
26. A method as defined in claim 25, wherein the accessing of transmissions not yet accessed by a user of the electronic device includes accessing the transmissions in an order based on the time each transmission was received.
27. A method as defined in claim 25, wherein the accessing of transmissions not yet accessed by a user of the electronic device includes accessing the transmissions in a reverse chronological order.
28. A method as defined in claim 19, wherein the accessing includes accessing all the transmissions from one transmission originator, followed by accessing all the transmissions from another transmission originator.
US13/904,484 2008-09-10 2013-05-29 Digital audio communication system Abandoned US20140156045A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
US13/904,484 US20140156045A1 (en) 2008-09-10 2013-05-29 Digital audio communication system
US13/923,084 US20140165010A1 (en) 2008-09-10 2013-06-20 Digital audio communication system with improved interface

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US9575508P 2008-09-10 2008-09-10
US12/557,445 US8467402B2 (en) 2008-09-10 2009-09-10 Digital audio communication system
US13/904,484 US20140156045A1 (en) 2008-09-10 2013-05-29 Digital audio communication system

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US12/557,445 Continuation US8467402B2 (en) 2008-09-10 2009-09-10 Digital audio communication system

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US13/923,084 Continuation US20140165010A1 (en) 2008-09-10 2013-06-20 Digital audio communication system with improved interface

Publications (1)

Publication Number Publication Date
US20140156045A1 true US20140156045A1 (en) 2014-06-05

Family

ID=41799173

Family Applications (3)

Application Number Title Priority Date Filing Date
US12/557,445 Expired - Fee Related US8467402B2 (en) 2008-09-10 2009-09-10 Digital audio communication system
US13/904,484 Abandoned US20140156045A1 (en) 2008-09-10 2013-05-29 Digital audio communication system
US13/923,084 Abandoned US20140165010A1 (en) 2008-09-10 2013-06-20 Digital audio communication system with improved interface

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US12/557,445 Expired - Fee Related US8467402B2 (en) 2008-09-10 2009-09-10 Digital audio communication system

Family Applications After (1)

Application Number Title Priority Date Filing Date
US13/923,084 Abandoned US20140165010A1 (en) 2008-09-10 2013-06-20 Digital audio communication system with improved interface

Country Status (1)

Country Link
US (3) US8467402B2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110072129A (en) * 2018-01-22 2019-07-30 上海鹰信智能技术有限公司 Based on vehicle-mounted interconnection push, save the method for playing record, a kind of playback method

Families Citing this family (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9361295B1 (en) 2006-11-16 2016-06-07 Christopher C. Andrews Apparatus, method and graphical user interface for providing a sound link for combining, publishing and accessing websites and audio files on the internet
US10296561B2 (en) 2006-11-16 2019-05-21 James Andrews Apparatus, method and graphical user interface for providing a sound link for combining, publishing and accessing websites and audio files on the internet
US10270831B2 (en) 2011-04-04 2019-04-23 Soundlink, Inc. Automated system for combining and publishing network-based audio programming
US9524651B2 (en) * 2011-07-25 2016-12-20 Raymond Fix System and method for electronic communication using a voiceover in combination with user interaction events on a selected background
US20130033971A1 (en) 2011-08-05 2013-02-07 Jeffrey Stier System and Method for Managing and Distributing Audio Recordings
KR101939253B1 (en) * 2012-05-21 2019-01-16 엘지전자 주식회사 Method and electronic device for easy search during voice record
WO2013183811A1 (en) 2012-06-08 2013-12-12 Lg Electronics Inc. Portable device and method for controlling the same
US9674587B2 (en) 2012-06-26 2017-06-06 Sonos, Inc. Systems and methods for networked music playback including remote add to queue
CN102811182A (en) * 2012-08-10 2012-12-05 上海量明科技发展有限公司 Method, client and system for playing audio message in instant messaging
US9361371B2 (en) 2013-04-16 2016-06-07 Sonos, Inc. Playlist update in a media playback system
US9247363B2 (en) 2013-04-16 2016-01-26 Sonos, Inc. Playback queue transfer in a media playback system
USD766253S1 (en) * 2013-09-25 2016-09-13 Google Inc. Display panel or portion thereof with a graphical user interface component
JP5959771B2 (en) * 2014-06-27 2016-08-02 株式会社東芝 Electronic device, method and program
CN104410952B (en) * 2014-10-30 2018-06-19 北京蚂蜂窝网络科技有限公司 A kind of system for obtaining user's area-of-interest
CN105337845A (en) * 2015-10-30 2016-02-17 努比亚技术有限公司 Voice commenting server and method
WO2018049606A1 (en) * 2016-09-14 2018-03-22 深圳市大疆创新科技有限公司 Control method, control device, and electronic device
CN107957842B (en) * 2016-10-18 2020-05-22 腾讯科技(深圳)有限公司 User generated content display method and terminal equipment
WO2019037135A1 (en) * 2017-08-25 2019-02-28 腾讯科技(深圳)有限公司 Picture file management method and terminal, and computer storage medium
CN108965706B (en) * 2018-07-19 2020-07-07 北京微播视界科技有限公司 Video shooting method and device, terminal equipment and storage medium
CN109063082B (en) * 2018-07-25 2021-02-09 珠海格力电器股份有限公司 Page skipping method and terminal equipment
US11269590B2 (en) * 2019-06-10 2022-03-08 Microsoft Technology Licensing, Llc Audio presentation of conversation threads
CN111222060B (en) * 2019-12-31 2023-09-01 维沃移动通信有限公司 Information management method, electronic device and medium
US11595592B2 (en) * 2020-09-15 2023-02-28 Snap Inc. Recorded sound thumbnail

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030128820A1 (en) * 1999-12-08 2003-07-10 Julia Hirschberg System and method for gisting, browsing and searching voicemail using automatic speech recognition
US20030177008A1 (en) * 2002-03-15 2003-09-18 Chang Eric I-Chao Voice message processing system and method
US20070121651A1 (en) * 2005-11-30 2007-05-31 Qwest Communications International Inc. Network-based format conversion
US20080304808A1 (en) * 2007-06-05 2008-12-11 Newell Catherine D Automatic story creation using semantic classifiers for digital assets and associated metadata
US7995720B2 (en) * 2007-02-16 2011-08-09 At&T Intellectual Property I, L.P. Methods, systems, and products for notifications

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2330981B (en) * 1997-10-31 2002-07-03 Nokia Mobile Phones Ltd A radiotelephone handset
GB2330982B (en) * 1997-10-31 2002-02-06 Nokia Mobile Phones Ltd A radiotelephone handset
US7035666B2 (en) * 1999-06-09 2006-04-25 Shimon Silberfening Combination cellular telephone, sound storage device, and email communication device
US6621508B1 (en) * 2000-01-18 2003-09-16 Seiko Epson Corporation Information processing system
US6522347B1 (en) * 2000-01-18 2003-02-18 Seiko Epson Corporation Display apparatus, portable information processing apparatus, information recording medium, and electronic apparatus
GB2409365B (en) * 2003-12-19 2009-07-08 Nokia Corp Image handling
JP4957945B2 (en) * 2005-12-28 2012-06-20 ソニー株式会社 Information processing apparatus, information processing method, program, and recording medium
JP4887779B2 (en) * 2005-12-28 2012-02-29 ソニー株式会社 Information processing apparatus, information processing method, program, and recording medium
JP4844365B2 (en) * 2005-12-28 2011-12-28 ソニー株式会社 Information communication terminal, information communication method, recording medium, and information communication system
US8503625B2 (en) * 2006-06-01 2013-08-06 Microsoft Corporation Managing packet-based voicemail messages
US9106447B2 (en) * 2008-01-03 2015-08-11 Apple Inc. Systems, methods and apparatus for providing unread message alerts
US9342231B2 (en) * 2008-12-29 2016-05-17 Apple Inc. Remote control of a presentation
US8229411B2 (en) * 2008-12-30 2012-07-24 Verizon Patent And Licensing Inc. Graphical user interface for mobile device

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030128820A1 (en) * 1999-12-08 2003-07-10 Julia Hirschberg System and method for gisting, browsing and searching voicemail using automatic speech recognition
US20030177008A1 (en) * 2002-03-15 2003-09-18 Chang Eric I-Chao Voice message processing system and method
US20070121651A1 (en) * 2005-11-30 2007-05-31 Qwest Communications International Inc. Network-based format conversion
US7995720B2 (en) * 2007-02-16 2011-08-09 At&T Intellectual Property I, L.P. Methods, systems, and products for notifications
US20080304808A1 (en) * 2007-06-05 2008-12-11 Newell Catherine D Automatic story creation using semantic classifiers for digital assets and associated metadata

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110072129A (en) * 2018-01-22 2019-07-30 上海鹰信智能技术有限公司 Based on vehicle-mounted interconnection push, save the method for playing record, a kind of playback method

Also Published As

Publication number Publication date
US8467402B2 (en) 2013-06-18
US20140165010A1 (en) 2014-06-12
US20100061197A1 (en) 2010-03-11

Similar Documents

Publication Publication Date Title
US8467402B2 (en) Digital audio communication system
US9218110B2 (en) Information processing apparatus, information processing method, information processing program and recording medium for storing the program
US8229405B2 (en) Communication terminals, systems, methods, and computer program products for publishing, sharing and accessing media files
CN102483917B (en) For the order of display text
CN101415044B (en) Mobile terminal and method of displaying information therein
RU2490821C2 (en) Portable communication device and method for media-enhanced messaging
US9251506B2 (en) User interfaces for content categorization and retrieval
US20110039598A1 (en) Methods and devices for adding sound annotation to picture and for highlighting on photos and mobile terminal including the devices
US20070245006A1 (en) Apparatus, method and computer program product to provide ad hoc message recipient lists
WO2011084830A2 (en) Scenario-based content organization and retrieval
KR20060048794A (en) System and method to associate content types in a portable communication device
CN103823677A (en) Routing user data entries to applications
JP2018032912A (en) Information processing apparatus, information processing method, information processing program, and information processing system
KR101123370B1 (en) service method and apparatus for object-based contents for portable device
KR20140013253A (en) Contents searching system and method based on a cloud service, and portable device supporting the same
US20130204414A1 (en) Digital audio communication system
JP7254842B2 (en) A method, system, and computer-readable recording medium for creating notes for audio files through interaction between an app and a website
US20230350549A1 (en) Automatic incident compilation system
KR102427213B1 (en) Method, system, and computer readable record medium to manage together text conversion record and memo for audio file
KR102165339B1 (en) Method and apparatus for playing contents in electronic device
JP2023125031A (en) video creation program
JP2023125038A (en) video creation program
JP2023125030A (en) video creation program
JP2023125039A (en) video creation program
KR20200119761A (en) Method and apparatus for playing contents in electronic device

Legal Events

Date Code Title Description
AS Assignment

Owner name: MISELU, INC., CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:YOSHIKAWA, YOSHINARI;ZEREN, KAORU;MAYER, JUSTIN;AND OTHERS;SIGNING DATES FROM 20090909 TO 20090910;REEL/FRAME:030697/0069

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO PAY ISSUE FEE