Law in the Internet Society

LawNetSoc » FirstEssay » GabrielaFloresRomoFirstEssay

Predictive Text-- Convenience or Catastrophe?

It is strongly recommended that you include your outline in the body of your essay by using the outline as section titles. The headings below are there to remind you how section and subsection titles are formatted.

Predictive Text-- Convenience or Catastrophe?

-- -- By GabrielaFloresRomo - 29 Dec 2021

How did Google Know?

In a minute, my final draft was gone. After calling every technology support available, receiving an extension, and re-doing an entire issue for my brief, I began solely working on Google Docs (“GD”). GD’s autosave and history browsing seemed like lifesavers, until a new designed-in feature piqued my interest, both in Gmail and GD.

My first thought when I noticed the auto-complete feature on Google’s Gmail and GD was, “how did they know what I was going to say?” Google’s “Smart Compose” is a neural-network model—machine learning model in which a computer learns to perform a task by analyzing training examples—that classifies and predicts sequences of natural language. Something invisible to untrained eyes was now put at the centerstage for the sake of convenience. Whether I was taking class notes or drafting a personal email, Google apparently “read my thoughts.”

Smart Compose-- Is it Inherently Smart?

GD’s Smart Compose “assists with composing new messages from scratch and provides much richer and more diverse suggestions along the way,” and it is trained by email data, which includes machine-collecting data from email chains, email subjects, date and time of emails, and location of the user. While Google claims individuals cannot individually read these emails in order to protect privacy, this data compilation must be stored to determine individuals’ often repeated written sequences of words. Thus, while humans may not be actively reading personal information, a database tracking timestamps, destinations, and from whom and attaching strings of words to certain individuals is likely readily available.

Regardless of these assurances, however, “even if sensitive or private training-data text is very rare [given the omission of some text and focus on commonly stringed words], one should assume that well-trained models have paid attention to its precise details.” Since these systems store information based on repeated phrases, individuals may mine for private details by using sequencing of auto-fill to determine private information. Since Google’s Smart Compose uses location, although tedious, it is not unfathomable that two competing individuals in the same proximity may attempt to steal information from each other in this manner. Additionally, given that some companies’ sole purpose is to buy and sell individuals’ information, they may also mine for private information, given their interest in attaining and selling as much information as possible. Thus, private information can be made public not only to one’s surprise but to the benefit of others as well. The unavoidable consequences of using products such as GD is that what’s stored in the cloud remains available for use by others and companies with nefarious consequences to the consumer.

Since humans had to develop this Smart Compose system based on unattainable data, biases also likely promulgate the system. Suggestions of what comes next, thus, are more likely to adhere to developers’ vernaculars. One way Google has combated this is by removing gender-based pronouns so that it reduces the risk of incorrectly suggesting one’s gender identity and suggesting biased autofill phrases. Although this reduces a variable, it is not enough. While Google claims that its language model adapts to a user’s personal mail data, thus personalizing text, biases may persist based on gathered information as well. For example, cultural group associations, geographical location, and socioeconomic information may suggest preferences based on string of words associated with and linking individuals from different areas. As use becomes more prevalent, groupthink on what is correct may become more pervasive between groups, especially amongst those who adhere to groups that have historically had power. This could further ignite ongoing debates about English, and language’s social and political implications. The implications of predictive words surpass convenience; they include biases and promote further divisions in language.

Thus, the solution is to turn to non-cloud, alternative software systems with autosave capability and crash protection. For example, LibreOffice? —a free open-source software in which users themselves are contributors who assist in and create the developments, designs, infrastructure, and marketing, amongst other features— is an alternative to Microsoft Word and GD, and it is compatible with different document formats to use in any given office space. It offers different extensions, such as AddPics, which creates a text document from pictures of scanned pages, and Angry Reviewer, which creates academic style corrections for papers. Although LibreOffice? uses AutoComplete? as an accessibility feature,, it compiles words you use via word completion, removing biases prevalent in GD and which may also be removed. Moreover, LibreOffice? has a document recovery system that recovers crashed or unsaved documents via a backup and temporary file document restoration compilation. With free and also non-free options available, the goal should now be to further disseminate information about these available systems.

Problems and Response

These Smart Compose concerns are not limited to Google, however; Apple and Samsung, among others, use predictive text software. While Google does allow its users to turn off the Smart Compose feature so that they can no longer receive predictive text suggestions, it is uncertain whether this simply limits individuals from seeing the predictive text or whether individuals have fully opted out of the system. This begs the question of whether Smart Compose can still access individual’s information regardless of the opt-out feature.

While a first step is to turn off Smart Compose software on devices, the goal should be to work on systems that either fully allow you to disengage from these monitoring and scanning technologies or switch to software that already does not actively use any of these systems, such as LibreOffice? . While counterintuitive for our purposes, given the popularity of social media nowadays, a possible move would be to use social media to one’s advantage to disseminate information regarding alternative systems. By creating a business profile, one could take advantage of social platform business tools, such as product teasers, marketing tools, sponsored advertisements, stories, hashtags, and sharing testimonies and tutorials, to widely share information of these systems using non-traceable devices. LibreOffice? , for example, uses a Twitter account; however, partnering with pages, information can be more swiftly circulated. Thus, the idea that one could access word processing systems without surveillance and third-party access, and with functions that are comparable, and even better, than Microsoft and GD, could be targeted to current generations that are already wary of third-party monitoring.

You are entitled to restrict access to your paper if you want to. But we all derive immense benefit from reading one another's work, and I hope you won't feel the need unless the subject matter is personal and its disclosure would be harmful or undesirable. To restrict access to your paper simply delete the "#" character on the next two lines:

#Set ALLOWTOPICVIEW = TWikiAdminGroup, GabrielaFloresRomo
#Set DENYTOPICVIEW = TWikiGuest

Note: TWiki has strict formatting rules for preference declarations. Make sure you preserve the three spaces, asterisk, and extra space at the beginning of these lines. If you wish to give access to any other users simply add them to the comma separated ALLOWTOPICVIEW list.

Navigation

LawNetSoc LawNetSoc

Webs Webs

r3 - 29 Dec 2021 - 19:47:08 - GabrielaFloresRomo

This site is powered by the TWiki collaboration platform.
All material on this collaboration platform is the property of the contributing authors.
All material marked as authored by Eben Moglen is available under the license terms CC-BY-SA version 4.

Syndicate this site RSS ATOM