Update: Twitter has now revealed its schedule for rolling out the new measures, ranging from a new definition of non-consensual nudity this month to ‘bystander reporting’ in January.

Twitter has responded to Friday’s #WomenBoycottTwitter protest by revealing new steps it will be taking to tackle abusive posts on the platform.

The protest, in which many women refrained from posting on the social network for 24 hours, led to an 8-tweet response from CEO Jack Dorsey in which he said that he recognized the company was not doing enough, and promised new rules in five areas …

NordVPN

New rules around: unwanted sexual advances, non-consensual nudity, hate symbols, violent groups, and tweets that glorifies violence.

While Twitter has not officially made public details of these rules, Wired obtained an email sent by Twitter’s head of safety policy to members of its Trust and Safety Council in which the new rules were listed.

The email lists both current and upcoming rules, showing that the company is taking a more proactive approach. For example, in the area of unwanted sexual advances, Twitter is switching from awaiting reports from those on the receiving end to a combination of ‘bystander reporting’ and algorithms.

We are going to update the Twitter Rules to make it clear that this type of behavior is unacceptable. We will continue taking enforcement action when we receive a report from someone directly involved in the conversation. Once our improvements to bystander reporting go live, we will also leverage past interaction signals (eg things like block, mute, etc) to help determine whether something may be unwanted and action the content accordingly.

Twitter currently responds to non-consensual nudes with a two-strike policy, the first strike resulting in the tweet being deleted and a temporary ban. Under the new rules, this will move to a one-strike policy, where the account is immediately suspended.

Hate symbols will be treated as sensitive content and initially hidden. Enforcement action will be taken against groups which ‘use/have historically used violence as a means to advance their cause.’ Tweets that glorify violence, praising or condoning it, will also be subject to action, rather than just those making direct threats.

The full email can be read below.

Photo: Chris Ratcliffe/Bloomberg

Dear Trust & Safety Council members,

I’d like to follow up on Jack’s Friday night Tweetstorm about upcoming policy and enforcement changes. Some of these have already been discussed with you via previous conversations about the Twitter Rules update. Others are the result of internal conversations that we had throughout last week.

Here’s some more information about the policies Jack mentioned as well as a few other updates that we’ll be rolling out in the weeks ahead.

Non-consensual nudity

  • Current approach *We treat people who are the original, malicious posters of non-consensual nudity the same as we do people who may unknowingly Tweet the content. In both instances, people are required to delete the Tweet(s) in question and are temporarily locked out of their accounts. They are permanently suspended if they post non-consensual nudity again.
  • Updated approach *We will immediately and permanently suspend any account we identify as the original poster/source of non-consensual nudity and/or if a user makes it clear they are intentionally posting said content to harass their target. We will do a full account review whenever we receive a Tweet-level report about non-consensual nudity. If the account appears to be dedicated to posting non-consensual nudity then we will suspend the entire account immediately.

*Our definition of “non-consensual nudity” is expanding to more broadly include content like upskirt imagery, “creep shots,” and hidden camera content. Given that people appearing in this content often do not know the material exists, we will not require a report from a target in order to remove it.

*While we recognize there’s an entire genre of pornography dedicated to this type of content, it’s nearly impossible for us to distinguish when this content may/may not have been produced and distributed consensually. We would rather error on the side of protecting victims and removing this type of content when we become aware of it.

Unwanted sexual advances

  • Current approach *Pornographic content is generally permitted on Twitter, and it’s challenging to know whether or not sexually charged conversations and/or the exchange of sexual media may be wanted. To help infer whether or not a conversation is consensual, we currently rely on and take enforcement action only if/when we receive a report from a participant in the conversation.
  • Updated approach *We are going to update the Twitter Rules to make it clear that this type of behavior is unacceptable. We will continue taking enforcement action when we receive a report from someone directly involved in the conversation. Once our improvements to bystander reporting go live, we will also leverage past interaction signals (eg things like block, mute, etc) to help determine whether something may be unwanted and action the content accordingly.

Hate symbols and imagery (new)*We are still defining the exact scope of what will be covered by this policy. At a high level, hateful imagery, hate symbols, etc will now be considered sensitive media (similar to how we handle and enforce adult content and graphic violence). More details to come.

Violent groups (new)*We are still defining the exact scope of what will be covered by this policy. At a high level, we will take enforcement action against organizations that use/have historically used violence as a means to advance their cause. More details to come here as well (including insight into the factors we will consider to identify such groups).

Tweets that glorify violence (new)*We already take enforcement action against direct violent threats (“I’m going to kill you”), vague violent threats (“Someone should kill you”) and wishes/hopes of serious physical harm, death, or disease (“I hope someone kills you”). Moving forward, we will also take action against content that glorifies (“Praise be to for shooting up. He’s a hero!”) and/or condones (“Murdering makes sense. That way they won’t be a drain on social services”). More details to come.

We realize that a more aggressive policy and enforcement approach will result in the removal of more content from our service. We are comfortable making this decision, assuming that we will only be removing abusive content that violates our Rules. To help ensure this is the case, our product and operational teams will be investing heavily in improving our appeals process and turnaround times for their reviews.

In addition to launching new policies, updating enforcement processes and improving our appeals process, we have to do a better job explaining our policies and setting expectations for acceptable behavior on our service. In the coming weeks, we will be:

  • updating the Twitter Rules as we previously discussed (+ adding in these new policies)
  • updating the Twitter media policy to explain what we consider to be adult content, graphic violence, and hate symbols.
  • launching a standalone Help Center page to explain the factors we consider when making enforcement decisions and describe our range of enforcement options launching new policy-specific Help Center pages to describe each policy in greater detail, provide examples of what crosses the line, and set expectations for enforcement consequences
  • Updating outbound language to people who violate our policies (what we say when accounts are locked, suspended, appealed, etc).

We have a lot of work ahead of us and will definitely be turning to you all for guidance in the weeks ahead. We will do our best to keep you looped in on our progress.

All the best,

Head of Safety Policy


Check out 9to5Mac on YouTube for more Apple news:

About the Author

Ben Lovejoy's favorite gear