Intro to Whitebox Pentesting

Code Review - Authentication

Now that we understand what whitebox penetration testing is and the process we will follow, the remainder of this module will be used to demonstrate a practical example of a whitebox pentesting exercise. We will go through each step and apply what we discussed in the previous sections.

We will discuss a case of advanced code injection, which requires a whitebox pentest to identify and exploit the vulnerability properly. The specific vulnerability we will discuss would only be exploitable with access to the source code due to specific exploitation requirements that would not be evident without direct access to the source code, as is often the case with many other vulnerabilities.

Finally, for the sake of simplicity, we will not 'yet' be reviewing a large code base, as this would make the practical example very long, but other modules will cover larger code bases. Instead, we will focus on a particular functionality within the code, and the provided code base would only contain that functionality and other necessary functions for it to work. As previously discussed, a whitebox pentest is often only requested for a specific functionality instead of the entire code base, especially if whitebox pentest exercises were incorporated in the DevOps cycle. In such cases, we would test each new functionality rather than the whole code base.

With that said, let's get into reviewing the code.

Note: The module requires the installation of VSCode and node.js on your machine, which you can do by clicking on the previous links. If you prefer using PwnBox, then both tools are pre-installed there.

Data Gathering

As discussed in the code review section, the data-gathering phase usually consists of meetings to set the scope of the test and provide the code base and any available documentation for it. In this module, we will assume that we were given the code base in an archive without further details, which is the minimum requirement for any whitebox pentest.

We can start by downloading the archive found at the end of this section, extracting its content, and then opening it in VSCode, using File>Open Folder in VSCode or the following command:

[!bash!]$ code ./intro_to_whitebox_pentesting

Note: With PwnBox, you should use the codium command instead of code.

As we can see, the code base hierarchy is quite simple, consisting of an entry file (app.js) and a couple of other directories. So, let's look at the code to understand better how it works.

app.js

The app.js file starts by setting up an express server, setting up a JSON body parser, and then setting up the main API routes:

// set up express
const app = express();
const port = parseInt("5000");

// set up body parser and cors
app.use(bodyParser.json());

// set up API routes
app.use("/api/auth", authRoutes);
app.use("/api/service", serviceRoutes);

This is a basic express server setup for a node.js API backend. The rest of the file sets up 404 route handling and exception handling and ends by starting the express server:

// start the Express server
app.listen(port, () => {
  console.log(`⚡️[server]: Server is running at http://localhost:${port}`);
  console.log(`⚡️[api]: APIs are running at http://localhost:${port}/api`);
});

The only interesting bit from this file is the API routes, as the rest are basic express settings. So, let's take a look at those routes.

Authentication

In VSCode, we can hold CMD/CTRL and click on authRoutes, which will take us to the file containing these routes. The routes/auth-routes.js file simply consists of a single API endpoint with getUserToken:

const router = express.Router();

router.post("/authenticate", getUserToken);

module.exports = router;

The /authenticate endpoint requires a POST request and is used under /api/auth. To better look at this function, we can once again click on getUserToken to open it in a new file, and we will get the auth-controller.js file under the controllers/ directory. This file contains the following three functions:

validateEmail
getUserToken
verifyToken

function validateEmail(email) {
  return String(email)
    .toLowerCase()
    .match(/^...SIP...$/);
}

The validateEmail function appears to be local to this file since it is not exported at the end of the file. Taking a look at it, it seems a basic function that validates a string against a regular expression pattern to ensure it matches an email format.

getUserToken

Getting back to getUserToken, we see that it starts by obtaining the email parameter from req.body, which is the POST request body. We know from the bodyparser we saw previously that all endpoints expect a JSON body, so we should keep that in mind.

After that, the function validates the email format using the above validateEmail function, as denoted by a comment in the code. Such comments are always helpful to make it easier to understand the code. But what if the code didn't have any comments? In that case, we must rely on our coding knowledge to understand the functionality.

While we would be expected to have deep knowledge of the language we are reviewing in a' secure coding' exercise, the same is not a requirement for whitebox pentesting. This is because we would probably be testing various code bases in multiple languages, and we can't be expected to be experts in all languages, unlike secure coding, where we would usually be sticking to a single code base for an extended period.

This is why the primary skill we require for whitebox pentesting is the ability to understand the general purpose of the code, which should enable us to determine whether the code is vulnerable.

If we continue with the function, we will see that comments do not denote the next part. A quick look at it shows that it appears to be signing a jwt token that contains two keys:

email "from our input"
role "determined by email"

After that, the endpoint returns the signed jwt token. In case we were not sure of our understanding, we can ask AI to tell us what the function does using VSCode Copilot "or any other coding-aware AI chatbot, like ChatGPT":

Copilot goes into more detail, but it affirms our understanding. Such tools can be beneficial in the whitebox pentesting exercise, as they can simplify many tasks for us. However, a word of warning: Do not overly rely on AI for all tasks, as it is very common for it to make mistakes or miss stuff a human may notice. Mainly use it to confirm your understanding "as we just did" or to clarify something you do not understand "and then double check to confirm".

Note: This is a simplified authentication function that returns an authentication token to the user. This is done to avoid relying on a database that requires further setup and resources, but the general idea remains the same. Many other modules will have a full authentication mechanism, but this should be enough for our purposes.

verifyToken

Finally, we have the verifyToken function. It starts by obtaining the token from req.headers.authorization, which is the authorization HTTP header, as the name suggests. If no token is provided, it will give a 403 Unauthorized error. Otherwise, it uses the jwt.verify function to verify that the token is signed and not manipulated. If the token is signed, it adds it to the request's user object to be used by other endpoints in the server, as we will see later on.

This function retrieves the secure details stored in the user's token to be used by other endpoints in the server. So, if an endpoint uses verifyToken before it is called, then we know that this endpoint likely requires an authenticated token (i.e. valid user authentication).

So far, everything seems normal, so let's jump to the next route and see what it contains.

/ 1 spawns left

Waiting to start...

Questions

Answer the question(s) below to complete this Section and earn cubes!

+ 5 Review the remaining API endpoints. Which function do you think is most likely to be vulnerable?

+10 Streak pts

intro_to_whitebox_pentesting.zip

Previous Next

Go to Questions

Intro to Whitebox Pentesting

Patching & Remediation

Skills Assessment

Skills Assessment - Intro to Whitebox Pentesting

My Workstation

OFFLINE

/ 1 spawns left

Cheat Sheet

The cheat sheet is a useful command reference for this module.

The Whitebox Pentesting Process

Order	Step	Description
1.	`Code Review`	General review of the code to understand its functionality and shortlist potentially vulnerable functions
2.	`Local Testing`	Testing/Debugging the code locally to test our findings and identify vulnerabilities
3.	`Proof of Concept`	Writing an exploit to prove the exploitability of the target automatically
4.	`Patching & Remediation`	Patching the vulnerability and all of its sources/causes

Examples of Code Review Techniques

Select functions based on the application design.
Select functions and files through search.
Select functions through the use of the application.

Exploit Scripting Language Guide

Use Case	Recommended Language	Reason
Attack is on a network application (including web applications)	`Python`	It works similarly on most operating systems
Attacking a client-side function (e.g. a CSRF attack)	`JavaScript`	It is the only script executed by browsers
Web chain including a client-side attack	`Python` & `JavaScript`	We prepare a `JavaScript` payload for the client-side part. Then, use it with a `Python` script to trigger the exploit and carry on the rest of the back-end attacks.
Binary exploitation	`Python`	`Python` has good libraries for debugging and exploiting binaries, while `C`/`C++` may be used to develop a binary exploit.
Targeting an operating system	`Bash` or `PowerShell`/`CMD`	Whatever pre-installed scripting language on that operating system
Thick client or some advanced types of exploitation	The application's programming language	This would enable us to reuse code/functions to generate some payloads, which would save us a lot of time (vs re-scripting all of the logic in Python)

Code/Command Injection Functions

JavaScript 'NodeJS'	Python	PHP	C/C++	C#	Java
`eval`	`eval`	`eval`	execlp
`Function`	exec	exec	execvp
`setInterval`	subprocess.open	proc_open	ShellExecute
`setTimeout`	subprocess.run	popen
`constructor.constructor`	os.system	shell_exec
child_process.exec	os.popen	passthru	system	System.Diagnostics.Process.Start	Runtime.getRuntime().exec
child_process.spawn		system	popen

Payload Development Rules

Comment out the rest of the code
Ensure quotes/parentheses/curly braces are even
Maintain a working function without syntax errors

Methods for Obtaining Command Output

Log output to console "for local testing"
Use a reverse shell
Use DNS exfiltration (or ping exfiltration)
Store the output in the database
Write the output to a file, then access that file
Inject the output into the HTTP response
Use sleep timers or boolean output to read the content

Intro to Whitebox Pentesting

Code Review - Authentication

Data Gathering

app.js

Authentication

getUserToken

verifyToken

Questions

Table of Contents

Whitebox Pentesting Process

Code Review

Local Testing

Proof of Concept (PoC)

Patching & Remediation

Skills Assessment

My Workstation