If you've set up a scan in AppCheck then you will be familiar with the Targets box; it's usually the first thing you fill in when setting up a web application scan.

Screenshot_2021-05-18_at_11.36.33.png

What you may not be aware of is another place to specify targets:

Web Application Scanner Settings
- Advanced Settings
  - Seeded Targets

This article explains a little about how the types of targets define the behaviour of your web application scan.

Scope
Adding Targets to Your Account Scope
How Targets Define a Scan
Seeded Targets
Application Root
What Should You Do If You Want To Specify Several Paths Within One Application?
What Should You Do If You Do Not Want To Scan Everything Under / ?
What Should You Do If You Want To Explicitly Exclude A Specific URL From A Scan?

Scope

Your organisations' AppCheck account has an associated list of application URLs and infrastructure addresses, known as your account scope. A target can only be added to a scan if it is within your account scope. For example, take the following account scope:

https://example.com
example.com
123.1.0.0/16

https://example.com is in scope and can be scanned, but https://www.example.com is out of scope and cannot be scanned.

123.1.0.99 is within scope and can be scanned, but 123.99.0.0 is out of scope and cannot be scanned.

Adding Targets to Your Account Scope

Targets can be added in the Target Scope section of the AppCheck platform.

Once added, targets cannot be removed from scope until edit date. Please contact AppCheck Support if you have issues updating your scope.

How Targets Define a Scan

When a scan is launched, the first stage is a "crawl" of each application specified as a target.

Starting with the application's root (which is usually /), the scanner looks for hyperlinks in the returned HTML content and for URLs/paths mentioned in source code (including in scripts, frames etc).

The scanner then follows these links and repeats the process recursively until the crawler has a complete map of the application. It will also make requests to additional paths that commonly exist even if it doesn't see links to them - such as /admin (or /wp-admin looking for WordPress sites).

The resulting map of the application (known as the Mapped Attack Surface) is then passed on to the next stage of the process where active scanning takes place.

For more information on the Mapped Attack Surface, see How can I see a list of which paths or URLs AppCheck has scanned (crawled and attacked) for my web application?

Seeded Targets

Screenshot_2021-05-18_at_12.01.21.png

Seeded targets are URLs that are explicitly added to the map (the scanner's list of potential attack points within an application) before crawling, to ensure that the given paths and query strings (and anything else found by continuing to crawl from them) are included in the scan. Often they would be found anyway during the crawl, but adding them as Seeded Targets just makes sure they're not missed for any reason.

This is generally only needed when a given URL can't be found by crawling from the application root (ie there's no link to it from the rest of the application), or when a URL may be incorrectly removed from the map during de-duplication.

Application Root

The crucial point to be aware of is that the root of an application is assumed to be / even if a path is specified in the URL in the Targets box.

For example, if you add the following URL to your Targets box:

https://example.com/login

then the scan target is treated as https://example.com/ while /login is treated as a seeded target (ie it is added to the map of the application, and we crawl from there too). This is useful in situations where the scan is configured with the URL of the application's login page as the target, but where the intention is to scan the entire application (not just the login page).

This means if you add two URLs to the scan's targets box:

https://example.com/
https://example.com/login

then what you actually have is two identical scan targets:

https://example.com/
https://example.com/

and one seeded target:

https://example.com/login

meaning you scan the whole application (https://example.com/) twice. This should be avoided as it will results in twice as many requests to your application, increasing load on your servers and on the scanner, and doubling the time taken for the scan. The correct solution would be to add only https://example.com/login to the scan's targets box.

What Should You Do If You Want To Specify Several Paths Within One Application?

Add the root of the application to the Targets box and add the extra URLs to the Seeded Targets box.

For example, add

https://example.com/

as the target, and add

https://example.com/login
https://example.com/my-application/

as seeded targets. This means we will scan all of https://example.com/ and we will be sure to include /login and /my-application.

What Should You Do If You Do Not Want To Scan Everything Under / ?

If you want to specify a root other than / then you can do so using a pipe character (|) after the URL in the Targets box, eg:

https://example.com/my-application/|

In this case the scan target is treated as

https://example.com/my-application/

with /my-application/ as the root, and nothing outside that path, such as https://example.com/ or https://example.com/login, will be scanned.

What Should You Do If You Want To Explicitly Exclude A Specific URL From A Scan?

AppCheck allows the specifying of Targets to Exclude.

Targets to Exclude are not actively scanned, meaning the scanner will not send attack payloads or large numbers of requests to the denied URLs (as it does to allowed targets). However, some requests may still be made to them during crawling phase.

A small number of innocuous requests may be sent, mostly during the crawling phase of the scan while the scanner builds its map of the application, particularly if the excluded targets must be passed to discover other parts of the application.

For example, if your login page is in Targets to Exclude, then attack requests will not be sent to it, but requests required to create authenticated sessions will still be sent.

Targets to Exclude can be found just bellow the Targets box near the top of the scan configuration page:

Targets to Exclude are matched against the start of any URL - any URL which begins with a denied URL will be excluded from attacks. In the above example the excluded target contains https://example.com/do-not-scan. This means https://example.com/do-not-scan/secret-child-page will also not be attacked, and so on.

Wildcards are supported in the form of *. For example, if you wish to exclude all subdomains beginning with abc you would add the denied target https://abc*.example.com. If you wish to exclude all paths with xzy in their name, you would add the denied target http://example.com/*xyz*.

The Targets to Exclude are not case sensitive. This functionality does not support wildcards or regular expressions. The target needs to be in a URL format as it appears in a web browser.

The Targets to Exclude patterns are matched against the URL including the query component. This means you can add specific URL parameter names or even values to the excluded targets list. For example, you could exclude https://example.com/q?this=something and still scan other values for the parameter this, or you could exclude https://example.com/q?this= and not scan any example of the this parameter on that path.

The order of URL parameters is not guaranteed; therefore this feature may not work if the URL contains multiple parameters.

Articles in this section

Application Scan Targets, Scope, Seeded Targets and Targets to Exclude

Scope

Adding Targets to Your Account Scope

How Targets Define a Scan

Seeded Targets

Application Root

What Should You Do If You Want To Specify Several Paths Within One Application?

What Should You Do If You Do Not Want To Scan Everything Under / ?

What Should You Do If You Want To Explicitly Exclude A Specific URL From A Scan?

Comments

Articles in this section

Scope

Adding Targets to Your Account Scope

How Targets Define a Scan

Seeded Targets

Application Root

What Should You Do If You Want To Specify Several Paths Within One Application?

What Should You Do If You Do Not Want To Scan Everything Under / ?

What Should You Do If You Want To Explicitly Exclude A Specific URL From A Scan?

Related articles