Database Queries
This page is for developers writing Autopsy plugin modules that run SQL queries. If your module does not need SQL queries, you can skip this page. Autopsy currently allows either SQLite or PostgreSQL as the backend database system for a case. Any module you write may run against either backend, at the user's discretion.
If you are writing code for Autopsy itself, not just an Autopsy module, you may also need to INSERT into and UPDATE the database. Please see INSERT and UPDATE.
Which Database is my Module Accessing?
In an Autopsy Module, you can check the database type currently in use with the following code snippet:
Case currentCase = Case.getCurrentCase();
if (currentCase.getCaseType() == Case.CaseType.MULTI_USER_CASE)
{
    // PostgreSQL in use
}
else
{
    // SQLite in use
}
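If the check is needed in several places, it can help to name the result once and pass it around. The following is a minimal sketch, not part of Autopsy: the BackendDb enum and its forCase() method are hypothetical names, and the mapping assumes (as the examples on this page do) that multi-user cases use PostgreSQL and all other cases use SQLite.

```java
// Hypothetical helper, not part of Autopsy or The Sleuth Kit.
// It names the back end once so the rest of a module can branch on a value
// instead of repeating the case-type comparison.
enum BackendDb {
    SQLITE,
    POSTGRESQL;

    // Assumption taken from this page: a multi-user case is backed by
    // PostgreSQL; any other case is backed by SQLite.
    static BackendDb forCase(boolean isMultiUserCase) {
        return isMultiUserCase ? POSTGRESQL : SQLITE;
    }
}
```

In a module, the argument would come from the check shown above, for example BackendDb.forCase(currentCase.getCaseType() == Case.CaseType.MULTI_USER_CASE).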
TSK methods to Query the Database With User-Supplied SQL
The following SleuthkitCase methods are available for the user to supply all of, or a portion of, a SQL query.
ArrayList<BlackboardAttribute> getMatchingAttributes(String whereClause)
ArrayList<BlackboardArtifact> getMatchingArtifacts(String whereClause)
long countFilesWhere(String sqlWhereClause)
List<AbstractFile> findAllFilesWhere(String sqlWhereClause)
List<Long> findAllFileIdsWhere(String sqlWhereClause)
CaseDbQuery executeQuery(String query)
List<FsContent> findFilesWhere(String sqlWhereClause) [deprecated]
ResultSet runQuery(String query) [deprecated]
void closeRunQuery(ResultSet resultSet) [deprecated]
The majority of them only allow the user to specify a WHERE clause, determining which records to SELECT.
The following example finds all the .txt files in the case, where sk is the SleuthkitCase object for the current case:
List<AbstractFile> files = sk.findAllFilesWhere("LOWER(name) LIKE '%.txt'");
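When the extension comes from a variable rather than a literal, the WHERE clause has to be assembled as a string. The sketch below shows one way to do that; FileNameClauses and endsWithExtension() are hypothetical names, not part of Autopsy or The Sleuth Kit. It doubles single quotes so the value cannot end the SQL string literal early, but it does not escape the LIKE wildcards % and _.

```java
import java.util.Locale;

// Hypothetical helper, not part of Autopsy or The Sleuth Kit.
final class FileNameClauses {
    private FileNameClauses() {
    }

    // Builds a WHERE clause fragment that matches file names ending in the
    // given extension, with or without a leading dot in the argument.
    static String endsWithExtension(String extension) {
        String ext = extension.startsWith(".") ? extension.substring(1) : extension;
        // Lower-case the value to match LOWER(name), and double any single
        // quotes so the value stays inside the SQL string literal.
        String safe = ext.toLowerCase(Locale.ROOT).replace("'", "''");
        return "LOWER(name) LIKE '%." + safe + "'";
    }
}
```

Assuming sk is a SleuthkitCase as in the example above, the result could be passed as sk.findAllFilesWhere(FileNameClauses.endsWithExtension("txt")).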
How to Avoid Pitfalls When Using the Query Methods
Because a case can use either backend database, take care to write strict, standard SQL. When the SQL must differ between database types, use the check described in "Which Database is my Module Accessing?" to determine which database type is in use and build the proper statement. Be sure to test your module with both database types. They behave differently and can return results in a different order.
WHERE Clause Syntax
- Do not use backticks. PostgreSQL does not use them like SQLite does.
- Use only single quotes to quote values. Do not use double quotes for this. Quoting values is not required.
SELECT * FROM tsk_files WHERE has_path = "1"    -- incorrect: double quotes around a value
SELECT * FROM tsk_files WHERE has_path = '1'    -- correct
SELECT * FROM tsk_files WHERE has_path = 1      -- correct
- Use only double quotes to quote column names. Do not use single quotes for this. Quoting column names is not required.
SELECT 'obj_id' FROM tsk_files WHERE has_path = 1    -- incorrect: single quotes around a column name
SELECT "obj_id" FROM tsk_files WHERE has_path = 1    -- correct
SELECT obj_id FROM tsk_files WHERE has_path = 1      -- correct
- Do not use || and && to connect logical clauses. PostgreSQL does not accept them as logical operators. Use OR and AND instead.
SELECT COUNT(*) FROM tsk_files WHERE dir_type = '5' && md5 IS NULL || size > '0'    -- incorrect
SELECT COUNT(*) FROM tsk_files WHERE dir_type = '5' AND md5 IS NULL OR size > '0'   -- correct
- PostgreSQL comparisons are case-sensitive. Always specify which type of comparison you want. UPPER() and LOWER() can help with that.
SELECT * FROM people WHERE first_name LIKE '%somename%'                  -- case-sensitive in PostgreSQL, case-insensitive in SQLite
SELECT * FROM people WHERE first_name ILIKE '%somename%'                 -- case-insensitive, but ILIKE works in PostgreSQL only
SELECT * FROM people WHERE LOWER(first_name) LIKE LOWER('%somename%')    -- case-insensitive in both
- When generating WHERE clauses in code, some developers append an AND(1) or OR(0) clause as a placeholder that does not affect the outcome of the query but simplifies the query-generation logic. PostgreSQL does not allow true or false comparisons with integers; the PostgreSQL syntax is AND(true) or OR(false). SQLite does not allow the PostgreSQL syntax, and PostgreSQL does not allow the SQLite syntax, so do not use this trick to generate queries. Instead, have your code handle the edge case where there are no entries for the AND or OR portion of a clause.
WHERE id=12 AND(1)       -- SQLite-only form: do not use
WHERE id=12 AND(true)    -- PostgreSQL-only form: do not use
WHERE id=12              -- preferred: works in both
- SQLite allows non-standard usage of the IS keyword. Standard usage of IS checks whether something IS NULL or IS NOT NULL; it does not compare against specific values. When comparing values, use = instead of IS. To check for NULL, IS NULL is the right tool. Example:
WHERE value IS '4'         -- incorrect: non-standard use of IS
WHERE value = '4'          -- correct
WHERE value != '4'         -- correct
WHERE value IS NULL        -- correct
WHERE value IS NOT NULL    -- correct
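The rules above can be folded into a few small string helpers so that every generated clause follows them. This is only a sketch under stated assumptions: WhereClauses and all of its methods are hypothetical names, not part of Autopsy or The Sleuth Kit, and column names are assumed to be trusted identifiers supplied by the module itself, never by a user.

```java
import java.util.List;

// Hypothetical helpers, not part of Autopsy or The Sleuth Kit.
final class WhereClauses {
    private WhereClauses() {
    }

    // Quotes a value with single quotes only, doubling embedded single quotes.
    static String quoteValue(String value) {
        return "'" + value.replace("'", "''") + "'";
    }

    // Uses = for real values and IS NULL only for a null value.
    static String equalsOrIsNull(String column, String value) {
        return value == null ? column + " IS NULL" : column + " = " + quoteValue(value);
    }

    // Case-insensitive substring match that avoids the PostgreSQL-only ILIKE.
    static String containsIgnoreCase(String column, String text) {
        return "LOWER(" + column + ") LIKE LOWER(" + quoteValue("%" + text + "%") + ")";
    }

    // Joins conditions with AND. With no conditions it returns an empty
    // string instead of relying on an AND(1) or AND(true) placeholder.
    static String whereAllOf(List<String> conditions) {
        if (conditions.isEmpty()) {
            return "";
        }
        // Each condition is parenthesized so an OR inside one condition
        // cannot change how the surrounding ANDs are grouped.
        return "WHERE (" + String.join(") AND (", conditions) + ")";
    }
}
```

whereAllOf() returns a complete WHERE clause, so it suits queries built in full for executeQuery(). For the methods that take only the text after WHERE, the individual fragments can be joined with AND directly, and the no-condition case still has to be handled by the caller.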
How to ORDER BY Consistently
- SQLite and PostgreSQL have different default sort orders for returned records, so you want to fully specify ORDER BY clauses for both database types. Example:
Case currentCase = Case.getCurrentCase();
String orderByClause;
if (currentCase.getCaseType() == Case.CaseType.MULTI_USER_CASE)
{
    // PostgreSQL in use: state the NULL placement explicitly
    orderByClause = "ORDER BY att.value_text ASC NULLS FIRST";
}
else
{
    // SQLite in use
    orderByClause = "ORDER BY att.value_text ASC";
}
- Do not use COLLATE NOCASE to order output. This does not exist in PostgreSQL. Use LOWER() or UPPER() instead.
ORDER BY tsk_files.dir_type, tsk_files.name COLLATE NOCASE    -- incorrect: fails in PostgreSQL
ORDER BY tsk_files.dir_type, LOWER(tsk_files.name)            -- correct: works in both
- In ORDER BY clauses, PostgreSQL ignores leading dashes. Given the following data, you will see the following two sort orders for the different databases.
Data | PostgreSQL sort order | SQLite sort order |
Alpha | Alpha | -Bravo |
-Bravo | -Bravo | Alpha |
Charlie | Charlie | Charlie |
To force PostgreSQL to not ignore leading dashes, convert strings to SQL_ASCII before sorting by them. This is done with convert_to(), but it only exists in PostgreSQL.
ORDER BY some_value                             -- SQLite
ORDER BY convert_to(some_value, 'SQL_ASCII')    -- PostgreSQL
With the code above, using SQL_ASCII encoding, the following results are seen:
Data | PostgreSQL sort order | SQLite sort order |
Alpha | -Bravo | -Bravo |
-Bravo | Alpha | Alpha |
Charlie | Charlie | Charlie |
- PostgreSQL sorts NULLs last for ASC and first for DESC. SQLite does the opposite. PostgreSQL allows you to control the NULL sort order with NULLS FIRST or NULLS LAST.
ORDER BY att.value_text ASC                                          -- SQLite
ORDER BY convert_to(att.value_text, 'SQL_ASCII') ASC NULLS FIRST     -- PostgreSQL
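The leading-dash and NULL-ordering points can be combined into one helper that emits the matching ORDER BY clause for each database type. This is a minimal sketch based only on the two example clauses above; OrderByClauses and textAscending() are hypothetical names, and the column name is assumed to be a trusted identifier.

```java
// Hypothetical helper, not part of Autopsy or The Sleuth Kit.
final class OrderByClauses {
    private OrderByClauses() {
    }

    // Returns an ascending ORDER BY clause for a text column.
    // For PostgreSQL it sorts by the SQL_ASCII bytes so leading dashes are
    // not ignored, and it places NULLs first. For SQLite the plain clause
    // is returned unchanged.
    static String textAscending(String column, boolean postgreSql) {
        if (postgreSql) {
            return "ORDER BY convert_to(" + column + ", 'SQL_ASCII') ASC NULLS FIRST";
        }
        return "ORDER BY " + column + " ASC";
    }
}
```

The boolean would come from the case-type check shown earlier on this page. As with any database-specific SQL, the resulting order should still be verified against both database types.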