You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
3.1 KiB
3.1 KiB
CLI Cheatsheet
Contents
Drupal / Drush
Note: These SQL queries target the Drupal 7 schema (
node_typetable). They won't work as-is on Drupal 8+.
List content types with node counts
drush sqlq 'select count(node.nid) as node_count, node_type.type
from node
inner join node_type on node.type = node_type.type
group by node_type.type'
Filter results by keyword (e.g. "2014")
drush sqlq 'select count(node.nid) as node_count, node_type.type
from node
inner join node_type on node.type = node_type.type
group by node_type.type' | grep 2014
Text Processing
Find and replace inside files (perl)
# Single file type in current directory
perl -pi -w -e 's/SEARCH_FOR/REPLACE_WITH/g;' *.txt
# Recursively across all file types
perl -pi -w -e 's/thex/robertsonlibrary/g;' **/*.*
Find and replace in file names
# Rename files matching a pattern, verbose output shows what changed
rename 's/livero/lives/g' **/*.* -v
Torrents
Download via magnet link (aria2c)
# aria2c is a lightweight multi-protocol download utility
aria2c -d ~/Downloads "magnetlink"
PDF Tools
OCR a PDF (ocrmypdf)
# Standard: optimize output, skip pages that already have a text layer
ocrmypdf --optimize 3 --skip-text input.pdf output.pdf
# Aggressive: force re-OCR even if a text layer exists (useful for corrupt/bad layers),
# set DPI manually, use page segmentation mode 1 (automatic with OSD)
ocrmypdf --optimize 3 --image-dpi 300 --output-type pdf \
--force-ocr --tesseract-pagesegmode 1 input.pdf output.pdf
Downsample a PDF to 72dpi (Ghostscript)
# Single file
gs -sDEVICE=pdfwrite -dCompatibilityLevel=1.4 \
-dPDFSETTINGS=/screen \
-dNOPAUSE -dQUIET -dBATCH \
-sOutputFile=output.pdf input.pdf
# Batch - processes all PDFs in current folder, preserves original filenames
mkdir -p downsampled
for f in *.pdf *.PDF; do
[ -f "$f" ] || continue
gs -sDEVICE=pdfwrite -dCompatibilityLevel=1.4 -dPDFSETTINGS=/screen \
-dNOPAUSE -dBATCH -dQUIET \
-sOutputFile="downsampled/$f" \
"$f"
done
Check image DPI in PDFs (pdfimages)
# Print image info to terminal
for f in *.pdf *.PDF; do
echo "=== $f ==="
pdfimages -list "$f"
echo ""
done
# Save output to images_list.txt instead
for f in *.pdf *.PDF; do
[ -f "$f" ] || continue
echo "=== $f ===" >> images_list.txt
pdfimages -list "$f" >> images_list.txt
echo "" >> images_list.txt
done
Scan for CCITT encoding
# CCITT is a fax-era compression format - flags PDFs that may cause compatibility issues
for f in *.pdf *.PDF; do
[ -f "$f" ] || continue
if pdfimages -list "$f" 2>/dev/null | grep -q " ccitt "; then
echo "$f uses CCITT"
fi
done