Python Sample Script

You inherit a folder full of bash scripts. They source each other, they call each other through pipelines, and after a while nobody remembers who depends on whom. You want a picture — “if I touch deploy.sh, what else might break?” — without reading every file by hand. This little Python script does exactly that: it scans a directory of .sh files, finds every reference one script makes to another, and renders the relationships as a GraphViz diagram. 🐍

What you’ll need

Python 3 — the code below runs as-is on a modern Python 3 (it was originally written for Python 2 but uses no incompatible idioms, so just python3 yourscript.py)
Graphviz — provides the dot command-line tool that turns DOT notation into an image. Install with sudo apt install graphviz (Debian/Ubuntu), sudo dnf install graphviz (Fedora/RHEL), or brew install graphviz (macOS).
An SVG viewer — the script ends by opening the result in eog (Eye of GNOME). On macOS, swap that for open; on Windows, start.

No third-party Python libraries are needed — everything (re, glob, ntpath, subprocess) is in the standard library.

Step 1: scan a single bash file for dependencies

The first job is to open one bash file and find every .sh filename mentioned inside it. We use a regex to grab anything that looks like a path ending in .sh, then walk backwards from each match to the start of the line to check whether the match is inside a comment or a quoted string — if it is, we skip it.

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32

import re
import ntpath
import glob
import subprocess

# In order to get this tool to run correctly, you'll need python installed (duh!) and graphviz.
basedir = "/home/ronald-dev/allBashScript/"

'''
This function will scan a bash file, and look for any dependency to another bash file
@param String The fileName
@return Array The dependency filenames
'''
def readDependencies(fileName):
f = open(fileName, 'r')
content = f.read()
occurences = {} # {file1.sh:1, file2.sh:1, file3.sh:1}
for m in re.finditer('[\w|\/]*\.sh', content):
word = ntpath.basename(content[m.start(): m.end()])
if word not in occurences:
isComment = False
index = m.start()
while index > 0 and content[index] != "\n":
if content[index] == """ or content[index] == "'" or content[index] == "#":
isComment = True
break
else:
index = index - 1
if isComment == False:
occurences[word] = 1
f.close()
return occurences.keys()

The dictionary trick (occurences[word] = 1) is just an old-school way of getting a unique set of names — every file only counts once, no matter how many times it’s referenced. Today you’d reach for a set() instead, but the result is the same. Change basedir to wherever your bash scripts live.

Step 2: turn dependencies into GraphViz DOT notation

GraphViz speaks a tiny language called DOT. A directed edge from a.sh to b.sh looks like “a.sh” -> “b.sh”;. Wrap a list of those in digraph G { … } and you have a complete graph. We need a function that turns one file’s dependencies into those edges, and a driver that does it for the whole directory.

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33

'''
Scan one bash file and convert its dependencies to GraphViz notation
@param String The fileName
@param Array The dependency filenames
@return String GraphViz notation "bash01.sh" -> "bash02.sh";[NEWLINE] etc
'''
def toGraphvizNotation(fileName, arrayOfDependencies):
string = ""
filename = ntpath.basename(fileName)
for dependency in arrayOfDependencies:
string += ' "' + filename + '" -> "' + dependency + "";\n"
return string

'''
Scans all bash files, with extension .sh
@param String The directory to scan
@return Array Array of filenames
'''
def readAllBashFiles(directory):
return glob.glob(directory + "*.sh")

'''
Scan multiple bash files and convert their dependencies to GraphViz notation
@param String The directory to scan
@return String GraphViz notation "bash01.sh" -> "bash02.sh";[NEWLINE] etc
'''
def generateGraphVizFile(basedir):
bashfiles = readAllBashFiles(basedir)
graphVizFileContent = ""
for bashfile in bashfiles:
graphVizFileContent += toGraphvizNotation(bashfile, readDependencies(bashfile))
graphVizFileContent = "digraph G { \n" + graphVizFileContent + "}"
return graphVizFileContent

At this point we can turn the whole directory into a single block of DOT text — no files written yet, no images rendered, just a string that looks like:

1
2
3
4
5

digraph G {
"deploy.sh" -> "common.sh";
"deploy.sh" -> "db_backup.sh";
"db_backup.sh" -> "common.sh";
}

Step 3: write the .dot file, render it, and open the picture

The last step has three parts: write the DOT string to a file, shell out to dot to convert it to SVG, then open the SVG in an image viewer. Three lines, one for each.

1
2
3
4
5
6
7
8
9
10
11
12
13

'''
Write content to a file
@param String The filename to be written to
@param String The content
'''
def writeGraphvizFile(filename, graphVizFileContent):
f = open(filename, 'w')
f.write(graphVizFileContent)
f.close()

writeGraphvizFile("/tmp/bashdependencies.dot", generateGraphVizFile(basedir))
subprocess.call(["/usr/bin/dot", "-T", "svg", "-o", "/tmp/allBashScript.svg", "/tmp/bashdependencies.dot"])
subprocess.call(["eog", "/tmp/allBashScript.svg"])

Run the script and you should see your dependency graph open in an image viewer. If you’d rather just produce the SVG without opening it, drop the last subprocess.call line.

A few things worth knowing.

Python 3 vs Python 2. The original was written in 2014, when Python 2 was still everyone’s default. The functional code above runs unchanged on Python 3 — there are no print statements, no xrange, no implicit str/bytes mixing. Add a shebang and you’re done:

1	#!/usr/bin/env python3

The dot path is hard-coded. /usr/bin/dot works on most Linux distros but not on macOS (where Homebrew puts it under /opt/homebrew/bin/ on Apple Silicon, or /usr/local/bin/ on Intel) and not on Windows. Drop the absolute path and let $PATH resolve it:

1	subprocess.call(["dot", "-T", "svg", "-o", "/tmp/allBashScript.svg", "/tmp/bashdependencies.dot"])

Same idea for the viewer: eog is GNOME-only. Cross-platform, the standard trick is:

1
2
3

import sys
opener = {"darwin": "open", "win32": "start"}.get(sys.platform, "xdg-open")
subprocess.call([opener, "/tmp/allBashScript.svg"])

The regex has a small bug. The pattern [\w|\/]*\.sh uses a character class that includes | as a literal pipe — not as alternation, because | has no special meaning inside square brackets. It still works in practice (pipes rarely appear next to .sh), but the cleaner intent is [\w/]*\.sh.

The comment-detection misses quoted dependencies. The script walks backwards looking for #, ‘, or “ to decide whether a match is inside a comment or string — and skips it if it is. That’s correct for # source helper.sh, but it also skips legitimate dependencies like source “common.sh” or bash ‘helper.sh’, because the trailing quote on the same line trips the check. If your scripts wrap filenames in quotes, you’ll get a graph that’s missing edges. A more robust approach is to strip comments first (everything after a non-quoted #), then run the regex on the cleaned content.

Modernise the file I/O. The pattern f = open(…); … ; f.close() works but won’t close the file if anything in between throws. Use with:

1 2	with open(fileName, 'r') as f: content = f.read()

And of course, in 2026 you might just ask a smart code-search tool for the same picture — but there’s still something satisfying about a script you can read end to end in five minutes. 🌳

What you’ll need

Step 1: scan a single bash file for dependencies

Step 2: turn dependencies into GraphViz DOT notation

Step 3: write the .dot file, render it, and open the picture

Archives

Meta

Categories