How to avoid certain directories when using os.walk?
Dave Angel
davea at ieee.org
Fri Oct 30 10:54:29 EDT 2009
Chris Rebert wrote:
> On Thu, Oct 29, 2009 at 9:53 PM, Peng Yu <pengyu.ut at gmail.com> wrote:
>
>> I don't see a way to avoid walking over directories of certain names
>> with os.walk. For example, I don't want os.walk return files whose
>> path include '/backup/'. Is there a way to do so? Otherwise, maybe I
>> will have to make my own program. Thank you!
>>
>
> Read the docs! (http://docs.python.org/library/os.html#os.walk):
>
> """
> os.walk(top[, topdown=True[, onerror=None[, followlinks=False]]])
>
> Generate the file names in a directory tree by walking the tree
> either top-down or bottom-up. For each directory in the tree rooted at
> directory top (including top itself), it yields a 3-tuple (dirpath,
> dirnames, filenames).
> [...]
> When topdown is True, the caller can modify the dirnames list
> in-place (perhaps using del or slice assignment), and walk() will only
> recurse into the subdirectories whose names remain in dirnames; this
> can be used to prune the search, impose a specific order of visiting,
> or even to inform walk() about directories the caller creates or
> renames before it resumes walk() again.
> """
>
> They even include a specific code example of how to skip unwanted
> subdirectories.
>
> Cheers,
> Chris
> --
> http://blog.rebertia.com
>
>
The other thing to note is that it's easy to build a generator from
os.walk() with whatever customization you like. Then the program can
use the generator the same way as he would use walk, just adding a list
of directories to skip.
For example (untested)
def mywalk(top, skipdirs=[]):
for root, dirs, files in os.walk(top):
for skipdir in skipdirs:
dirs.remove(skipdir) # don't visit this directory
yield root, dirs, files
DaveA
More information about the Python-list
mailing list