HTTrack: How to download folders only from a certain subfolder level? - httrack

HTTrack gives filter options but I cannot figure out how to download a certain subfolder level and ignore all other subfolders.
Example:
domain.com/
domain.com/pets/
domain.com/pets/elephant
domain.com/zoo/tiger
domain.com/pics/giraffe
domain.com/pics/giraffe/details
I would like to only download the subfolders elephant, tiger and giraffe as HTML including images linked from there.
Is HTTrack that powerful? (I am using the Windows GUI version "WinHTTrack".)
PS: It would be nice to have this as a program option, e.g. "Minimum mirroring depth".

I found a way how to do it:
-*
-domain.com/*[path]/*
-domain.com/*[path]
+domain.com/*[path]/*[path]/*
-domain.com/*/specialfolder*
+domain.com/*specialimages*.jpg
-mime:*/* +mime:text/html +mime:image/*
Only issue: To get all URLs it was not enough to specify the root domain but also the first level subfolders (for the example: domain.com/pets, domain.com/zoo, domain.com/pics).

Related

Warn user for creating file with same filename in same directory but also include sub directories

I'm looking for a solution that checks for the same filename when I'm downloading files, specifically through Firefox on Windows 10. I know that this feature comes standard when it comes to files in the same directory, but as the volume of files scales up, it's getting harder and harder to find what I'm looking for out of the files I've downloaded.
But since Firefox doesn't have an option to scan sub directories when saving files (nor can I find an add-on for Firefox that does something like it), I'm looking for any alternative solutions that would achieve what I'm looking for in the end: something that will notify me that I'm attempting to download (or have just downloaded) a file whose name already exists in the sub directory of a given folder, whether that be via an add-on, or some kind of application or script that can run in the background. Preferably, I would like it to check the folders inside of those sub folders as well.
My memory is terrible, so I opted to keep everything in the same folder so I would immediately get the warning when attempting to download a file I'd already downloaded. But said folder now contains far too many files for me to realistically glean through to find a particular file that I'm looking for.
I would like to be able to sort these files into sub folders of the folder I'm currently storing my downloaded files while keeping the functionality of being able to immediately tell whether or not I'm about to download something I've already downloaded. All I need is a check to see if the same filename exists upon trying to create a file (which is already a feature) - but in the sub directories as well. I do not need any functionality to actually view all the files in each sub folder in the same window.

Is there any way to filter folders/directories by name in windows?

Is there any way to filter folders by Name in Windows?
Let's suppose I have 100 or 200 folders with different names in the directory.
Now I have a list of specific folder names that are in the folders and I want to filter/move those listed name folders to another directory.
In an easy word I want to copy/move listed folder names to another folder/directory.
How can I do that with any software in bulk? I am searching for it but I can't find any software to do this task for me. I can manually move the folder one by one but it's so time-consuming when I have hundreds of folders.
I am currently using XCOPY GUI software but in this software, I have to first filter the path exception folder before I get my final result.
Example:
Main Dir contains
1st folder
2nd folder
3rd folder
4th folder
and so on ...
I have a list of names in my notepad which folders I want to copy or move.
3rd folder,
4th folder
Final Dir:
3rd folder
4th folder
How can I do that in bilk using any software or any way?
Thanks
If you're using Java, you can use methods in the Files class, or instances of File.
For example, Files.move() might help. If you're using an IDE, type stuff that might be right, like Files.move(), and suggestions will pop up that might lead you down the right path. Also, you can look at the documentation. I am currently scraping the web and creating and filing files, so I know about this somewhat, although not specifically about moving folders rather than files. I'm using Java on a MacBook, so that might influence how relevant my suggestions are to you depending on what you're using.
Basically, look at the documentation for your language. You hopefully won't need any external libraries since moving folders should be a basic task.

How to exclude a folder with CompareMerge?

On my Macbook Pro I have a purchased copy of CompareMerge (by Tien Thinh Vu ~ App-Store link: https://apps.apple.com/us/app/comparemerge/id478570084?mt=12) and want to know how to exclude a specific folder. For example the "/out" folder which does NOT need to be compared/merged when looking at 2 separate code bases of a project because it has the object files as well as other files. It just wastes time and over-populates the results when doing a compare to have it included.
CompareMerge is nice software and I like the way it works. However I can't figure out how to NOT include a folder.
Any ideas how this is accomplished?
You could input the folder name into the "Exclude (subfolder)" text box to exclude it.

Downloading Steam OS Source

Like a lot of you guys out there, I'm pretty pumped for Steam OS. I have a link to the source code, which I want to download:
http://repo.steampowered.com/steamos/
Is there an easy way for me to download all of these files?
There's no download button, and right clicking doesn't give me anything useful.
You can use wget to recursively download the directories you want.
wget -r --include-directories=steamos/ --directory-prefix=steamos/ --wait=15--reject=index.htm* "http://repo.steampowered.com/steamos/"
-r tells wget that we want to recursively download the given site.
--include-directories=steamos/ limits our download to just the steamos folder, from the root of the site. Otherwise it would try to download absolutely everything from http://repo.steampowered.com/
--directory-prefix=steamos/ specifies the folder this will be place in once its downloaded. By default, the download will be placed in 'repo.steampowered.com/steamos/'.
--reject=index.htm* junks the three index pages that would otherwise be saved to each sub-directory.
--wait=15 places a delay of 15 seconds between your downloads, for the sake of being kind to the servers.
My main reference for this was http://learningbitsandbytes.blogspot.ca/2013/07/downloading-source-code-from-svngit.html

Open Source Asset Management or File Server

I'm looking for a simple tool that would allow users to update and tag assets.
THen search/browse for assets and view the assets in the search results.
I have lots of files, i.e. logos, buttons, infographics, icons. I'd like to be able to share the with co-workers and have them be able to easily locate them without have to guess based on file names.
Right now I'm using apache with dir listing and htaccess. But this is less than ideal.
Are you talking about Version Controls? If so, Git can help.

Resources