Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

clo: Add CLI command to extract a compressed file as IR. #437

Merged
merged 106 commits into from
Jun 22, 2024
Merged
Show file tree
Hide file tree
Changes from 86 commits
Commits
Show all changes
106 commits
Select commit Hold shift + click to select a range
ad69313
Add combined file msg offset to the global metadata databases
haiqi96 May 15, 2024
d53afcd
Add combined file message offset to reader side file and File metadata
haiqi96 May 22, 2024
12d39a9
Linter fix
haiqi96 May 15, 2024
9f3c1cb
Linter fix again
haiqi96 May 22, 2024
87456f5
Fix comment
haiqi96 May 22, 2024
73c4216
address code review comments
haiqi96 May 24, 2024
437ee87
linter
haiqi96 May 24, 2024
dc56550
small fix
haiqi96 May 24, 2024
0a5cde1
Update msg_idx to message_ix for consistency
haiqi96 May 27, 2024
ad9bbd8
Update msg_idx to message_ix for consistency
haiqi96 May 27, 2024
83eaaf1
Further clean up
haiqi96 May 27, 2024
7cd1ff6
Linter
haiqi96 May 27, 2024
7700b8f
Address code review comments
haiqi96 May 27, 2024
06e3975
Add partial support for file ID & msg_ix querying
haiqi96 May 22, 2024
dd355ad
Add support for global MySQL database
haiqi96 May 22, 2024
8b09311
Replace file iterator with simpler function
haiqi96 May 22, 2024
abd1940
Get previous code's review changes.
haiqi96 May 24, 2024
75fc71b
Linter
haiqi96 May 24, 2024
f17e955
rebase and linter
haiqi96 May 27, 2024
58a28ce
Update comments and function interfaces
haiqi96 May 27, 2024
201b336
Remove extra empty line
haiqi96 May 28, 2024
66fe18e
Resolve code review concerns
haiqi96 May 29, 2024
3e0da95
Revert string view
haiqi96 May 29, 2024
3246c61
Merge branch 'main' into FileIDFilter
haiqi96 May 29, 2024
2781f8a
Replace magic number with constexpr to avoid confusion
haiqi96 May 29, 2024
356736d
Update docstring
haiqi96 May 29, 2024
8b2d2b1
Add Ir decompression api
haiqi96 May 28, 2024
aa9377f
Linter
haiqi96 May 29, 2024
85e928d
Retouch the core Archive to IR functions
haiqi96 May 28, 2024
323e80e
Merge remote-tracking branch 'origin/main' into ArchiveToIR
haiqi96 May 30, 2024
1734f98
A few more fixes
haiqi96 May 30, 2024
486cae6
Further refactor
haiqi96 May 31, 2024
24e5734
Bug fixes
haiqi96 May 31, 2024
c39941b
Small touches
haiqi96 May 31, 2024
a93b4bd
Apply suggestions from code review
haiqi96 Jun 3, 2024
a3bc3bb
Apply suggestions from code review
haiqi96 Jun 4, 2024
a04a7c9
Address code review comments and combine open into one function
haiqi96 Jun 4, 2024
29df8a3
Move constant and method to the specific file
haiqi96 Jun 4, 2024
5f6d25c
Linter
haiqi96 Jun 4, 2024
d7a3953
replace implicit import with explicit import.
haiqi96 Jun 4, 2024
631705a
linter
haiqi96 Jun 4, 2024
3cb6e11
revert unintended change
haiqi96 Jun 4, 2024
47f8dfb
Remove unnecessary import
haiqi96 Jun 4, 2024
d7ad77f
Add missing include to cmake
haiqi96 Jun 4, 2024
244446b
Remove size estimation
haiqi96 Jun 4, 2024
c4f87f8
Apply suggestions from code review
haiqi96 Jun 5, 2024
e46791c
Apply suggestions from code review
haiqi96 Jun 5, 2024
bcc4070
address code review comments
haiqi96 Jun 5, 2024
8674f6a
Add extra check
haiqi96 Jun 5, 2024
83b5385
Apply suggestions from code review
haiqi96 Jun 7, 2024
0a98c88
Apply suggestions from code review
haiqi96 Jun 7, 2024
6523fc8
Address code review concerns
haiqi96 Jun 7, 2024
30d4aae
Address more code review concerns
haiqi96 Jun 7, 2024
2f8ce12
fix silly mistake
haiqi96 Jun 7, 2024
adf9deb
last fix hopefully
haiqi96 Jun 7, 2024
f4af39c
Another fix
haiqi96 Jun 7, 2024
0a8f968
Add argument parsing for IR compression flow
haiqi96 May 29, 2024
b49f397
Retouch the IR decompression path
haiqi96 May 28, 2024
cb26f76
small fixes
haiqi96 May 31, 2024
4758933
Turn the function into a templated function. note it doesn't compile yet
haiqi96 Jun 7, 2024
0d8ef4f
Some refactor
haiqi96 Jun 7, 2024
2156987
update caller
haiqi96 Jun 7, 2024
1e3fead
Merge branch 'main' into ArchiveToIRCmd
haiqi96 Jun 8, 2024
c4a2c84
Fix
haiqi96 Jun 8, 2024
44b1b44
Apply suggestions from code review
haiqi96 Jun 11, 2024
7209ce4
Address code review concerns
haiqi96 Jun 11, 2024
8b94242
Address more code review concerns
haiqi96 Jun 11, 2024
f430fd9
Add docstrings
haiqi96 Jun 11, 2024
9b9892d
linter
haiqi96 Jun 11, 2024
1960858
Fix inconsistent indent.
kirkrodrigues Jun 11, 2024
0feef34
Fix as many clang-tidy warnings as possible; Refactor decompress_to_i…
kirkrodrigues Jun 12, 2024
d8d053e
Apply suggestions from code review
haiqi96 Jun 12, 2024
21a268c
reorder argument check
haiqi96 Jun 12, 2024
aa6a0ee
update function docstring and signature
haiqi96 Jun 12, 2024
dd91cc7
fix
haiqi96 Jun 12, 2024
228633f
Refactor CLO so it accepts a command
haiqi96 Jun 6, 2024
5642f76
Add arguments pasring for clo x
haiqi96 Jun 6, 2024
7f04a36
At least builds...
haiqi96 Jun 11, 2024
f1c4328
temp
haiqi96 Jun 11, 2024
64b43e4
Merge branch 'main' into clo-change
haiqi96 Jun 12, 2024
93c7c2c
Some refactoring
haiqi96 Jun 12, 2024
6c9c5e3
refactor the commandline arguments
haiqi96 Jun 12, 2024
d3c3447
improvement
haiqi96 Jun 12, 2024
c457669
Optimize argument printout
haiqi96 Jun 12, 2024
6d4ee79
Fix argument parsing so the outputhandler doesn't get old arguments
haiqi96 Jun 13, 2024
5c2b386
Merge branch 'main' into clo-change
haiqi96 Jun 14, 2024
6b78a11
Apply suggestions from code review
haiqi96 Jun 18, 2024
55f28a1
address code review concerns
haiqi96 Jun 18, 2024
2feca53
refactor commandline argument parsing
haiqi96 Jun 18, 2024
994aac1
Refactor CLO
haiqi96 Jun 18, 2024
cf9308d
add docstring
haiqi96 Jun 19, 2024
c528a6c
Merge branch 'main' into clo-change
haiqi96 Jun 19, 2024
6d4406e
Update cmakefile
haiqi96 Jun 19, 2024
1ea99a1
Address code review concerns
haiqi96 Jun 20, 2024
35b811c
support constant properly
haiqi96 Jun 20, 2024
50606ea
fix the search return value
haiqi96 Jun 20, 2024
bdf923a
Linter
haiqi96 Jun 20, 2024
62f309d
Clean-up clo.cpp
kirkrodrigues Jun 21, 2024
bb3cc6d
clo.cpp: Move extract_ir before search.
kirkrodrigues Jun 21, 2024
4ec9536
Alphabetize CMakelists.txt
kirkrodrigues Jun 21, 2024
e0b23ce
Extra refactoring.
kirkrodrigues Jun 21, 2024
1f39617
Rename constants.
kirkrodrigues Jun 21, 2024
4b1e541
Clean-up CLI.
kirkrodrigues Jun 21, 2024
f57c2c9
fix path creation
haiqi96 Jun 21, 2024
c1d2148
Linter
haiqi96 Jun 22, 2024
d72d223
Revert unintended change
haiqi96 Jun 22, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
7 changes: 7 additions & 0 deletions components/core/src/clp/clo/CMakeLists.txt
Original file line number Diff line number Diff line change
Expand Up @@ -108,11 +108,18 @@ set(
"${PROJECT_SOURCE_DIR}/submodules/sqlite3/sqlite3.c"
"${PROJECT_SOURCE_DIR}/submodules/sqlite3/sqlite3.h"
"${PROJECT_SOURCE_DIR}/submodules/sqlite3/sqlite3ext.h"
../clp/FileDecompressor.hpp
haiqi96 marked this conversation as resolved.
Show resolved Hide resolved
../clp/FileDecompressor.cpp
../ir/LogEventSerializer.cpp
../ir/LogEventSerializer.hpp
../ffi/ir_stream/encoding_methods.cpp
../ffi/ir_stream/encoding_methods.hpp
clo.cpp
CommandLineArguments.cpp
CommandLineArguments.hpp
OutputHandler.cpp
OutputHandler.hpp
utils.hpp
)

set(
Expand Down
788 changes: 495 additions & 293 deletions components/core/src/clp/clo/CommandLineArguments.cpp
kirkrodrigues marked this conversation as resolved.
Show resolved Hide resolved

Large diffs are not rendered by default.

43 changes: 41 additions & 2 deletions components/core/src/clp/clo/CommandLineArguments.hpp
Original file line number Diff line number Diff line change
Expand Up @@ -18,6 +18,10 @@ namespace clp::clo {
class CommandLineArguments : public CommandLineArgumentsBase {
public:
// Types
enum class Command : char {
Search = 's',
ExtractIr = 'i',
};
haiqi96 marked this conversation as resolved.
Show resolved Hide resolved
enum class OutputHandlerType : uint8_t {
Network = 0,
Reducer,
Expand All @@ -36,8 +40,28 @@ class CommandLineArguments : public CommandLineArgumentsBase {
// Methods
ParsingResult parse_arguments(int argc, char const* argv[]) override;

std::string const& get_archive_path() const { return m_archive_path; }
[[nodiscard]] auto get_command() const -> Command { return m_command; }

[[nodiscard]] auto get_archive_path() const -> std::string const& { return m_archive_path; }

// IR extraction arguments
[[nodiscard]] auto get_file_split_id() const -> std::string const& { return m_file_split_id; }

[[nodiscard]] auto get_ir_output_dir() const -> std::string const& { return m_ir_output_dir; }

[[nodiscard]] auto get_ir_temp_output_dir() const -> std::string const& {
return m_ir_temp_output_dir;
}

[[nodiscard]] auto get_ir_mongodb_uri() const -> std::string const& { return m_ir_mongodb_uri; }

[[nodiscard]] auto get_ir_mongodb_collection() const -> std::string const& {
return m_ir_mongodb_collection;
}

size_t get_ir_target_size() const { return m_ir_target_size; }
haiqi96 marked this conversation as resolved.
Show resolved Hide resolved

// Search arguments
bool ignore_case() const { return m_ignore_case; }

std::string const& get_search_string() const { return m_search_string; }
Expand Down Expand Up @@ -116,9 +140,24 @@ class CommandLineArguments : public CommandLineArgumentsBase {
);

void print_basic_usage() const override;
void print_search_basic_usage() const;
void print_extraction_basic_usage() const;

// Commands
haiqi96 marked this conversation as resolved.
Show resolved Hide resolved
Command m_command;

// Search variables
// Archive
haiqi96 marked this conversation as resolved.
Show resolved Hide resolved
std::string m_archive_path;

// Variables for IR extraction
std::string m_ir_output_dir;
std::string m_ir_temp_output_dir;
std::string m_file_split_id;
std::string m_ir_mongodb_uri;
std::string m_ir_mongodb_collection;
size_t m_ir_target_size{128ULL * 1024 * 1024};
haiqi96 marked this conversation as resolved.
Show resolved Hide resolved

// Variables for search
bool m_ignore_case;
std::string m_search_string;
std::string m_file_path;
Expand Down
4 changes: 2 additions & 2 deletions components/core/src/clp/clo/OutputHandler.hpp
Original file line number Diff line number Diff line change
Expand Up @@ -56,7 +56,7 @@ class OutputHandler {
* metadata about the file
*/
[[nodiscard]] virtual bool can_skip_file(
[[maybe_unused]] clp::streaming_archive::MetadataDB::FileIterator const& it
[[maybe_unused]] ::clp::streaming_archive::MetadataDB::FileIterator const& it
) {
return false;
}
Expand Down Expand Up @@ -187,7 +187,7 @@ class ResultsCacheOutputHandler : public OutputHandler {
*/
ErrorCode flush() override;

[[nodiscard]] bool can_skip_file(clp::streaming_archive::MetadataDB::FileIterator const& it
[[nodiscard]] bool can_skip_file(::clp::streaming_archive::MetadataDB::FileIterator const& it
) override {
return is_latest_results_full() && get_smallest_timestamp() > it.get_end_ts();
}
Expand Down
Loading
Loading