Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Re-work connections and connection pool in favor of Node Pool #2188

Merged
merged 5 commits into from
Mar 18, 2024

Conversation

sidz
Copy link
Contributor

@sidz sidz commented Jan 21, 2024

This PR re-works Connection Pool, Connections and Strategies in favor of Node Pool from the official client.

As this package is built on top of official client we have to use Node Pool instead of our custom connections mechanism. What makes the whole package a bit simplier to maintain. In the future this could be re-worked again if needed. We would liek to release new version of this package as fast as possible.

ConnectionPool is replaced with Node Pool

More details can be found via link: https://www.elastic.co/guide/en/elasticsearch/client/php-api/8.12/node_pool.html.

By default Official elasticsearch client builds Simple Node Pool with RoundRobin selector and NoResurrect resurrect.

To override default Node Pool behavior do the following:

use Elastic\Transport\NodePool\Resurrect\ElasticsearchResurrect;
use Elastic\Transport\NodePool\Selector\RoundRobin;
use Elastic\Transport\NodePool\SimpleNodePool;

$nodePool = new SimpleNodePool(
    new RoundRobin(),
    new ElasticsearchResurrect()
);

new Client([
    'hosts' => [
        'https://node1.com:9200',
    ],
    'transport_config' => [
        'node_pool' => $nodePool,
    ],
]);

ClientConfiguration changes

ClientConfiguration class is a bit reworked and the following parameters have been removed:

  • port
  • path
  • url
  • connections in favor of hosts
  • servers in favor of hosts
  • roundRobin

host parameter has been renamed to hosts and it's should be an array of strings.

elastic/transport is updated to 8.8 as since that version each host could be configured with its own username/password credentials.

How to configure authentication per node.

new Client([
    'hosts' => [
        'https://username1:[email protected]:9200',
        'https://username2:[email protected]:9200',
    ],
]);

No longer needed dependecies

  • symfony/deprecation-contracts
  • guzzlehttp/psr7
  • nyholm/dsn

@sidz sidz force-pushed the drop-custom-connection-and-connection-pools branch 3 times, most recently from b5c906a to 60c6502 Compare January 21, 2024 21:13
@ruflin
Copy link
Owner

ruflin commented Jan 22, 2024

I like the simplification / reduced code base this introduces. One thing that would be useful for the discussion is @sidz is some examples on how users would do it in 7.x and how users would do some of the connection / node pool changes in the future. This should also help guide users that need to migrate to the new behaviour.

@ruflin ruflin mentioned this pull request Jan 22, 2024
6 tasks
@sidz
Copy link
Contributor Author

sidz commented Jan 22, 2024

Yeah, I'll try to add some information as part of this request @ruflin.

All what we called connections now is: hosts and as I can see user is not able to configure multiple connections with different username/password or API Keys in the official client or I didn't find how to do it yet so we are not able to do it either.

@pawelkeska
Copy link
Contributor

@sidz @ruflin i left ConnectionPoll from elastica becasue as @sidz writes in elasticsearch-php library we are not able to set multiple connections with diffrent username/password. In the current version on branch 8.x we can do this by set connections like that:

['connections'] => [
    [
       'host' => 'host1',
       'username' => 'example1'
       'password' => 'password1',
       'transport_config' => and here we can set node poll for this connection
    ],
     [
       'host' => 'host2',
       'username' => 'example2'
       'password' => 'password2',
        'transport_config' => and here we can set node poll for this connection
    ],
]

In transport_config key we can set additionally configuration Node Poll for this connection. I think is more flexible for multiple connections with diffrent username/password.

@sidz
Copy link
Contributor Author

sidz commented Jan 22, 2024

Hey @pawelkeska. Thanks for the explanation.

I've looked at what official transport provides and it seems like it is possible to use username and password as part of URI (sense 8.8.0 elastic/elastic-transport-php#22) in the following way:

$builder = TransportBuilder::create();
$build->setHosts([
    'http://user1:[email protected]',
    'http://user2:[email protected]',
]);

What is made life a bit more easier.

Current solution will work either but our Connection is not a fair single connection/node (or maybe I've missed smth) for a case where amounf of hosts more than one.

If you will open Transport class you will see that when we set an array of hosts to the Transport it will set all of them to the NodePool. So NodePool builds so many Nodes as we pass hosts to the Transport and each Node has it's own URI.

When we call sendRequest on our Client: https://github.com/ruflin/Elastica/blob/8.x/src/Client.php#L644 we will call Transports sendRequest method which works with its NodePool inside of the transport.

And Transport will try with all Nodes defined in the NodePool in the context of our one Connection. So here we have a case when our one Connection works with all Nodes/Hosts.

Ping me if it is not clear. I'll try to give more examples or re-phrase.

@pawelkeska
Copy link
Contributor

@sidz Thanks for finding a solution. I haven't seen this PR elastic/elastic-transport-php#22 before and this is good information for us. In this case we could remove our connection pool.

Yes, I agree with you, now is not a fair single connection/node and within one our connection pool we could set many hosts which build so many Nodes as we pass hosts to the Transport and each Node has it's own URI.

If we rebuild this and remove our connection poll is the best solution because we will rely on elastic-transport-php only.

Could you please make corrections in tests and prepare it for merging ?

@sidz
Copy link
Contributor Author

sidz commented Jan 23, 2024

This PR requires a bit more refactoring. I'll try to do it ASAP @pawelkeska.

@pawelkeska
Copy link
Contributor

pawelkeska commented Jan 23, 2024

@sidz Thank you for your commitment, it will help us a lot.

@ruflin
Copy link
Owner

ruflin commented Jan 23, 2024

Great conversation, not really much to add on my end. My only request is that in the end we have some notes / quicks on how Elastica users can migrate to the new mehtods.

@sidz sidz force-pushed the drop-custom-connection-and-connection-pools branch 3 times, most recently from 119af59 to a0e63b3 Compare January 25, 2024 21:09
@sidz sidz force-pushed the drop-custom-connection-and-connection-pools branch from a0e63b3 to 00b376d Compare January 25, 2024 21:14
sidz added 2 commits January 27, 2024 22:02
…esn't support PSR-17 factories but Elastic\Transport\NodePool\Node rely on psr17 factory
"elasticsearch/elasticsearch": "^8.4.1",
"guzzlehttp/psr7": "^2.0",
"nyholm/dsn": "^2.0.0",
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No longer needed as we rely on elastic/transport.

"elasticsearch/elasticsearch": "^8.4.1",
"guzzlehttp/psr7": "^2.0",
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

no longer used directly

@@ -17,12 +17,9 @@
"require": {
"php": "~8.0.0 || ~8.1.0 || ~8.2.0",
"ext-json": "*",
"elastic/transport": "^8.4",
"elastic/transport": "^8.8",
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Since 8.8 elastic/transport supports basic authentication per node. see elastic/elastic-transport-php#22

Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I assume even if we use 8.8, Elastica will still be compatible with older versions of Elasticsearch 8.x

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes

@@ -31,9 +28,11 @@
"phpunit/phpunit": "^9.5",
"symfony/phpunit-bridge": "^6.0"
},
"conflict": {
"guzzlehttp/psr7": "<2.0.0"
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We don't have direct dependency to guzzlehttp/psr7 but without this hack --prefer-lowest installs 1.4.0 which doesn't have psr/http-factory and official transport is not able to establish connection (see https://github.com/elastic/elastic-transport-php/blob/main/src/NodePool/Node.php#L33)

Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I wonder if there is a way we could add a note around this to the code somewhere. I'm worried someone comes along eventually and removes it and we don't see the side effects immediately. The good news is, that now if someone goes back to where it was introduced, they see these comments.

use Elastic\Transport\NodePool\Node;
use Elastic\Transport\NodePool\SimpleNodePool;

final class TraceableSimpleNodePool extends SimpleNodePool
Copy link
Contributor Author

@sidz sidz Jan 27, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For testing purposes as original node pool doesn't have getNodes method so we couldn't check different cases in our tests

@sidz sidz force-pushed the drop-custom-connection-and-connection-pools branch from ec08c3a to 6b5afdf Compare January 27, 2024 19:56
@sidz sidz force-pushed the drop-custom-connection-and-connection-pools branch from 6b5afdf to cdcf049 Compare January 27, 2024 19:59
@sidz sidz marked this pull request as ready for review January 27, 2024 20:11
@sidz
Copy link
Contributor Author

sidz commented Jan 27, 2024

Ready for review @ruflin, @pawelkeska.

While I worked on this PR I started to think that probably we need our own ClientBuilder as a replacement for ClientConfiguration.

Copy link
Owner

@ruflin ruflin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Overall change LGTM, it simplifies the code.

My main concern is around users migration from 7.x to 8.x. This PR touches the configs that likely every users uses which are the client configs. As you describe, port, path, connections etc. go away / are replaced. To simplify migration, could we keep some of these configs but convert these automatically for users? If services or connectiosn exists, we convert it to hosts. Same if a a port exist, we add it all hosts but log a deprecation warning and let the user know, they should adjust to hosts? This change could also be done as a follow up.

@@ -104,25 +104,6 @@ public function testConfigValue(): void
$this->assertIsArray($client->getConfigValue(['level1', 'level2']));
}

public function testAddHeader(): void
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we keep this test but adjust it to how users will add headers in the future?

@@ -17,12 +17,9 @@
"require": {
"php": "~8.0.0 || ~8.1.0 || ~8.2.0",
"ext-json": "*",
"elastic/transport": "^8.4",
"elastic/transport": "^8.8",
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I assume even if we use 8.8, Elastica will still be compatible with older versions of Elasticsearch 8.x

if (!$document->hasRetryOnConflict() && $this->_client->hasConnection() && $this->_client->getConnection()->hasParam('retryOnConflict') && ($retry = $this->_client->getConnection()->getParam('retryOnConflict')) > 0) {
$document->setRetryOnConflict($retry);
if (!$document->hasRetryOnConflict()) {
$retry = $this->_client->getConfigValue('retryOnConflict', 0);
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this directly related to the change? Maybe make it a quick second PR if not?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes. because Client class doesn't have getConnection method so I had to find another way to get configuration value

@sidz
Copy link
Contributor Author

sidz commented Feb 10, 2024

To simplify migration, could we keep some of these configs but convert these automatically for users? If services or connectiosn exists, we convert it to hosts. Same if a a port exist, we add it all hosts but log a deprecation warning and let the user know, they should adjust to hosts?

IMO we need to amend previous version an add deprecation message in the previous (7.x) version instead of in the newest 8. Does it make sense @ruflin ?

@ruflin
Copy link
Owner

ruflin commented Feb 12, 2024

IMO we need to amend previous version an add deprecation message in the previous (7.x) version instead of in the newest 8.

In theory I agree. But I think the changes fall into 2 categories:

  • Changes that can be done in 7.x: We can add a deprecation message and users can already migrate to the new behaviour
  • Changes that users can't apply in 7.x yet: To make it available in 7.x, we would have to make significant changes to 7.x

I don't have a good understanding on which changes fall where. What can we easily add deprecation in 7.x and make migrate users there, lets do it there. For the ones we can't, we force users to make the changes on upgrade or for the low hanging fruits, lets make changes in 8.x. We can also wait with changes in 8.x and see what stands out for users during the migration and make it easier over time. But especially for configs which 100% of the Elastica users likely have, I think we need to do something.

@sidz
Copy link
Contributor Author

sidz commented Feb 18, 2024

But especially for configs which 100% of the Elastica users likely have, I think we need to do something.

Migration guide should work fine I think.

@ruflin
Copy link
Owner

ruflin commented Feb 19, 2024

Lets get it in and work from there. We can always add things. @pawelkeska would be great to get a second eye on this. @sidz From your end, ready to get in?

@sidz
Copy link
Contributor Author

sidz commented Feb 19, 2024

@ruflin yes. ready for review

@ruflin
Copy link
Owner

ruflin commented Mar 4, 2024

@sidz Did not forget about the review but was out / busy the last two weeks. Try to get to it this week. Would be great to get some additional reviews from others like @pawelkeska @thePanz etc.

@@ -31,9 +28,11 @@
"phpunit/phpunit": "^9.5",
"symfony/phpunit-bridge": "^6.0"
},
"conflict": {
"guzzlehttp/psr7": "<2.0.0"
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I wonder if there is a way we could add a note around this to the code somewhere. I'm worried someone comes along eventually and removes it and we don't see the side effects immediately. The good news is, that now if someone goes back to where it was introduced, they see these comments.

}

protected function _getHost(): string
{
return \getenv('ES_HOST') ?: Connection::DEFAULT_HOST;
return \getenv('ES_HOST') ?: 'localhost';
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It would be nice to keep some constants but not sure where to move it. Same below.

@ruflin
Copy link
Owner

ruflin commented Mar 18, 2024

Hi @sidz , it took me much longer to get to the review then it should have. Sorry about this.

I must confess, merging this PR was harder then I expected (not because of code reasons). I'm excited about the removal of all the connection code as I will simplify the maintenance of this library. Removing connection was always one of the goals why the dependency on the official client was added and finally we are here. The changes that keep bugging me are the ones where it is breaking for most users but the removal of connections requires it. How will users get to 8 in an easy way?

The answer for me is, the faster we get this in, the more feedback we get and the fewer users already jumped into 8.x and have these changes from day 1. I expect there will be more changes coming to 8.x and I'm looking forward to it together with the community to finally fully support 8.x

@sidz Thank you so much for hanging in there and keep pushing this forward. Will merge this PR now and we will keep iterating on top of it.

@ruflin ruflin merged commit b04f280 into ruflin:8.x Mar 18, 2024
14 of 15 checks passed
@sidz sidz deleted the drop-custom-connection-and-connection-pools branch March 18, 2024 09:04
@sidz
Copy link
Contributor Author

sidz commented Mar 18, 2024

How will users get to 8 in an easy way?

We could add UPGRADE.MD like Symfony does and provide all steps which users should do in order to migrate from 7 to 8

@ruflin
Copy link
Owner

ruflin commented Mar 18, 2024

We could add UPGRADE.MD like Symfony does and provide all steps which users should do in order to migrate from 7 to 8

@sidz ++

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants